MediaTek Deploys NVIDIA DGX SuperPOD to Process 60B Tokens Monthly for AI Development

MediaTek uses an NVIDIA DGX SuperPOD to process about 60 billion inference tokens per month and to run thousands of model-training iterations.

MediaTek's deployment of the NVIDIA DGX SuperPOD has reshaped the company's AI development lifecycle. The high-performance computing system handles large, continuous AI workloads, reflecting the growing demands of modern AI applications. “Our AI factory, powered by DGX SuperPOD, processes approximately 60 billion tokens per month for inference and completes thousands of model-training iterations every month,” said David Ku, Co-COO and CFO at MediaTek.
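To put the monthly figure in perspective, the short Python sketch below converts it into an average per-second rate. The 30-day month and the assumption of perfectly even load are simplifications introduced here, not figures from MediaTek; real traffic would have peaks well above the average.

```python
SECONDS_PER_DAY = 24 * 60 * 60


def average_tokens_per_second(tokens_per_month: float, days_in_month: int = 30) -> float:
    """Average sustained token rate, assuming load is spread evenly over the month."""
    return tokens_per_month / (days_in_month * SECONDS_PER_DAY)


# 60 billion tokens per month is the figure quoted by MediaTek;
# the 30-day month is an assumption made for this estimate.
rate = average_tokens_per_second(60e9)
print(f"{rate:,.0f} tokens/s on average")  # about 23,148 tokens/s
```

Under these assumptions, the quoted volume works out to roughly 23,000 tokens every second, around the clock.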

Model inference, especially with cutting-edge large language models (LLMs), requires loading the entire model into GPU memory. Because these models can contain hundreds of billions of parameters, they often exceed the memory of a single GPU, and sometimes of a single server, so they must be partitioned across multiple GPUs. The DGX SuperPOD, consisting of tightly coupled DGX systems and high-performance NVIDIA networking, is designed to deliver the ultra-fast, coordinated GPU memory and compute power needed for training and inference on the largest AI workloads.
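A rough back-of-the-envelope calculation shows why such models need to be split. The sketch below estimates the memory needed for model weights alone and the minimum number of GPUs required to hold them. Every number in it is an illustrative assumption rather than a MediaTek figure: 16-bit weights (2 bytes per parameter), 80 GB of memory per GPU, 90% of that memory usable for weights, and a hypothetical 400-billion-parameter model. It also ignores activations and the key-value cache, which add further memory pressure in practice.

```python
import math


def weight_memory_gb(num_params: float, bytes_per_param: int = 2) -> float:
    """Memory needed for the weights only, in GB (1 GB = 1e9 bytes)."""
    return num_params * bytes_per_param / 1e9


def min_gpus_for_weights(
    num_params: float,
    gpu_memory_gb: float = 80.0,   # assumed per-GPU memory
    bytes_per_param: int = 2,      # assumed 16-bit weights
    usable_fraction: float = 0.9,  # assumed share of memory available for weights
) -> int:
    """Smallest number of GPUs whose combined usable memory holds the weights."""
    usable_gb = gpu_memory_gb * usable_fraction
    return math.ceil(weight_memory_gb(num_params, bytes_per_param) / usable_gb)


# Hypothetical 400-billion-parameter model.
params = 400e9
print(weight_memory_gb(params))      # 800.0 GB of weights
print(min_gpus_for_weights(params))  # 12 GPUs under the assumptions above
```

With eight GPUs per server (another assumption), twelve GPUs already means spreading the model over two servers, which is where fast interconnects between systems start to matter.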

According to Ku, “The DGX SuperPOD is indispensable for our inference workloads. It allows us to deploy and run massive models that wouldn’t fit on a single GPU or even a single server, ensuring we achieve the best performance and accuracy for our most demanding AI applications.” MediaTek leverages these large models not only for core research and development but also for a centralized, high-demand API. The company subsequently distills smaller versions for specific edge or mobile applications, ensuring optimal performance and accuracy across its offerings.
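The article does not say which distillation method MediaTek uses. As a minimal sketch of one common approach, soft-target knowledge distillation, the Python below computes a loss that pushes a small "student" model's output distribution toward that of a large "teacher" model. The function names, the temperature value, and the example logits are all hypothetical and chosen only for illustration.

```python
import math


def softmax(logits, temperature=1.0):
    """Convert logits to probabilities; higher temperature gives a softer distribution."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL(teacher || student) on temperature-softened outputs, scaled by T^2."""
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return kl * temperature ** 2


# Hypothetical logits over a three-token vocabulary.
teacher = [4.0, 1.0, 0.5]
student = [2.5, 1.5, 1.0]
print(distillation_loss(teacher, student))  # positive: the student differs from the teacher
print(distillation_loss(teacher, teacher))  # 0.0: identical outputs give zero loss
```

In a real training loop this term would typically be combined with the ordinary task loss and minimized with respect to the student's parameters.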

With the DGX platform, MediaTek has streamlined its product development pipeline by integrating AI agents into research and development workflows. One notable application is AI-assisted code completion, which has significantly reduced both programming time and error rates. In chip design, an AI agent built on domain-adapted LLMs helps engineers understand, analyze, and optimize designs by extracting information from design flowcharts and state diagrams. As a result, technical documentation that previously took weeks can now be produced in days.

In addition, MediaTek utilizes NVIDIA NeMo™, a software suite designed for building, training, and deploying large language models, to fine-tune these models. This ensures both optimal performance and domain-specific accuracy, further enhancing the company’s capabilities in AI development. The shift toward such technologies underscores a broader trend within the tech industry, where companies are increasingly reliant on advanced AI systems to maintain competitiveness and innovate rapidly in their respective fields.

As AI applications continue to expand, the role of powerful computing solutions like the NVIDIA DGX SuperPOD is expected to grow even more critical. MediaTek’s successful integration of these technologies exemplifies how leading tech firms are adapting to the demands of modern AI workloads. The company’s strategy not only emphasizes efficiency but also positions it to capitalize on the evolving landscape of artificial intelligence, ensuring that it remains at the forefront of innovation in this rapidly changing sector.

Written by AiPressa Staff

© 2025 AIPressa · Part of Buzzora Media · All rights reserved. This website provides general news and educational content for informational purposes only. While we strive for accuracy, we do not guarantee the completeness or reliability of the information presented. The content should not be considered professional advice of any kind. Readers are encouraged to verify facts and consult appropriate experts when needed. We are not responsible for any loss or inconvenience resulting from the use of information on this site. Some images used on this website are generated with artificial intelligence and are illustrative in nature. They may not accurately represent the products, people, or events described in the articles.