AI Technology

MediaTek Deploys NVIDIA DGX SuperPOD to Process 60B Tokens Monthly for AI Development

MediaTek uses an NVIDIA DGX SuperPOD to process about 60 billion inference tokens a month and to complete thousands of model-training iterations.

Staff

Published

3 hours ago

The NVIDIA DGX SuperPOD has reshaped the AI development lifecycle at MediaTek, which relies on the high-performance computing system to run large, continuous AI workloads. “Our AI factory, powered by DGX SuperPOD, processes approximately 60 billion tokens per month for inference and completes thousands of model-training iterations every month,” said David Ku, Co-COO and CFO at MediaTek.
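For a sense of scale, the monthly figure Ku cites can be converted to an average sustained rate. The short Python sketch below does that arithmetic. The 30-day month and the function name are assumptions made for illustration, and real traffic would likely peak well above the average.

```python
SECONDS_PER_DAY = 24 * 60 * 60


def average_tokens_per_second(tokens_per_month: float, days_in_month: int = 30) -> float:
    """Average sustained rate implied by a monthly token count.

    days_in_month=30 is an assumption; the article gives only a monthly total.
    """
    return tokens_per_month / (days_in_month * SECONDS_PER_DAY)


if __name__ == "__main__":
    rate = average_tokens_per_second(60e9)  # 60 billion tokens per month
    print(f"{rate:,.0f} tokens/s on average")  # prints "23,148 tokens/s on average"
```

In other words, 60 billion tokens a month works out to roughly 23,000 tokens every second, around the clock.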

Inference with cutting-edge large language models (LLMs) requires loading the entire model into GPU memory. Models with hundreds of billions of parameters often exceed the memory of a single GPU, or even a single GPU server, and must be partitioned across multiple GPUs. The DGX SuperPOD, which combines tightly coupled DGX systems with high-performance NVIDIA networking, is designed to deliver the fast, coordinated GPU memory and compute needed for training and inference on the largest AI workloads.
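A rough back-of-the-envelope calculation shows why such models spill over a single server. The Python sketch below estimates the memory needed for model weights alone. Every number in it is an illustrative assumption, not a figure from MediaTek or NVIDIA: a hypothetical 400-billion-parameter model, 2 bytes per parameter (FP16/BF16), 80 GB of memory per GPU, and 8 GPUs per server. It also ignores the KV cache and activations, which add to the total in practice.

```python
import math


def weight_memory_gb(num_params: float, bytes_per_param: int = 2) -> float:
    """Memory for the weights alone, in GB (10^9 bytes).

    bytes_per_param=2 assumes FP16/BF16 storage.
    """
    return num_params * bytes_per_param / 1e9


def min_gpus_for_weights(num_params: float, gpu_memory_gb: float,
                         bytes_per_param: int = 2) -> int:
    """Lower bound on GPU count: weights only, no KV cache or activations."""
    return math.ceil(weight_memory_gb(num_params, bytes_per_param) / gpu_memory_gb)


if __name__ == "__main__":
    params = 400e9         # hypothetical model size
    gpu_gb = 80            # assumed memory per GPU
    gpus_per_server = 8    # assumed GPUs per server

    needed = min_gpus_for_weights(params, gpu_gb)
    print(f"{weight_memory_gb(params):.0f} GB of weights, at least {needed} GPUs")
    print("fits in one server:", needed <= gpus_per_server)
```

Under these assumptions the weights alone take 800 GB and need at least 10 GPUs, which is more than the assumed 8-GPU server holds, so the model has to be split across servers connected by a fast network.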

According to Ku, “The DGX SuperPOD is indispensable for our inference workloads. It allows us to deploy and run massive models that wouldn’t fit on a single GPU or even a single server, ensuring we achieve the best performance and accuracy for our most demanding AI applications.” MediaTek uses these large models for core research and development and to serve a centralized, high-demand API. It then distills smaller versions for specific edge and mobile applications, aiming to preserve performance and accuracy across its offerings.
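The article does not say which distillation method MediaTek uses. As a generic illustration only, the Python sketch below shows one common formulation, soft-target knowledge distillation, in which a small "student" model is trained to match the temperature-softened output distribution of a large "teacher" model. The function names, the example logits, and the temperature of 2.0 are all assumptions for illustration.

```python
import math


def softmax(logits, temperature=1.0):
    """Numerically stable softmax over temperature-scaled logits."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL(teacher || student) on softened distributions, scaled by T^2.

    A higher temperature flattens both distributions, so the student also
    learns from the teacher's relative ranking of unlikely outputs.
    """
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return temperature ** 2 * kl


if __name__ == "__main__":
    teacher = [4.0, 1.0, 0.5]  # made-up logits from a large model
    student = [2.5, 1.5, 1.0]  # made-up logits from a small model
    print(f"distillation loss: {distillation_loss(teacher, student):.4f}")
```

The loss is zero when the student reproduces the teacher's distribution exactly and grows as the two diverge. In a real training loop it would typically be combined with an ordinary task loss and minimized with a deep learning framework.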

With the DGX platform, MediaTek has streamlined its product development pipeline by integrating AI agents into research and development workflows. One application is AI-assisted code completion, which has reduced both programming time and error rates. Another is an AI agent built on domain-adapted LLMs that helps engineers understand, analyze, and optimize chip designs by extracting information from design flowcharts and state diagrams. As a result, technical documentation that previously took weeks can now be produced in days.

MediaTek also uses NVIDIA NeMo™, a software suite for building, training, and deploying large language models, to fine-tune these models for performance and domain-specific accuracy. The shift reflects a broader trend in the tech industry, where companies increasingly rely on advanced AI systems to stay competitive and innovate quickly.

As AI applications expand, high-performance computing systems such as the NVIDIA DGX SuperPOD are expected to become even more critical. MediaTek’s deployment shows how leading tech firms are adapting to the demands of modern AI workloads, with a strategy that emphasizes efficiency and positions the company to keep pace with a rapidly changing sector.

AIPRESSA.COM