AI Technology

Meta Unveils Four New MTIA Chips with 25x Compute Gains for AI Inference

Meta unveils four new MTIA chips, claiming up to a 25x compute gain from the MTIA 300 to the MTIA 500, as it builds out in-house AI inference hardware and reduces its reliance on Nvidia.

Staff

Published

12 March, 2026

Meta announced four new generations of its in-house chip, the Meta Training and Inference Accelerator (MTIA), designed in collaboration with Broadcom. The chips are slated for deployment over the next two years. “We’ve developed a competitive strategy for MTIA by prioritizing rapid, iterative development,” Meta stated in its press release, which also highlights an inference-first focus and easier adoption through native support for industry standards.

The four models unveiled are the MTIA 300, 400, 450, and 500. The MTIA 300 is already in production for ranking and recommendations training, while the MTIA 400 is undergoing lab testing prior to its data center deployment. The MTIA 450 and 500 are positioned for AI inference, with mass deployment expected in early and late 2027, respectively. Meta’s technical blog outlines significant advancements across these models, including a 4.5 times increase in HBM bandwidth and a 25 times uplift in compute FLOPs from MTIA 300 to MTIA 500.

Meta asserts that the MTIA 450 doubles the HBM bandwidth of the MTIA 400, and claims that this exceeds existing top-tier commercial products, including Nvidia’s H100 and H200. The MTIA 500 is expected to add a further 50% in HBM bandwidth over the MTIA 450, alongside up to 80% more HBM capacity. The emphasis on bandwidth matters because Meta identifies HBM bandwidth as the chief bottleneck during the decode phase of transformer inference. Current mainstream GPUs are designed to maximize FLOPs for large-scale pre-training, and Meta argues that the resulting cost and power overhead is superfluous for inference tasks.
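A back-of-envelope bound shows why bandwidth dominates decode. If each generated token requires streaming all model weights from HBM once, a single sequence cannot decode faster than bandwidth divided by weight size. The sketch below applies that bound to the HBM bandwidth figures reported in this article. The 70-billion-parameter model and the one-byte weight width are hypothetical illustrations, not Meta figures, and the bound ignores KV-cache traffic, batching, and sharding across chips.

```python
# Upper bound on single-sequence decode throughput when HBM bandwidth is the
# only limit. Assumption: every generated token streams all weights from HBM
# exactly once; KV-cache reads, batching, and multi-chip sharding are ignored.

TB = 1e12  # bytes per (decimal) terabyte


def max_decode_tokens_per_s(hbm_tb_per_s: float, params: float,
                            bytes_per_param: float) -> float:
    """Return bandwidth / weight_bytes, in tokens per second."""
    weight_bytes = params * bytes_per_param
    return hbm_tb_per_s * TB / weight_bytes


# HBM bandwidth in TB/s, as reported in the article.
HBM_TB_PER_S = {"MTIA 400": 9.2, "MTIA 450": 18.4, "MTIA 500": 27.6}

PARAMS = 70e9          # hypothetical model size, not from Meta
BYTES_PER_PARAM = 1.0  # hypothetical 8-bit weights

bounds = {
    chip: max_decode_tokens_per_s(bw, PARAMS, BYTES_PER_PARAM)
    for chip, bw in HBM_TB_PER_S.items()
}

for chip, bound in bounds.items():
    print(f"{chip}: at most ~{bound:.0f} tokens/s per sequence")
```

Under these assumptions the bound scales linearly with bandwidth, so doubling HBM bandwidth doubles the ceiling regardless of how many FLOPs the chip offers.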

Power and memory bandwidth rise with each generation. The MTIA 300 focuses on ranking and recommendations, with a thermal design power (TDP) of 800 W and an HBM bandwidth of 6.1 TB/s. The MTIA 400 has a TDP of 1,200 W and an HBM bandwidth of 9.2 TB/s, while the MTIA 450 and 500 have TDPs of 1,400 W and 1,700 W, respectively, with HBM bandwidths of 18.4 TB/s and 27.6 TB/s. Peak compute also scales, with the MTIA 500 reaching up to 30 PFLOPS.
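The quoted multipliers can be checked against the per-chip figures with a few lines of arithmetic. The sketch below uses only the TDP and HBM bandwidth numbers given in this article. The bandwidth-per-watt figure is a derived illustration rather than a metric Meta publishes, and it divides by whole-chip TDP, not memory power.

```python
# Per-chip figures as reported in the article.
SPECS = {
    "MTIA 300": {"tdp_w": 800, "hbm_tb_s": 6.1},
    "MTIA 400": {"tdp_w": 1200, "hbm_tb_s": 9.2},
    "MTIA 450": {"tdp_w": 1400, "hbm_tb_s": 18.4},
    "MTIA 500": {"tdp_w": 1700, "hbm_tb_s": 27.6},
}


def ratio(newer: str, older: str, key: str = "hbm_tb_s") -> float:
    """Ratio of one spec between two chip generations."""
    return SPECS[newer][key] / SPECS[older][key]


def gb_per_s_per_watt(chip: str) -> float:
    """Derived metric: HBM bandwidth (GB/s) per watt of chip TDP."""
    return SPECS[chip]["hbm_tb_s"] * 1000 / SPECS[chip]["tdp_w"]


print(f"450 vs 400 bandwidth: {ratio('MTIA 450', 'MTIA 400'):.2f}x")
print(f"500 vs 450 bandwidth: {ratio('MTIA 500', 'MTIA 450'):.2f}x")
print(f"500 vs 300 bandwidth: {ratio('MTIA 500', 'MTIA 300'):.2f}x")
for chip in SPECS:
    print(f"{chip}: {gb_per_s_per_watt(chip):.1f} GB/s per watt")
```

The ratios come out consistent with the 2x, 50%, and roughly 4.5x claims, and bandwidth grows faster than TDP across the four generations.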

Meta’s design also includes hardware acceleration for FlashAttention and for mixture-of-experts feed-forward network computations, alongside custom low-precision data types designed specifically for inference. The MTIA 450 supports the MX4 format, delivering six times as many MX4 FLOPs as FP16/BF16 FLOPs, and reduces the software overhead associated with data type conversion.

In terms of deployment, the MTIA 400, 450, and 500 will utilize a common chassis, rack, and network infrastructure. This modularity allows for easier interchangeability of chip generations, contributing to an accelerated chip cadence of approximately six months, outpacing the industry’s typical one- to two-year cycles. The software stack for these chips is compatible with popular frameworks such as PyTorch, vLLM, and Triton, permitting simultaneous deployment of production models on both GPUs and MTIA without the need for MTIA-specific adjustments.

Meta has already deployed hundreds of thousands of MTIA chips across its applications for inference on organic content and advertisements. The announcement comes shortly after the company revealed a long-term, $100 billion AI infrastructure partnership with AMD. Together, the two moves suggest a broader strategy to reduce reliance on Nvidia across various components of Meta’s AI ecosystem while keeping MTIA chips as the foundation for its inference workloads.
