Amazon Web Services and NVIDIA Team Up to Accelerate AI Infrastructure Deployment
As demand for artificial intelligence (AI) continues to surge, major cloud service providers, often referred to as hyperscalers, are seeking ways to accelerate the deployment of specialized AI infrastructure. In a significant move announced at AWS re:Invent, Amazon Web Services (AWS) has partnered with NVIDIA to adopt NVIDIA NVLink Fusion. This rack-scale platform lets companies build custom AI infrastructure on NVIDIA NVLink scale-up interconnect technology and a broad partner ecosystem, easing the deployment of AWS's latest Trainium4 AI chips, Graviton CPUs, Elastic Fabric Adapters (EFAs), and the Nitro System virtualization infrastructure.
The collaboration marks the beginning of a multigenerational effort between NVIDIA and AWS, with the design of Trainium4 aimed at compatibility with NVLink 6 and the NVIDIA MGX rack architecture. This initiative is expected to streamline the development of next-generation AI infrastructure.
NVLink Fusion is designed to alleviate several challenges hyperscalers face when deploying custom AI silicon. As AI workloads become increasingly large and complex, the pressure to rapidly implement compute infrastructure that meets the evolving market demands intensifies. New workloads, such as planning, reasoning, and agentic AI, require systems capable of processing hundreds of billions to trillions of parameters, necessitating numerous accelerators working in parallel within a unified fabric.
The integration of a scale-up network, such as NVLink, is crucial for connecting entire racks of accelerators with high bandwidth and low latency. However, hyperscalers encounter significant hurdles in deploying specialized solutions, including long development cycles and the complexities of managing intricate supplier ecosystems. Designing custom AI chips requires additional networking solutions and thorough architectural planning, potentially costing billions and extending deployment timelines over several years. Furthermore, coordinating with a diverse supplier network to source the necessary components poses considerable logistical challenges.
NVLink Fusion aims to address these issues by enhancing performance, minimizing deployment risks, and accelerating time to market for custom AI silicon. The NVLink Fusion chiplet allows hyperscalers to integrate directly with NVLink scale-up networking, connecting up to 72 custom ASICs at 3.6 TB/s per ASIC for a total of roughly 260 TB/s of scale-up bandwidth. This capability accelerates data movement across the rack, a critical requirement for large-scale AI workloads.
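The aggregate figure follows directly from the quoted per-ASIC numbers; a quick back-of-the-envelope check (illustrative arithmetic only, using the values stated above):

```python
# Aggregate scale-up bandwidth for one NVLink Fusion rack,
# using the per-ASIC figures quoted in the announcement.
ASICS_PER_RACK = 72       # custom ASICs in one scale-up domain
BW_PER_ASIC_TBPS = 3.6    # NVLink bandwidth per ASIC, in TB/s

aggregate_tbps = ASICS_PER_RACK * BW_PER_ASIC_TBPS
print(f"Aggregate scale-up bandwidth: {aggregate_tbps:.1f} TB/s")
# 72 x 3.6 = 259.2 TB/s, which the announcement rounds to 260 TB/s
```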
Central to NVLink Fusion is the NVLink Switch, which permits peer-to-peer memory access through direct loads, stores, and atomic operations, bolstered by NVIDIA’s Scalable Hierarchical Aggregation and Reduction Protocol (SHARP). This technology is instrumental for in-network reductions and multicast acceleration, providing a competitive edge in the AI landscape. Compared to alternative scale-up networking methods, NVLink has established itself as a proven, widely adopted solution that delivers significant performance improvements in AI inference.
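The benefit of in-network reduction is that the fabric itself performs the aggregation: each accelerator sends its partial result once and receives the combined result once, rather than exchanging data with peers over multiple steps. The toy model below is a conceptual sketch only, not NVIDIA's SHARP implementation; the function name and data layout are invented for illustration.

```python
# Conceptual sketch of a SHARP-style in-network reduction: the switch
# sums contributions from all endpoints (the reduction happens "in the
# fabric") and multicasts the result back, so each endpoint performs
# exactly one send and one receive.

def switch_allreduce(endpoint_grads):
    """Model a switch that reduces partial gradients and multicasts the sum."""
    total = [sum(vals) for vals in zip(*endpoint_grads)]  # elementwise sum in-switch
    return [total[:] for _ in endpoint_grads]             # multicast to every endpoint

# Four endpoints, each contributing a partial gradient vector.
grads = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0], [7.0, 8.0]]
print(switch_allreduce(grads))  # every endpoint receives [16.0, 20.0]
```

In a real fabric the reduction is performed in switch hardware on in-flight packets; the point of the sketch is the communication pattern, which cuts the per-endpoint traffic compared with software allreduce schemes that move data through the endpoints themselves.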
NVLink Fusion also provides a modular portfolio of AI factory technology, including NVIDIA MGX rack architecture, GPUs, NVIDIA Vera CPUs, and various networking components, along with an ecosystem of ASIC designers and manufacturers. This comprehensive technology suite enables hyperscalers to significantly reduce development costs and time to market. By leveraging the NVLink Fusion infrastructure, AWS can harness a vast supply chain for full rack-scale deployment, mitigating the risks associated with custom AI infrastructure projects.
In addition to performance enhancements, NVLink Fusion supports the development of heterogeneous silicon, allowing AWS to use the same physical footprint for a range of AI solutions and thereby improve operational efficiency. Hyperscalers can deploy as much or as little of the NVLink Fusion platform as needed, enabling rapid scaling to meet the demands of intensive AI workloads.
As AWS integrates NVLink Fusion into its strategy for deploying Trainium4, the collaboration with NVIDIA is set to propel innovation cycles and expedite the delivery of advanced AI capabilities. This partnership not only underscores the growing emphasis on AI infrastructure but also highlights the broader implications for industries aiming to leverage AI technologies. By lowering barriers to entry and accelerating deployment timelines, NVLink Fusion is poised to foster significant advancements in the AI landscape.
For further information, visit AWS and NVIDIA.