AI Technology

AWS Launches Tranium 3 AI Chip Promising 50% Cost Savings Over Nvidia GPUs

AWS unveils Tranium 3 AI chip, boasting 4x performance increase and 50% cost savings over Nvidia GPUs, signaling a shift in AI computing efficiency.

Staff

Published

3 December, 2025

Following Google, Amazon Web Services (AWS) has launched its own energy-efficient artificial intelligence (AI) chip, marking a potential shift in a market long dominated by Nvidia. As demand for training massive AI models surges, rising costs, power consumption, and supply chain constraints have prompted companies to develop their own computing architectures. Analysts are closely monitoring whether these in-house AI chips, optimized for performance per watt, can effectively challenge Nvidia’s longstanding dominance.

At its annual “AWS re:Invent 2025” event in Las Vegas on December 2, AWS officially unveiled its custom AI chip, Tranium 3. The company showcased ultra servers capable of accommodating up to 144 Tranium 3 chips, which are available for immediate deployment. AWS claims that Tranium 3 delivers four times the computational performance of its previous-generation chips while consuming 40 percent less power. The company further noted that utilizing Tranium 3 could reduce AI model training and operational costs by up to 50 percent compared to systems using comparable graphics processing units (GPUs). During his keynote, AWS CEO Matt Garman emphasized that Tranium 3 offers the industry’s best cost efficiency for AI training and inference.

Google’s custom-developed tensor processing units (TPUs) mirror these advantages, featuring low power consumption and reduced operational costs. The TPUs powered the training and deployment of Google’s recently unveiled AI model, Gemini 3, developed in collaboration with U.S. semiconductor fabless company Broadcom. AI startup Anthropic plans to utilize up to one million TPUs for model development, while Meta is reportedly adopting Google TPUs within its own data centers. OpenAI is also collaborating with Broadcom to co-develop custom AI chips for training and operating its models, including ChatGPT.

The primary motivation behind tech giants developing custom AI chips is to secure a stable supply and mitigate costs. Nvidia GPUs, which are capable of processing massive amounts of data simultaneously, are vital to the AI ecosystem but remain chronically scarce, even for well-funded companies. As global AI investment escalates, Nvidia, the first company to market GPUs, has solidified its status as the dominant player in the field, controlling approximately 90 percent of the GPU-based AI chip market.

Each GPU is priced between $30,000 and $40,000, or about 44 million to 59 million won. When energy expenses are factored in, companies have concluded that purpose-built chips optimized for specific computations are more efficient over the long term. The distinct requirements of individual companies also drive the development of custom AI chips. AWS needs chips tailored for cloud services, while Google seeks processors specifically for training large language models such as Gemini. Although general-purpose GPUs can handle most computations, chips designed for a company’s unique model architecture can perform similar tasks using less power.

Despite these advancements, industry analysts caution that Nvidia’s dominance is unlikely to be challenged in the immediate future. The global AI research and development ecosystem is heavily centered on Nvidia GPUs and its CUDA software platform. Given the scale of existing infrastructure investments and the potential costs associated with switching, companies are not expected to replace Nvidia hardware soon. Nvidia has acknowledged Google’s strides in AI but maintains that its products remain a generation ahead of competitors.

As the competitive landscape intensifies, the move by major tech firms to develop proprietary AI chips reflects a broader trend aimed at enhancing efficiency and reducing dependency on a singular supplier. This trend not only signals a potential reshaping of the AI market but also underscores the increasing importance of customized solutions tailored to specific operational needs. The ongoing evolution of AI technology and infrastructure will be instrumental in determining whether companies can effectively challenge Nvidia’s supremacy in the near future.

AI Technology

New Report Reveals 74% of Big Tech’s AI Climate Claims Are Unproven, Exposing Greenwashing

A new report reveals that 74% of climate claims by tech giants like Google and Microsoft lack evidence, highlighting serious environmental costs of AI...

Staff7 hours ago

AI Impact Summit Set to Unlock ₹8 Lakh Crore Investments, Position India as Global Tech Leader

AI Impact Summit in India aims to unlock ₹8 lakh crore in investments, gathering leaders like Bill Gates and Sundar Pichai to shape global...

Staff9 hours ago

AI Education

UGA Launches $800K AI Pilot Program for Students, Access to ChatGPT Edu and Gemini Pro

UGA invests $800,000 to launch a pilot program providing students access to premium AI tools like ChatGPT Edu and Gemini Pro starting spring 2026.

David Park11 hours ago

Runway Secures $315 Million in Series E Funding, Valuation Soars to $5.3 Billion

Runway secures $315 million in Series E funding, boosting its valuation to $5.3 billion to enhance next-gen AI video generation and world modeling technologies

Staff14 hours ago

AI Business

Arinox AI and KOGO Launch India’s First Sovereign AI Box for Enhanced Data Security

Arinox AI and KOGO unveil CommandCORE, India's first sovereign AI box, ensuring greater data security and privacy for enterprises at ₹10 lakh.

Marcus Chen23 hours ago

AI Technology

Peter Steinberger Joins OpenAI; OpenClaw to Remain Open Source Project

OpenAI hires OpenClaw creator Peter Steinberger, sustaining the project's open-source status amidst fierce competition for AI engineering talent.

Staff1 day ago

AI Technology

Smartkarma Launches AI-Enhanced Investing Platform with 55,000+ Users, Free Preview Pass Available

Smartkarma unveils a free Preview Pass for its AI-augmented investing platform, boosting access for over 55,000 investors managing $13 trillion in assets.

Staff1 day ago

Akamai Launches NVIDIA-Powered Inference Cloud, Shares Surge 17.5% After Strong Q3 Results

Akamai Technologies reports strong Q3 results with a 17.5% share surge after launching its NVIDIA-powered Inference Cloud, projecting EPS of $6.93 to $7.13.

Staff1 day ago

AIPRESSA.COM

AI Technology

AWS Launches Tranium 3 AI Chip Promising 50% Cost Savings Over Nvidia GPUs

Trending

AI Cybersecurity

Endpoint Security Market to Reach $23.9B by 2030 with 7.2% CAGR Amid Rising Cyber Threats

Top Stories

Albania Appoints AI Bot Minister Diella Amid Corruption Concerns and EU Membership Goals

AI Government

BigBear.ai Launches Biometric Platform at O’Hare, Acquires Generative AI Ask Sage for $250M

AI Business

Enterprise Architecture Shifts to Strategic Enabler in AI-Driven Business Models

Top Stories

DeepMind Achieves Breakthroughs with AlphaFold and AlphaZero, Transforming AI Landscape

You May Also Like

AI Technology

New Report Reveals 74% of Big Tech’s AI Climate Claims Are Unproven, Exposing Greenwashing

Top Stories

AI Impact Summit Set to Unlock ₹8 Lakh Crore Investments, Position India as Global Tech Leader

AI Education

UGA Launches $800K AI Pilot Program for Students, Access to ChatGPT Edu and Gemini Pro

Top Stories

Runway Secures $315 Million in Series E Funding, Valuation Soars to $5.3 Billion

AI Business

Arinox AI and KOGO Launch India’s First Sovereign AI Box for Enhanced Data Security

AI Technology

Peter Steinberger Joins OpenAI; OpenClaw to Remain Open Source Project

AI Technology

Smartkarma Launches AI-Enhanced Investing Platform with 55,000+ Users, Free Preview Pass Available

Top Stories

Akamai Launches NVIDIA-Powered Inference Cloud, Shares Surge 17.5% After Strong Q3 Results