AI Technology

OpenAI Explores AMD, Cerebras, and Groq for Enhanced Inference Performance

OpenAI evaluates AMD, Cerebras, and Groq to enhance real-time inference performance, signaling a shift in AI hardware dynamics amid rising consumer demand.

Staff

Published 3 February, 2026

OpenAI is re-evaluating its hardware strategy as the focus of artificial intelligence shifts from training models to running them efficiently in real time, a stage known as inference. While Nvidia continues to dominate the market for chips used to train large AI models, sources indicate that OpenAI is exploring alternatives, including Advanced Micro Devices (AMD), Cerebras, and Groq. The aim is to improve speed and efficiency for inference-heavy workloads, reflecting a broader industry trend toward specialized hardware as demand for consumer-facing AI grows.

Discussions with various hardware suppliers have been under way in San Francisco since last year. Although Nvidia’s GPUs remain central to OpenAI’s infrastructure, the growing focus on real-time inference tasks, such as coding tools, has prompted OpenAI to seek hardware that delivers lower latency and faster memory access. The distinction matters because inference places different demands on hardware than training, where massive parallel processing power is essential.

Nvidia’s chips have long been the standard for AI training, but inference, in which a trained model generates responses to queries, depends on rapid memory access and low latency. Consequently, OpenAI is evaluating alternative architectures that use embedded SRAM, which could offer speed advantages for real-time applications. This reassessment may mark a significant shift in the competitive landscape of AI hardware.

Among the companies OpenAI has engaged with, AMD has been assessed for its GPUs as a way to broaden OpenAI’s hardware base, while Cerebras, known for its wafer-scale chips with extensive on-chip memory, has developed a partnership with OpenAI focused on inference performance. OpenAI also held discussions with Groq about compute capacity. However, Groq’s recent licensing agreement with Nvidia, valued at roughly $20 billion, has redirected the company’s focus toward software and cloud services.

Despite these exploratory efforts, both OpenAI and Nvidia assert that Nvidia’s technology continues to underpin the majority of OpenAI’s operations and delivers strong value for its performance. Nvidia’s CEO, Jensen Huang, characterized any notion of discord with OpenAI as “nonsense,” while OpenAI’s CEO, Sam Altman, emphasized that Nvidia produces “the best AI chips in the world” and indicated that OpenAI aims to remain a significant customer.

This reassessment by OpenAI mirrors a larger trend as AI technology moves from research into large-scale deployment in consumer and enterprise applications, where the cost and performance of inference matter more. Notably, companies like Google are also investing in custom TPUs designed specifically for real-time AI tasks. Rather than seeking to replace Nvidia entirely, OpenAI appears to be pursuing diversification: reducing its reliance on a single supplier while keeping Nvidia as a key partner in its infrastructure.

The implications of OpenAI’s hardware diversification are significant. As AI services such as ChatGPT expand globally, inference is on track to become the dominant performance battleground. Exploring suppliers beyond Nvidia can give OpenAI leverage regarding pricing, capacity, and performance. Increased competition in the market could also spur innovations in memory-centric AI accelerators from companies like AMD, Cerebras, and others.

Moreover, reports indicating OpenAI’s shift toward hardware diversification and the postponement of Nvidia investment discussions have affected Nvidia’s stock in some markets, highlighting investor sensitivity to changes in the AI supply chain. As inference gains prominence relative to training, the hardware landscape may evolve into a more competitive environment, decreasing Nvidia’s centrality over time. This dynamic may redefine the relationships and power structures among major tech players in the AI field.

Looking ahead, OpenAI’s efforts at diversifying its hardware partnerships could significantly shape the future of real-time AI applications and their underlying infrastructure. As companies adapt to the escalating demands of consumer-facing AI, the potential for accelerated innovation in specialized chips presents both opportunities and challenges for established leaders in the market.

AIPRESSA.COM