Connect with us

Hi, what are you looking for?

Top Stories

Amazon Launches On-Prem AI Factories with Nvidia GPUs for Enhanced Data Control

Amazon launches AI Factories with Nvidia’s Blackwell GPUs, enabling on-premises high-performance computing while ensuring data sovereignty and regulatory compliance.

Amazon is expanding its ambitions in high-performance computing and artificial intelligence with a new offering called AI Factories, developed in partnership with Nvidia. This managed service is designed to bring high-performance computing capabilities directly into the on-premises data centers of its customers, allowing them to maintain control over their sensitive data while leveraging cloud-like functionalities.

The AI Factories service integrates Nvidia’s latest Blackwell-class GPUs and provides AWS-managed compute resources to on-premises environments. Customers are responsible for supplying the necessary power and space, while AWS manages the setup and operations of the AI clusters. This arrangement enables organizations to continue using the same TensorFlow stack they utilize in the AWS cloud, ensuring consistency across different environments.

This offering effectively serves as a private supercomputer for AI, fully managed by AWS. Importantly, data remains on-site unless customers opt for federation with the public cloud, thereby adhering to regulatory requirements around data sovereignty and security. The structure not only addresses latency-sensitive applications—such as factory vision and clinical imaging—but also simplifies compliance with various regulations, including GDPR and national security mandates.

Amazon’s move into the on-premises AI space comes as other tech giants ramp up their competitive offerings. Microsoft is developing its own Nvidia-powered AI Factory infrastructure through its Azure Local service, while Google has introduced its Distributed Cloud lineup aimed at both hosted and sovereign solutions. Oracle is also targeting regulated markets with its Oracle Alloy and dedicated region offerings. However, Amazon distinguishes itself by tightly integrating on-premises delivery with its existing AI platform services and offering customers a choice between Nvidia’s Blackwell GPUs and AWS’s Trainium3 accelerators.

Efficiency is central to the Nvidia plus Trainium equation. The Blackwell generation is engineered for large-scale training and high-throughput inference, while Trainium3 aims to deliver superior price-performance ratios. Customers can standardize on Blackwell for broad software ecosystem support or select Trainium3 for optimized total cost of ownership. AWS further enhances this offering with its Nitro and Elastic Fabric Adapter technologies, which simplify management through features like capacity planning and incident response.

As AI Factories require significant power—often between 30 to 60 kW—organizations must consider the implications for cooling infrastructure and power distribution. This power density presents challenges that the Uptime Institute and other industry bodies have flagged as critical issues for data center operators. The managed model alleviates lifecycle risks and complexities for customers while still enabling them to protect their data locality.

The benefits for regulated industries are clear: organizations can conduct AI model training and inference within their facilities while still accessing the broader AWS ecosystem as needed. Early adopters include banks refining multilingual models with proprietary transaction data, healthcare providers training imaging models on governed datasets, and manufacturers employing vision systems that require near-instantaneous processing.

Ultimately, Amazon’s AI Factories represent a strategic pivot toward hybrid AI operating models, enabling clients to maintain control over data while tapping into cloud capabilities. This approach aligns with growing demand for AI solutions that prioritize data residency and governance in an increasingly complex regulatory landscape. As businesses seek to balance the rapid evolution of technology with the imperative for data security, Amazon is positioning itself as a leader in the next generation of AI infrastructure.

See also
Staff
Written By

The AiPressa Staff team brings you comprehensive coverage of the artificial intelligence industry, including breaking news, research developments, business trends, and policy updates. Our mission is to keep you informed about the rapidly evolving world of AI technology.

You May Also Like

Top Stories

Bill Ackman’s Pershing Square Capital invests 10% in Meta Platforms, capitalizing on AI-driven ad revenue potential amid a $165 billion capital expenditure plan.

Top Stories

Runway secures $315 million in Series E funding, boosting its valuation to $5.3 billion to enhance next-gen AI video generation and world modeling technologies

Top Stories

Alphabet and Amazon boost AI capital expenditures by billions to establish sovereign data centers, responding to surging global demand and geopolitical pressures.

AI Business

Arinox AI and KOGO unveil CommandCORE, India's first sovereign AI box, ensuring greater data security and privacy for enterprises at ₹10 lakh.

Top Stories

Akamai Technologies reports strong Q3 results with a 17.5% share surge after launching its NVIDIA-powered Inference Cloud, projecting EPS of $6.93 to $7.13.

Top Stories

Hugging Face rejects Nvidia's $500 million investment to uphold its strategic neutrality and maintain open access for 13 million users in the AI ecosystem.

Top Stories

Amazon shares plummet 18% to $198.79 as a $200 billion AI investment plan stirs profitability doubts, marking a challenging market landscape for tech stocks.

Top Stories

Amazon veteran Hemant Virmani, laid off after 11 years, pivots to AI upskilling while seeking impactful engineering roles in the evolving tech landscape.

© 2025 AIPressa · Part of Buzzora Media · All rights reserved. This website provides general news and educational content for informational purposes only. While we strive for accuracy, we do not guarantee the completeness or reliability of the information presented. The content should not be considered professional advice of any kind. Readers are encouraged to verify facts and consult appropriate experts when needed. We are not responsible for any loss or inconvenience resulting from the use of information on this site. Some images used on this website are generated with artificial intelligence and are illustrative in nature. They may not accurately represent the products, people, or events described in the articles.