Connect with us

Hi, what are you looking for?

Top Stories

Amazon Launches On-Prem AI Factories with Nvidia GPUs for Enhanced Data Control

Amazon launches AI Factories with Nvidia’s Blackwell GPUs, enabling on-premises high-performance computing while ensuring data sovereignty and regulatory compliance.

Amazon is expanding its ambitions in high-performance computing and artificial intelligence with a new offering called AI Factories, developed in partnership with Nvidia. This managed service is designed to bring high-performance computing capabilities directly into the on-premises data centers of its customers, allowing them to maintain control over their sensitive data while leveraging cloud-like functionalities.

The AI Factories service integrates Nvidia’s latest Blackwell-class GPUs and provides AWS-managed compute resources to on-premises environments. Customers are responsible for supplying the necessary power and space, while AWS manages the setup and operations of the AI clusters. This arrangement enables organizations to continue using the same TensorFlow stack they utilize in the AWS cloud, ensuring consistency across different environments.

This offering effectively serves as a private supercomputer for AI, fully managed by AWS. Importantly, data remains on-site unless customers opt for federation with the public cloud, thereby adhering to regulatory requirements around data sovereignty and security. The structure not only addresses latency-sensitive applications—such as factory vision and clinical imaging—but also simplifies compliance with various regulations, including GDPR and national security mandates.

Amazon’s move into the on-premises AI space comes as other tech giants ramp up their competitive offerings. Microsoft is developing its own Nvidia-powered AI Factory infrastructure through its Azure Local service, while Google has introduced its Distributed Cloud lineup aimed at both hosted and sovereign solutions. Oracle is also targeting regulated markets with its Oracle Alloy and dedicated region offerings. However, Amazon distinguishes itself by tightly integrating on-premises delivery with its existing AI platform services and offering customers a choice between Nvidia’s Blackwell GPUs and AWS’s Trainium3 accelerators.

Efficiency is central to the Nvidia plus Trainium equation. The Blackwell generation is engineered for large-scale training and high-throughput inference, while Trainium3 aims to deliver superior price-performance ratios. Customers can standardize on Blackwell for broad software ecosystem support or select Trainium3 for optimized total cost of ownership. AWS further enhances this offering with its Nitro and Elastic Fabric Adapter technologies, which simplify management through features like capacity planning and incident response.

As AI Factories require significant power—often between 30 to 60 kW—organizations must consider the implications for cooling infrastructure and power distribution. This power density presents challenges that the Uptime Institute and other industry bodies have flagged as critical issues for data center operators. The managed model alleviates lifecycle risks and complexities for customers while still enabling them to protect their data locality.

The benefits for regulated industries are clear: organizations can conduct AI model training and inference within their facilities while still accessing the broader AWS ecosystem as needed. Early adopters include banks refining multilingual models with proprietary transaction data, healthcare providers training imaging models on governed datasets, and manufacturers employing vision systems that require near-instantaneous processing.

Ultimately, Amazon’s AI Factories represent a strategic pivot toward hybrid AI operating models, enabling clients to maintain control over data while tapping into cloud capabilities. This approach aligns with growing demand for AI solutions that prioritize data residency and governance in an increasingly complex regulatory landscape. As businesses seek to balance the rapid evolution of technology with the imperative for data security, Amazon is positioning itself as a leader in the next generation of AI infrastructure.

See also
Staff
Written By

The AiPressa Staff team brings you comprehensive coverage of the artificial intelligence industry, including breaking news, research developments, business trends, and policy updates. Our mission is to keep you informed about the rapidly evolving world of AI technology.

You May Also Like

AI Government

US Department of Defense partners with tech giants including SpaceX and OpenAI to launch an "AI-first" initiative aimed at enhancing military decision-making efficiency.

AI Technology

AMD unveils the Ryzen AI Halo Mini-PC, boasting a 16-core Ryzen AI Max+ 395 APU and the capability to process models with up to...

AI Generative

Nvidia's partnerships with Asian firms like LG and Nanya surge AI chip demand to 90% of production costs, reshaping the tech landscape in Asia.

AI Business

Nvidia CEO Jensen Huang urges industry leaders to avoid alarmist claims about AI's future, citing concerns over inaccurate predictions like a 50% job displacement...

AI Technology

Apple CEO Tim Cook warns of several-month supply shortages for the Mac mini and Mac Studio as demand surges, pushing Mac revenue to $8.4...

Top Stories

Apple's Q2 earnings reveal a price hike for the Mac mini to $799, fueled by AI memory demand, as Google and Amazon also report...

AI Technology

Major tech giants, including Google and Amazon, are set to invest $3.7 trillion in AI infrastructure over five years, reshaping the workforce and economy.

AI Technology

Amazon and Anthropic expand their partnership with a $100B investment in AWS, enhancing AI infrastructure and accelerating generative AI adoption globally.

© 2025 AIPressa · Part of Buzzora Media · All rights reserved. This website provides general news and educational content for informational purposes only. While we strive for accuracy, we do not guarantee the completeness or reliability of the information presented. The content should not be considered professional advice of any kind. Readers are encouraged to verify facts and consult appropriate experts when needed. We are not responsible for any loss or inconvenience resulting from the use of information on this site. Some images used on this website are generated with artificial intelligence and are illustrative in nature. They may not accurately represent the products, people, or events described in the articles.