Connect with us

Hi, what are you looking for?

Top Stories

Amazon Launches On-Prem AI Factories with Nvidia GPUs for Enhanced Data Control

Amazon launches AI Factories with Nvidia’s Blackwell GPUs, enabling on-premises high-performance computing while ensuring data sovereignty and regulatory compliance.

Amazon is expanding its ambitions in high-performance computing and artificial intelligence with a new offering called AI Factories, developed in partnership with Nvidia. This managed service is designed to bring high-performance computing capabilities directly into the on-premises data centers of its customers, allowing them to maintain control over their sensitive data while leveraging cloud-like functionalities.

The AI Factories service integrates Nvidia’s latest Blackwell-class GPUs and provides AWS-managed compute resources to on-premises environments. Customers are responsible for supplying the necessary power and space, while AWS manages the setup and operations of the AI clusters. This arrangement enables organizations to continue using the same TensorFlow stack they utilize in the AWS cloud, ensuring consistency across different environments.

This offering effectively serves as a private supercomputer for AI, fully managed by AWS. Importantly, data remains on-site unless customers opt for federation with the public cloud, thereby adhering to regulatory requirements around data sovereignty and security. The structure not only addresses latency-sensitive applications—such as factory vision and clinical imaging—but also simplifies compliance with various regulations, including GDPR and national security mandates.

Amazon’s move into the on-premises AI space comes as other tech giants ramp up their competitive offerings. Microsoft is developing its own Nvidia-powered AI Factory infrastructure through its Azure Local service, while Google has introduced its Distributed Cloud lineup aimed at both hosted and sovereign solutions. Oracle is also targeting regulated markets with its Oracle Alloy and dedicated region offerings. However, Amazon distinguishes itself by tightly integrating on-premises delivery with its existing AI platform services and offering customers a choice between Nvidia’s Blackwell GPUs and AWS’s Trainium3 accelerators.

Efficiency is central to the Nvidia plus Trainium equation. The Blackwell generation is engineered for large-scale training and high-throughput inference, while Trainium3 aims to deliver superior price-performance ratios. Customers can standardize on Blackwell for broad software ecosystem support or select Trainium3 for optimized total cost of ownership. AWS further enhances this offering with its Nitro and Elastic Fabric Adapter technologies, which simplify management through features like capacity planning and incident response.

As AI Factories require significant power—often between 30 to 60 kW—organizations must consider the implications for cooling infrastructure and power distribution. This power density presents challenges that the Uptime Institute and other industry bodies have flagged as critical issues for data center operators. The managed model alleviates lifecycle risks and complexities for customers while still enabling them to protect their data locality.

The benefits for regulated industries are clear: organizations can conduct AI model training and inference within their facilities while still accessing the broader AWS ecosystem as needed. Early adopters include banks refining multilingual models with proprietary transaction data, healthcare providers training imaging models on governed datasets, and manufacturers employing vision systems that require near-instantaneous processing.

Ultimately, Amazon’s AI Factories represent a strategic pivot toward hybrid AI operating models, enabling clients to maintain control over data while tapping into cloud capabilities. This approach aligns with growing demand for AI solutions that prioritize data residency and governance in an increasingly complex regulatory landscape. As businesses seek to balance the rapid evolution of technology with the imperative for data security, Amazon is positioning itself as a leader in the next generation of AI infrastructure.

See also
Staff
Written By

The AiPressa Staff team brings you comprehensive coverage of the artificial intelligence industry, including breaking news, research developments, business trends, and policy updates. Our mission is to keep you informed about the rapidly evolving world of AI technology.

You May Also Like

Top Stories

DeepSeek forecasts Nvidia's stock will surge 50% to $265 by 2026, driven by new technology and strong institutional confidence amid market challenges.

AI Technology

Meta's new KernelEvolve system automates kernel optimization, boosting AI model throughput by over 60%, revolutionizing performance across diverse hardware platforms.

AI Technology

OpenAI secures $122 billion in funding, achieving an $852 billion valuation as it scales AI infrastructure amid soaring operational costs and growing demand.

AI Technology

Nvidia, Digital Realty, and Credo Technology are positioned to capitalize on a $700 billion AI infrastructure boom as major tech firms ramp up investments.

Top Stories

Amazon announces a $200 billion investment in AI by 2026, while Apple partners with Alphabet to enhance Siri with the Gemini LLM

AI Technology

Nvidia invests $2 billion in Marvell to create advanced AI infrastructure, enhancing custom silicon solutions amid a projected $630 billion industry push this year.

Top Stories

Malaysia targets 900 AI start-ups as it strengthens its governance framework, positioning itself as a regional digital hub amid global tech investments.

AI Research

Meta assembles a top-tier AI team, led by VP Yang Song, to revolutionize Facebook and Instagram algorithms amid fierce competition for ad revenue.

© 2025 AIPressa · Part of Buzzora Media · All rights reserved. This website provides general news and educational content for informational purposes only. While we strive for accuracy, we do not guarantee the completeness or reliability of the information presented. The content should not be considered professional advice of any kind. Readers are encouraged to verify facts and consult appropriate experts when needed. We are not responsible for any loss or inconvenience resulting from the use of information on this site. Some images used on this website are generated with artificial intelligence and are illustrative in nature. They may not accurately represent the products, people, or events described in the articles.