Amazon is expanding its ambitions in high-performance computing and artificial intelligence with a new offering called AI Factories, developed in partnership with Nvidia. This managed service is designed to bring high-performance computing capabilities directly into the on-premises data centers of its customers, allowing them to maintain control over their sensitive data while leveraging cloud-like functionalities.
The AI Factories service integrates Nvidia’s latest Blackwell-class GPUs and provides AWS-managed compute resources to on-premises environments. Customers are responsible for supplying the necessary power and space, while AWS manages the setup and operations of the AI clusters. This arrangement enables organizations to continue using the same TensorFlow stack they utilize in the AWS cloud, ensuring consistency across different environments.
This offering effectively serves as a private supercomputer for AI, fully managed by AWS. Importantly, data remains on-site unless customers opt for federation with the public cloud, thereby adhering to regulatory requirements around data sovereignty and security. The structure not only addresses latency-sensitive applications—such as factory vision and clinical imaging—but also simplifies compliance with various regulations, including GDPR and national security mandates.
Amazon’s move into the on-premises AI space comes as other tech giants ramp up their competitive offerings. Microsoft is developing its own Nvidia-powered AI Factory infrastructure through its Azure Local service, while Google has introduced its Distributed Cloud lineup aimed at both hosted and sovereign solutions. Oracle is also targeting regulated markets with its Oracle Alloy and dedicated region offerings. However, Amazon distinguishes itself by tightly integrating on-premises delivery with its existing AI platform services and offering customers a choice between Nvidia’s Blackwell GPUs and AWS’s Trainium3 accelerators.
Efficiency is central to the Nvidia plus Trainium equation. The Blackwell generation is engineered for large-scale training and high-throughput inference, while Trainium3 aims to deliver superior price-performance ratios. Customers can standardize on Blackwell for broad software ecosystem support or select Trainium3 for optimized total cost of ownership. AWS further enhances this offering with its Nitro and Elastic Fabric Adapter technologies, which simplify management through features like capacity planning and incident response.
As AI Factories require significant power—often between 30 to 60 kW—organizations must consider the implications for cooling infrastructure and power distribution. This power density presents challenges that the Uptime Institute and other industry bodies have flagged as critical issues for data center operators. The managed model alleviates lifecycle risks and complexities for customers while still enabling them to protect their data locality.
The benefits for regulated industries are clear: organizations can conduct AI model training and inference within their facilities while still accessing the broader AWS ecosystem as needed. Early adopters include banks refining multilingual models with proprietary transaction data, healthcare providers training imaging models on governed datasets, and manufacturers employing vision systems that require near-instantaneous processing.
Ultimately, Amazon’s AI Factories represent a strategic pivot toward hybrid AI operating models, enabling clients to maintain control over data while tapping into cloud capabilities. This approach aligns with growing demand for AI solutions that prioritize data residency and governance in an increasingly complex regulatory landscape. As businesses seek to balance the rapid evolution of technology with the imperative for data security, Amazon is positioning itself as a leader in the next generation of AI infrastructure.
See also
Amazon CloudWatch Launches Unified Data Store to Streamline Log Management and Insights
Grok Chooses Self-Sacrifice Over Harming Elon Musk, Citing 6 Million Jewish Lives
The Life You Can Save Unveils AI SmartAdvisor to Enhance Donation Impact This Giving Tuesday
Butterfly Network Announces $15M Midjourney Licensing Deal, Launches Compass AI to Boost Revenue




















































