Connect with us

Hi, what are you looking for?

AI Technology

Nvidia Achieves 10x Cost Savings in AI Inference with Open-Source Models on Blackwell

Nvidia’s Blackwell architecture cuts AI inference costs by 90%, achieving 5 cents per token and enhancing healthcare efficiency with open-source models.

Nvidia has announced significant advancements in its AI infrastructure, revealing that the cost per token for its services has dropped from 20 cents using the older Hopper platform to just 10 cents on the new Blackwell architecture. Furthermore, by utilizing Blackwell’s native low-precision NVFP4 format, the cost has been reduced to a mere 5 cents per token. This transition illustrates a fourfold improvement in cost efficiency while maintaining the accuracy expected by customers.

In a recent blog post, Nvidia outlined four industry deployments that showcase how the integration of Blackwell infrastructure, NVFP4, optimized software stacks, and open-source models can lead to substantial cost reductions. One of the highlighted sectors is healthcare, which faces challenges such as time-consuming tasks related to medical coding, documentation, and insurance management. These routine activities often detract from the time healthcare professionals can spend with patients.

Sully.ai has emerged as a solution to address these challenges by leveraging AI agents to perform these repetitive tasks. However, the proprietary and closed-source models initially employed by Sully.ai did not provide the scalability necessary for widespread adoption. In a strategic pivot, Sully.ai adopted Baseten’s open-source Model API on Blackwell GPUs, incorporating the NVFP4 data format, the TensorRT-LLM library, and the Dynamo inference framework. This shift resulted in a remarkable 90% decrease in inference costs, representing a tenfold reduction compared to the previous closed-source implementation. Additionally, response times for critical workflows, such as generating medical notes, improved by 65%.

The optimization of costs and performance through Nvidia’s technology highlights the growing importance of open-source solutions in driving efficiencies within the healthcare sector. By reducing costs and enhancing the speed of processes, AI can help alleviate some of the burdens faced by healthcare providers, allowing them to focus more on direct patient care.

As AI technologies continue to evolve, organizations across various sectors are increasingly turning to innovative solutions to streamline operations and improve efficiency. Nvidia’s advancements in AI infrastructure not only signify a leap in technological capability but also reflect a broader trend of integrating open-source models into commercial applications. This shift is likely to resonate throughout the industry as companies seek to harness the power of AI while managing operational costs.

The implications of these developments extend beyond immediate cost savings. With AI becoming more accessible through such advancements, smaller firms and startups may find themselves better equipped to compete with larger players in the market. As the landscape of AI technology continues to evolve, Nvidia’s Blackwell platform may serve as a catalyst for further innovation and efficiency across a variety of sectors.

See also
Staff
Written By

The AiPressa Staff team brings you comprehensive coverage of the artificial intelligence industry, including breaking news, research developments, business trends, and policy updates. Our mission is to keep you informed about the rapidly evolving world of AI technology.

You May Also Like

Top Stories

OpenAI secures a historic $122B funding round at an $852B valuation, aiming to unify its products into an "AI superapp" amidst rapid enterprise revenue...

Top Stories

Nvidia's DLSS-5 technology faces backlash for AI-altered female character designs in gaming, prompting concerns over artistic integrity and beauty standards.

AI Technology

OpenAI secures $122 billion in funding, achieving an $852 billion valuation, to scale AI infrastructure and enhance consumer and enterprise products.

Top Stories

DeepSeek shifts to Huawei chips, revealing a 50% spike in Chinese representation in US AI research, as Western firms struggle with $15M daily costs...

AI Marketing

Nvidia's stock surged 5.56% to $174.40 following a $2 billion AI partnership with Marvell Technology, signaling heightened demand in the AI sector.

Top Stories

Nvidia stock plunges 9% to $165 as a critical head-and-shoulders pattern emerges amid a market sell-off triggered by Google’s TurboQuant announcement.

AI Technology

Nvidia's stock climbs as Mistral AI secures $830M in funding for a Paris data center, ordering 13,800 GPUs that could yield $575M in sales.

AI Technology

NVIDIA's Burkov reveals that inadequate infrastructure, not flawed models, stalls healthcare AI scaling, hindering innovation and patient outcomes.

© 2025 AIPressa · Part of Buzzora Media · All rights reserved. This website provides general news and educational content for informational purposes only. While we strive for accuracy, we do not guarantee the completeness or reliability of the information presented. The content should not be considered professional advice of any kind. Readers are encouraged to verify facts and consult appropriate experts when needed. We are not responsible for any loss or inconvenience resulting from the use of information on this site. Some images used on this website are generated with artificial intelligence and are illustrative in nature. They may not accurately represent the products, people, or events described in the articles.