Connect with us

Hi, what are you looking for?

AI Technology

Nvidia Achieves 10x Cost Savings in AI Inference with Open-Source Models on Blackwell

Nvidia’s Blackwell architecture cuts AI inference costs by 90%, achieving 5 cents per token and enhancing healthcare efficiency with open-source models.

Nvidia has announced significant advancements in its AI infrastructure, revealing that the cost per token for its services has dropped from 20 cents using the older Hopper platform to just 10 cents on the new Blackwell architecture. Furthermore, by utilizing Blackwell’s native low-precision NVFP4 format, the cost has been reduced to a mere 5 cents per token. This transition illustrates a fourfold improvement in cost efficiency while maintaining the accuracy expected by customers.

In a recent blog post, Nvidia outlined four industry deployments that showcase how the integration of Blackwell infrastructure, NVFP4, optimized software stacks, and open-source models can lead to substantial cost reductions. One of the highlighted sectors is healthcare, which faces challenges such as time-consuming tasks related to medical coding, documentation, and insurance management. These routine activities often detract from the time healthcare professionals can spend with patients.

Sully.ai has emerged as a solution to address these challenges by leveraging AI agents to perform these repetitive tasks. However, the proprietary and closed-source models initially employed by Sully.ai did not provide the scalability necessary for widespread adoption. In a strategic pivot, Sully.ai adopted Baseten’s open-source Model API on Blackwell GPUs, incorporating the NVFP4 data format, the TensorRT-LLM library, and the Dynamo inference framework. This shift resulted in a remarkable 90% decrease in inference costs, representing a tenfold reduction compared to the previous closed-source implementation. Additionally, response times for critical workflows, such as generating medical notes, improved by 65%.

The optimization of costs and performance through Nvidia’s technology highlights the growing importance of open-source solutions in driving efficiencies within the healthcare sector. By reducing costs and enhancing the speed of processes, AI can help alleviate some of the burdens faced by healthcare providers, allowing them to focus more on direct patient care.

As AI technologies continue to evolve, organizations across various sectors are increasingly turning to innovative solutions to streamline operations and improve efficiency. Nvidia’s advancements in AI infrastructure not only signify a leap in technological capability but also reflect a broader trend of integrating open-source models into commercial applications. This shift is likely to resonate throughout the industry as companies seek to harness the power of AI while managing operational costs.

The implications of these developments extend beyond immediate cost savings. With AI becoming more accessible through such advancements, smaller firms and startups may find themselves better equipped to compete with larger players in the market. As the landscape of AI technology continues to evolve, Nvidia’s Blackwell platform may serve as a catalyst for further innovation and efficiency across a variety of sectors.

See also
Staff
Written By

The AiPressa Staff team brings you comprehensive coverage of the artificial intelligence industry, including breaking news, research developments, business trends, and policy updates. Our mission is to keep you informed about the rapidly evolving world of AI technology.

You May Also Like

AI Government

US Department of Defense partners with tech giants including SpaceX and OpenAI to launch an "AI-first" initiative aimed at enhancing military decision-making efficiency.

AI Technology

AMD unveils the Ryzen AI Halo Mini-PC, boasting a 16-core Ryzen AI Max+ 395 APU and the capability to process models with up to...

AI Generative

Nvidia's partnerships with Asian firms like LG and Nanya surge AI chip demand to 90% of production costs, reshaping the tech landscape in Asia.

AI Business

Nvidia CEO Jensen Huang urges industry leaders to avoid alarmist claims about AI's future, citing concerns over inaccurate predictions like a 50% job displacement...

AI Technology

Apple CEO Tim Cook warns of several-month supply shortages for the Mac mini and Mac Studio as demand surges, pushing Mac revenue to $8.4...

Top Stories

Apple's Q2 earnings reveal a price hike for the Mac mini to $799, fueled by AI memory demand, as Google and Amazon also report...

Top Stories

Cambricon surges to $423M in Q1 revenue with a 160% increase, outpacing Nvidia's dwindling market share in China, now below 60%.

Top Stories

Nvidia enters South Korea's AI market by launching 7 million Korean-language personas and the multimodal Nemotron3 Nano, aiming to establish market dominance.

© 2025 AIPressa · Part of Buzzora Media · All rights reserved. This website provides general news and educational content for informational purposes only. While we strive for accuracy, we do not guarantee the completeness or reliability of the information presented. The content should not be considered professional advice of any kind. Readers are encouraged to verify facts and consult appropriate experts when needed. We are not responsible for any loss or inconvenience resulting from the use of information on this site. Some images used on this website are generated with artificial intelligence and are illustrative in nature. They may not accurately represent the products, people, or events described in the articles.