Nvidia has announced significant advancements in its AI infrastructure, revealing that the cost per token for its services has dropped from 20 cents using the older Hopper platform to just 10 cents on the new Blackwell architecture. Furthermore, by utilizing Blackwell’s native low-precision NVFP4 format, the cost has been reduced to a mere 5 cents per token. This transition illustrates a fourfold improvement in cost efficiency while maintaining the accuracy expected by customers.
In a recent blog post, Nvidia outlined four industry deployments that showcase how the integration of Blackwell infrastructure, NVFP4, optimized software stacks, and open-source models can lead to substantial cost reductions. One of the highlighted sectors is healthcare, which faces challenges such as time-consuming tasks related to medical coding, documentation, and insurance management. These routine activities often detract from the time healthcare professionals can spend with patients.
Sully.ai has emerged as a solution to address these challenges by leveraging AI agents to perform these repetitive tasks. However, the proprietary and closed-source models initially employed by Sully.ai did not provide the scalability necessary for widespread adoption. In a strategic pivot, Sully.ai adopted Baseten’s open-source Model API on Blackwell GPUs, incorporating the NVFP4 data format, the TensorRT-LLM library, and the Dynamo inference framework. This shift resulted in a remarkable 90% decrease in inference costs, representing a tenfold reduction compared to the previous closed-source implementation. Additionally, response times for critical workflows, such as generating medical notes, improved by 65%.
The optimization of costs and performance through Nvidia’s technology highlights the growing importance of open-source solutions in driving efficiencies within the healthcare sector. By reducing costs and enhancing the speed of processes, AI can help alleviate some of the burdens faced by healthcare providers, allowing them to focus more on direct patient care.
As AI technologies continue to evolve, organizations across various sectors are increasingly turning to innovative solutions to streamline operations and improve efficiency. Nvidia’s advancements in AI infrastructure not only signify a leap in technological capability but also reflect a broader trend of integrating open-source models into commercial applications. This shift is likely to resonate throughout the industry as companies seek to harness the power of AI while managing operational costs.
The implications of these developments extend beyond immediate cost savings. With AI becoming more accessible through such advancements, smaller firms and startups may find themselves better equipped to compete with larger players in the market. As the landscape of AI technology continues to evolve, Nvidia’s Blackwell platform may serve as a catalyst for further innovation and efficiency across a variety of sectors.
See also
George Soros Invests $69M in AI Stocks Broadcom and Tesla Amid Market Declines
Nvidia CEO Jensen Huang Cancels India Summit Visit Amid Global AI Competition
OpenAI’s Sherwin Wu Predicts AI Will Spark Golden Age for Small Startups
Intel Acquires AI Startup SambaNova, Unveils Z Angle Memory Prototype for Data Centers





















































