Connect with us

Hi, what are you looking for?

AI Generative

NVIDIA Reveals RTX Upgrades Boosting 4K AI Video Generation by 3x with 60% VRAM Savings

NVIDIA’s latest RTX upgrades triple 4K AI video generation speed while slashing VRAM usage by 60%, revolutionizing local generative AI for creators.

NVIDIA has unveiled a series of groundbreaking advancements in generative AI technology at CES 2025, marking a pivotal moment for AI development on personal computers. The new features, which promise to enhance the performance and usability of generative AI on PC-class devices, come in response to a significant uptick in interest among creators, gamers, and productivity users. Small language models (SLMs) have seen a nearly 2x improvement in accuracy compared to 2024, helping to bridge the gap with larger, cloud-based language models.

Among the major announcements, NVIDIA highlighted enhancements to its GeForce RTX, RTX PRO, and DGX Spark devices, aiming to unlock the necessary performance and memory for developers to effectively deploy generative AI on personal computers. These upgrades include up to 3x performance improvements and a 60% reduction in VRAM usage for video and image generative AI through optimizations in PyTorch-CUDA. Additionally, the RTX Video Super Resolution feature has been integrated into ComfyUI, enabling accelerated 4K video generation.

The new video generation pipeline, powered by NVIDIA’s RTX technology, allows creators to produce videos three times faster while maintaining a high quality, upscaled to 4K. This innovation is particularly significant given the challenges associated with generating high-resolution videos directly on PCs, where model sizes often exceed available VRAM. The pipeline includes features such as a 3D object generator, a 3D-guided image generator, and a video generator that animates videos based on user-defined keyframes, allowing for greater artistic control.

The introduction of Lightricks’ LTX-2 model, now available for download, represents a major milestone for local AI video creation. This model can generate up to 20 seconds of 4K video with impressive visual fidelity, complete with built-in audio and advanced conditioning capabilities. The collaboration between NVIDIA and ComfyUI has yielded a 40% performance optimization on NVIDIA GPUs, with new support for NVFP4 and NVFP8 data formats, resulting in a substantial increase in efficiency.

A key feature of the latest software updates includes improvements to weight streaming, allowing ComfyUI to utilize system RAM when VRAM capacity is exceeded. This facilitates the use of larger models and more complex workflows on mid-range RTX GPUs, broadening access for creators working in AI.

Alongside these advancements, Nexa.ai introduced a beta version of Hyperlink, a local search agent that transforms traditional PC file searching into a more intuitive experience. By enabling natural language queries, Hyperlink allows users to search their documents, slides, and images without relying solely on file names. The new version extends this capability to video content, enabling users to search through videos for specific objects and actions, catering to a diverse user base from content creators to gamers.

In addition, NVIDIA has reported a 35% increase in inference performance for SLMs using models like Llama.cpp and Ollama, which have gained traction in the open-source community. These improvements are expected to enhance application performance in various AI-driven tasks, exemplified by the upcoming MSI AI Robot app that will leverage these optimizations to allow users to manage their device settings seamlessly.

The latest updates to the NVIDIA Broadcast app, including Version 2.1, improve the quality of microphone and webcam input for livestreaming and video conferencing, making advanced AI features like Virtual Key Light accessible to a broader range of users. This update enhances performance and expands compatibility, catering to content creators and professionals seeking to improve their streaming quality.

Finally, NVIDIA’s DGX Spark, a compact supercomputer designed for AI development, has experienced substantial performance improvements since its recent launch, now boasting up to 2.6x faster processing capabilities. This technology is tailored for those looking to experiment with large language models or develop AI workflows without compromising their primary computing resources.

As NVIDIA continues to push the envelope on local AI capabilities, these advancements not only signify a shift in how generative AI can be utilized on personal computers but also highlight the growing emphasis on privacy and performance in AI development. The trajectory for 2025 suggests a significant shift towards enabling a broader range of users to engage with and benefit from cutting-edge AI technologies.

See also
Staff
Written By

The AiPressa Staff team brings you comprehensive coverage of the artificial intelligence industry, including breaking news, research developments, business trends, and policy updates. Our mission is to keep you informed about the rapidly evolving world of AI technology.

You May Also Like

Top Stories

Nvidia projects a staggering $147.8 billion in revenue for 2025, driving AI investment alongside IBM's quantum computing leap and Astera Labs' infrastructure growth.

Top Stories

AI investments are set to surpass $2 trillion by 2026, with tech giants like Microsoft and Meta leading the charge in groundbreaking infrastructure projects.

AI Finance

AMD launches its Helios AI data center rack to rival Nvidia, promising a groundbreaking 1,000x performance boost with the upcoming MI500 series GPUs.

Top Stories

Runway unveils its Gen-4.5 video generation model, leveraging NVIDIA's Rubin platform for real-time, high-fidelity video creation utilizing 50 petaflops of GPU power.

AI Finance

xAI secures $20 billion in Series E funding to enhance AI infrastructure and support the deployment of Grok to 600 million monthly active users.

AI Technology

Siemens and NVIDIA unveil an AI-driven industrial revolution set to transform manufacturing, while Razer showcases groundbreaking AI tech prototypes at CES 2026.

Top Stories

NYSE reports stable pre-market as tech stocks rally ahead of CES 2026; Tortoise Capital launches AI Infrastructure ETF with anticipated market impact.

AI Technology

Razer unveils Project AVA, an AI desk companion with adaptive features and a 5.5-inch avatar, alongside breakthrough gaming innovations at CES 2026.

© 2025 AIPressa · Part of Buzzora Media · All rights reserved. This website provides general news and educational content for informational purposes only. While we strive for accuracy, we do not guarantee the completeness or reliability of the information presented. The content should not be considered professional advice of any kind. Readers are encouraged to verify facts and consult appropriate experts when needed. We are not responsible for any loss or inconvenience resulting from the use of information on this site. Some images used on this website are generated with artificial intelligence and are illustrative in nature. They may not accurately represent the products, people, or events described in the articles.