Samsung has unveiled compression technology that brings cloud-level AI capabilities to smartphones: a 30-billion-parameter AI model, which would typically require over 16GB of memory, can now run in less than 3GB on a mobile device. Dr. MyungJoo Ham of the Samsung Research AI Center detailed these advances in an exclusive interview, explaining how the company aims to make smartphone intelligence rival that of cloud-based systems.
The innovations from Samsung promise to redefine the boundaries of mobile AI. The research team has achieved a feat that many believed impractical just months ago, enabling massive AI models to run locally on devices. Dr. Ham explained, “Running a highly advanced model that performs billions of computations directly on a smartphone would quickly drain the battery, increase heat, and slow response times. Model compression technology emerged to address these issues.”
At the heart of this technology lies a sophisticated quantization process, akin to photo compression, which maintains visual quality while significantly reducing file sizes. Samsung’s proprietary algorithms transform 32-bit floating-point calculations into 8-bit or even 4-bit integers, thereby drastically cutting down both memory usage and computational load.
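The basic idea behind this kind of quantization can be sketched in a few lines. The snippet below shows generic symmetric int8 quantization of a float32 weight tensor; it is an illustrative sketch only, since Samsung's proprietary algorithms are not public, and the function names and the per-tensor scaling scheme are assumptions for the example.

```python
import numpy as np

def quantize_int8(weights: np.ndarray):
    """Symmetric per-tensor quantization: float32 -> int8.
    Illustrative sketch; not Samsung's actual (unpublished) algorithm."""
    scale = np.max(np.abs(weights)) / 127.0  # map the largest magnitude to 127
    q = np.clip(np.round(weights / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q: np.ndarray, scale: float) -> np.ndarray:
    return q.astype(np.float32) * scale

# A float32 tensor uses 4 bytes per weight; its int8 version uses 1 byte,
# a 4x memory reduction (8x for 4-bit schemes).
w = np.random.randn(1024).astype(np.float32)
q, scale = quantize_int8(w)
error = np.max(np.abs(dequantize(q, scale) - w))  # bounded by scale / 2
```

Each weight's rounding error is at most half the quantization step, which is why the reconstructed tensor stays close to the original, much like a compressed photo staying visually close to the source.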
What sets Samsung’s approach apart is its nuanced understanding of neural network weights. Dr. Ham noted that not all components of an AI model carry equal significance. The compression methodology identifies critical neural network weights, ensuring their preservation at higher precision while more aggressively compressing less important elements. “Because each model weight has a different level of importance, we preserve critical weights with higher precision while compressing less important ones more aggressively,” he stated.
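A minimal way to picture this mixed-precision idea is to split a weight tensor by an importance score and compress the two groups differently. In the sketch below, weight magnitude stands in for "importance" (the criterion Samsung actually uses is not public), the top weights are kept at float16, and the rest are squeezed into a signed 4-bit integer range. All names and ratios here are illustrative assumptions.

```python
import numpy as np

def mixed_precision_compress(weights: np.ndarray, keep_ratio: float = 0.05):
    """Hypothetical importance-aware compression sketch: preserve the
    largest-magnitude weights at float16, quantize the rest to a 4-bit
    signed range. Not Samsung's actual method."""
    flat = weights.ravel()
    k = max(1, int(flat.size * keep_ratio))
    important_idx = np.argsort(np.abs(flat))[-k:]       # top-k by magnitude
    important = flat[important_idx].astype(np.float16)  # higher precision
    mask = np.ones(flat.size, dtype=bool)
    mask[important_idx] = False
    rest = flat[mask]
    scale = np.max(np.abs(rest)) / 7.0                  # 4-bit signed range: -7..7
    q4 = np.clip(np.round(rest / scale), -7, 7).astype(np.int8)
    return important, important_idx, q4, scale

w = np.random.randn(10_000).astype(np.float32)
important, idx, q4, scale = mixed_precision_compress(w, keep_ratio=0.05)
```

The design trade-off is exactly the one Dr. Ham describes: spending a few extra bits on the small fraction of weights that matter most limits the accuracy loss from compressing everything else aggressively.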
Introducing the AI Runtime Engine
Samsung has also developed an “AI runtime engine,” which acts as the engine control unit for AI models running on smartphones. This component functions like a smart traffic controller, determining the optimal processor—CPU, GPU, or NPU—to execute each operation efficiently. This strategic allocation minimizes memory access, ensuring maximum performance for AI tasks on mobile devices. “The AI runtime is essentially the model’s engine control unit,” Dr. Ham explained. “When a model runs across multiple processors, the runtime automatically assigns each operation to the optimal chip and minimizes memory access to boost overall AI performance.”
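The "traffic controller" role described above can be illustrated with a toy dispatcher that greedily picks the cheapest processor for each operation while penalizing data movement between chips. The operation names, cost numbers, and flat transfer penalty below are all made up for illustration; a real runtime would profile the actual hardware.

```python
# Hypothetical per-op execution costs on each processor (arbitrary units).
OP_COSTS = {
    "embed":   {"CPU": 1.0, "GPU": 3.0, "NPU": 4.0},
    "matmul":  {"CPU": 9.0, "GPU": 2.0, "NPU": 1.0},
    "softmax": {"CPU": 1.5, "GPU": 1.0, "NPU": 2.5},
}
TRANSFER_COST = 0.5  # assumed penalty for moving tensors between processors

def assign(ops):
    """Greedily pick the cheapest processor per op, counting the cost of
    moving data off the previous processor (i.e. minimizing memory access)."""
    plan, prev = [], None
    for op in ops:
        def total(proc):
            move = TRANSFER_COST if prev is not None and proc != prev else 0.0
            return OP_COSTS[op][proc] + move
        best = min(OP_COSTS[op], key=total)
        plan.append((op, best))
        prev = best
    return plan

plan = assign(["embed", "matmul", "softmax"])
# With these toy costs: embed stays on the CPU, the heavy matmul goes to
# the NPU, and softmax lands on the GPU once transfer penalties count.
```

Even this greedy sketch captures the core tension the runtime engine manages: the fastest chip for an operation in isolation is not always the best choice once the cost of shuttling data between processors is included.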
The implications of these advancements are vast. With the ability to run sophisticated AI models directly on smartphones, users can expect enhanced functionalities previously limited to cloud computing. This shift not only promises faster response times but also improved user experiences across applications, from virtual assistants to real-time data processing.
As the competition in the mobile AI space intensifies, Samsung’s compression technology positions it at the forefront of innovation. By successfully integrating such advanced AI capabilities into smartphones, the company is paving the way for a future where mobile devices become increasingly autonomous and capable of processing complex tasks without relying solely on cloud infrastructures.
In summary, Samsung’s developments in AI compression and the introduction of an efficient runtime engine are significant steps toward making smartphones more intelligent and user-friendly. As these technologies continue to evolve, they could fundamentally alter how we interact with our devices and leverage AI in daily life.