Multiverse Launches LittleLamb AI Models on Hugging Face, Reducing Size by 50%

Multiverse Computing unveils the LittleLamb AI model family on Hugging Face, reducing model size by 50% while enhancing performance for edge and mobile applications.

Staff

Published

1 hour ago

Multiverse Computing, a leader in AI model compression, has launched the LittleLamb open-source model family on Hugging Face. This release features three ultra-compact AI models—LittleLamb 0.3B, LittleLamb 0.3B Tool-Calling, and LittleLamb 0.3B Mobile—specifically designed for edge, mobile, and offline deployment. The models, developed with Multiverse’s proprietary CompactifAI technology, are available for free as the company aims to enhance real-world AI applications while maintaining a smaller footprint.

Each model in the LittleLamb family has been compressed to approximately half the size of the original Qwen3-0.6B architecture, allowing for efficient inference with reduced latency and resource usage. All models support bilingual English and Spanish, providing developers with dual inference modes: a “thinking mode” for complex reasoning tasks like math and science, and a “non-thinking mode” that prioritizes speed for general dialogue.

According to Enrique Lizaso Olmos, CEO of Multiverse Computing, the launch of LittleLamb underscores the company’s commitment to making efficient AI accessible across diverse deployment environments. “With CompactifAI, we’ve demonstrated that compression doesn’t require sacrificing intelligence or capability,” he stated. The models are engineered to be not just lightweight but effective in various environments, challenging the notion that advanced AI must rely heavily on cloud infrastructure.

The three models offer distinct features tailored for specific use cases. LittleLamb 0.3B serves as a versatile bilingual model suitable for conversational AI, virtual assistants, and basic Q&A applications. In contrast, LittleLamb 0.3B Tool-Calling has been fine-tuned for tasks requiring API interactions and structured outputs, making it ideal for developers looking to integrate AI into automation pipelines. LittleLamb 0.3B Mobile, on the other hand, is optimized for resource-constrained environments and targets on-device assistants and offline applications.

Performance metrics reveal that both LittleLamb 0.3B and LittleLamb 0.3B Tool-Calling surpass the original Qwen3-0.6B model and demonstrate better results than many models in the Gemma 270M class on HLE testing. This performance improvement reflects Multiverse’s ongoing efforts to develop compressed models that remain competitive relative to more substantial architectures. The new models also enhance system throughput, output speed, and TTFT benchmarks, ensuring reliability in various applications.

Multiverse’s CompactifAI technology, which applies quantum-inspired tensor network mathematics, allows for model size reduction of up to 95% with minimal precision loss of only 2–3%. This contrasts sharply with the industry standard, where compression often results in a 20–30% decrease in accuracy at similar rates. Such advancements enable the deployment of AI in lighter, more accessible forms in mobile and edge environments.

With the introduction of the LittleLamb model family, Multiverse is making significant strides in edge-native AI, expanding its portfolio of open-source models aimed at increasing the practicality of advanced AI for developers. The availability of these models addresses a growing demand for AI that is not only theoretically accessible but also practical for real-world applications, especially in scenarios where privacy, latency, or computing power are critical concerns.

Developers interested in exploring Multiverse Computing’s offerings can access all released models on Hugging Face at https://huggingface.co/MultiverseComputingCAI. The specific models, including LittleLamb 0.3B, Tool-Calling, and Mobile, are available through direct links provided on the platform. For additional technical details, documentation, and integration guides, users are encouraged to visit the company’s Hugging Face page and official website, https://huggingface.co/MultiverseComputingCAI.

DeepSeek Launches V4, Surpassing GPT-5 and Claude in Key AI Benchmarks

DeepSeek's V4-Pro eclipses GPT-5 and Claude in key benchmarks, achieving a Codeforces rating of 3,206 while undercutting OpenAI's costs by 89% per million tokens.

Staff1 day ago

Hugging Face Launches ML Intern, Outperforming Claude Code in Scientific Reasoning

Hugging Face launches ML Intern, an open-source AI agent that surpasses Claude Code in scientific reasoning with a 32% GPQA score, offering $1,000 in...

Staff5 days ago

Anonymous Developer Claims 235M Parameter LLM Trained on Single RTX 5080 GPU

Anonymous developer RizenML claims to have trained a 235M parameter language model on a single Nvidia RTX 5080 in 14 days, challenging traditional AI...

Staff6 days ago

Hugging Face Vulnerability Exploited to Deploy NKAbuse Blockchain Malware in RCE Attacks

Threat actors exploit the Marimo Python notebook vulnerability (CVE-2026-39987) to deploy NKAbuse malware via Hugging Face, launching 662 attacks in just three days.

Staff20 April, 2026

Hugging Face Launches HoloTab Browser Agent to Enhance AI-Driven Computer Use

Hugging Face's HoloTab Chrome extension enables AI models to mimic human behavior in web applications, enhancing automation without site-specific integrations.

Staff17 April, 2026

MiniMax Launches M2.7 AI Model Free, Surpassing Gemini 3.1 Pro with 229 Billion Parameters

MiniMax launches the free M2.7 AI model with 229 billion parameters, outperforming Gemini 3.1 Pro in key benchmarks and enhancing multi-agent capabilities.

Staff13 April, 2026

AI Generative

MegaTrain Achieves 120B Parameter LLM Training on Single GPU, Bypassing HBM Limits

MegaTrain enables the training of 120 billion parameter language models on a single NVIDIA H200 GPU, revolutionizing AI development by bypassing HBM limits.

Staff12 April, 2026

Hugging Face Contributes Safetensors to PyTorch Foundation to Enhance AI Security

Hugging Face donates its Safetensors project to the PyTorch Foundation, enhancing AI security by mitigating risks associated with arbitrary code execution.

Staff12 April, 2026

AIPRESSA.COM

Top Stories

Multiverse Launches LittleLamb AI Models on Hugging Face, Reducing Size by 50%

Trending

Top Stories

Albania Appoints AI Bot Minister Diella Amid Corruption Concerns and EU Membership Goals

AI Government

BigBear.ai Launches Biometric Platform at O’Hare, Acquires Generative AI Ask Sage for $250M

AI Cybersecurity

Endpoint Security Market to Reach $23.9B by 2030 with 7.2% CAGR Amid Rising Cyber Threats

AI Business

Enterprise Architecture Shifts to Strategic Enabler in AI-Driven Business Models

AI Technology

AI Hardware Market Grows 30% in 2025, Driven by Generative AI and Edge Computing Demand

You May Also Like

Top Stories

DeepSeek Launches V4, Surpassing GPT-5 and Claude in Key AI Benchmarks

Top Stories

Hugging Face Launches ML Intern, Outperforming Claude Code in Scientific Reasoning

Top Stories

Anonymous Developer Claims 235M Parameter LLM Trained on Single RTX 5080 GPU

Top Stories

Hugging Face Vulnerability Exploited to Deploy NKAbuse Blockchain Malware in RCE Attacks

Top Stories

Hugging Face Launches HoloTab Browser Agent to Enhance AI-Driven Computer Use

Top Stories

MiniMax Launches M2.7 AI Model Free, Surpassing Gemini 3.1 Pro with 229 Billion Parameters

AI Generative

MegaTrain Achieves 120B Parameter LLM Training on Single GPU, Bypassing HBM Limits

Top Stories

Hugging Face Contributes Safetensors to PyTorch Foundation to Enhance AI Security