In a significant shift for the AI industry, experts are moving away from the prevailing mindset that larger models equate to smarter outcomes. Over the past three years, the focus has been on scaling artificial intelligence systems—chasing parameter counts into the trillions. However, as the industry evolves, the emphasis is now on delivering reliable and deterministic outcomes, especially for enterprises. Red Hat has positioned itself at the forefront of this change, arguing that the most powerful technologies are those that are distributed, open, and specifically designed for their intended purposes.
Small language models (SLMs) are emerging as a key component of this transformation. While the distinction between SLMs and large language models (LLMs) has garnered attention, the architectural role these models play is becoming more important than their size. SLMs give enterprises a form of functional sovereignty: control over where a model runs, what data it sees, and how predictably it behaves. This shift marks a transition from a world dominated by conversational AI to one defined by agentic AI, in which specialized models perform the actual work of businesses.
As companies prepare for this new era, the question of how many AI agents they will employ in their operations is becoming paramount. Just as businesses once asked whether they needed an email address in 1995 or a website in 2005, by 2026 they will likely be asking, “How many agents do I have running?” The future may hold more AI agents than people in the workforce, enabling firms to deploy a diverse range of specialized agents: customer-facing agents that resolve complex logistics issues, workflow agents that automate inter-departmental processes, and headless agents that manage API calls for tasks such as inventory reconciliation and payment processing.
However, creating a sustainable fleet of agentic models will require a strategic approach, particularly in choosing the right infrastructure. Red Hat emphasizes that relying on third-party cloud services is not a sustainable solution. Instead, SLMs are positioned as a necessary tool for enterprises looking to scale effectively. They enable low-latency execution and deterministic reliability, both of which are critical for business automation.
SLMs offer several advantages over their larger counterparts. While high-parameter frontier models may provide impressive general capabilities, they often lack the speed and efficiency required for agile business operations. For example, research indicates that even a 350 million-parameter model fine-tuned on high-quality synthetic data can outperform much larger models on specific tasks such as tool-calling and API orchestration. This highlights the importance of specialization over sheer scale when building a robust agentic backend.
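To make the tool-calling specialization concrete, the sketch below shows what a single synthetic training record for such a model could look like: a natural-language request paired with the exact structured call the model should learn to emit. The tool name, schema, and field names are hypothetical, invented for illustration rather than drawn from any published dataset or Red Hat format.

```python
# Illustrative (hypothetical) synthetic training record for tool-calling
# fine-tuning: the model learns to map a natural-language request to a
# single structured API call rather than free-form prose.
import json

record = {
    "prompt": "Reconcile inventory for SKU 8841 against the warehouse feed.",
    "tools": [
        {
            "name": "reconcile_inventory",  # hypothetical tool definition
            "parameters": {"sku": "string", "source": "string"},
        }
    ],
    # Target completion: a deterministic, machine-parseable call.
    "completion": {
        "tool": "reconcile_inventory",
        "arguments": {"sku": "8841", "source": "warehouse_feed"},
    },
}

print(json.dumps(record, indent=2))
```

Thousands of such records, generated and filtered for quality, are what let a small model match or beat a frontier model on this one narrow job.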
One of the challenges enterprises face with AI implementation is non-determinism: the same input might yield different outputs. SLMs can mitigate this risk through architectural control, making it easier to ensure consistent, reliable results. Constrained decoding restricts the model's token selection at each step to outputs that conform to a formal specification, such as a JSON Schema or a context-free grammar, helping to ensure that responses are valid and machine-parseable. These techniques allow SLMs to achieve over 98% validity on structured tasks, a significant improvement for workflows that demand precision.
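A minimal, self-contained sketch of the idea follows: at each decoding step, the grammar determines which tokens are allowed, forbidden tokens are masked to negative infinity, and sampling proceeds over the rest. The toy vocabulary, grammar, and logits are illustrative assumptions, not any particular library's API.

```python
# Minimal sketch of constrained decoding via logit masking.
# Toy setup: generation is forced to produce a valid object of the form
# {"status": "ok"} or {"status": "error"}. Vocabulary, grammar, and
# logits are stand-ins, not a real tokenizer or model.
import math
import random

VOCAB = ['{"status": "', 'ok', 'error', 'maybe', '"}']
TARGETS = ['{"status": "ok"}', '{"status": "error"}']

def allowed_next(prefix: str) -> set:
    """Token ids whose addition keeps the prefix extendable to a valid target."""
    return {
        i for i, tok in enumerate(VOCAB)
        if any(t.startswith(prefix + tok) for t in TARGETS)
    }

def constrained_sample(logits, allowed):
    """Mask every grammar-forbidden token, then softmax-sample the rest."""
    masked = [l if i in allowed else -math.inf for i, l in enumerate(logits)]
    peak = max(masked)
    weights = [math.exp(l - peak) for l in masked]  # forbidden tokens -> 0.0
    return random.choices(range(len(VOCAB)), weights=weights)[0]

output = ""
fake_logits = [0.5, 2.0, 1.0, 3.0, 0.7]  # stand-in for per-step model logits
while output not in TARGETS:
    output += VOCAB[constrained_sample(fake_logits, allowed_next(output))]
print(output)  # always one of the two valid JSON objects
```

Note that the invalid token "maybe" carries the highest raw logit, yet it can never be emitted: the grammar masks it out before sampling, which is exactly how schema-guided decoding converts a probabilistic model into a reliably structured one.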
Data sovereignty also plays a crucial role in this evolving landscape. In a world where AI models will manage sensitive information such as customer relationships and proprietary code, relinquishing that data to a third-party provider can pose significant risks. By operating SLMs in-house or within a controlled hybrid cloud environment, enterprises can retain ownership of their intellectual property, maintaining a “zero trust” architecture that keeps sensitive data secure. This approach is particularly important for industries with stringent regulatory requirements, including healthcare, finance, and government.
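As a minimal sketch of what operating in-house can look like in practice, many self-hosted inference servers, vLLM among them (the engine Red Hat's AI inference stack builds on), expose an OpenAI-compatible endpoint, so application code targets a local URL instead of a third-party cloud API. The base URL, API key, and model name below are placeholder assumptions.

```python
# Minimal sketch: pointing an OpenAI-compatible client at a self-hosted
# SLM endpoint (e.g. one served by vLLM) instead of a third-party cloud.
# The base_url, api_key, and model name are placeholder assumptions.
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:8000/v1",  # in-house inference server
    api_key="unused-for-local",           # local servers typically ignore this
)

response = client.chat.completions.create(
    model="my-org/inventory-slm",  # hypothetical fine-tuned SLM
    messages=[
        {"role": "user", "content": "List SKUs with unresolved payment exceptions."}
    ],
)
print(response.choices[0].message.content)
```

Because the data never leaves the enterprise's own network boundary, this pattern is what makes the "zero trust" posture described above workable for regulated industries.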
Looking ahead, the AI landscape is poised for a dramatic transformation. As enterprises transition from generative AI—primarily focused on conversation and content creation—to agentic AI that actively takes action, the focus will shift from the sheer size of AI models to the reliability and security of the infrastructure that supports them. The traditional “black box” cloud models may no longer suffice as businesses increasingly recognize the necessity for sovereignty, speed, and precision in their AI operations.
Red Hat firmly believes that the path forward is one defined by openness and adaptability. By leveraging curated small language models that can be fine-tuned, served, and orchestrated with the Red Hat AI portfolio, companies can effectively integrate AI into their core business functions. As the industry moves at a rapid pace, the imperative is clear: stop chasing the giants and start building a robust backbone for the future of AI, one that is small, fast, and grounded in open hybrid cloud technology.