AI Technology

Meta Unveils KernelEvolve, Boosting AI Model Throughput by 60% with Automated Optimization

Meta’s new KernelEvolve system automates kernel optimization, boosting AI model throughput by over 60%, revolutionizing performance across diverse hardware platforms.

Staff

Published

2 hours ago

Meta Platforms Inc. has unveiled a groundbreaking system aimed at optimizing the performance of artificial intelligence (AI) models across diverse hardware platforms, significantly enhancing the speed at which these models can operate. The new system, dubbed KernelEvolve, is part of the company’s ongoing effort to streamline its ads ranking capabilities and tackle the growing complexity of AI model deployment across its infrastructure. With the rise of heterogeneous hardware, including NVIDIA and AMD GPUs, along with its proprietary MTIA silicon chips, Meta faced a bottleneck in kernel optimization, which was previously reliant on expert engineering efforts spanning weeks.

KernelEvolve addresses these challenges by using a search-based approach to kernel optimization. By treating the kernel creation process as a structured search problem, the system autonomously generates and refines multiple kernel candidates in a fraction of the time it would take human experts. This automated process not only accelerates development but also enhances performance, achieving over a 60% increase in inference throughput for the Andromeda Ads model on NVIDIA GPUs and a more than 25% improvement for ads models on Meta’s MTIA silicon.

As the complexity of AI models grows, the challenge of optimizing kernels for various hardware configurations becomes increasingly daunting. KernelEvolve effectively scales the optimization process across a multitude of models and hardware types, generating kernels in high-level domain-specific languages (DSLs) and translating them into lower-level programming languages like CUDA and MTIA C++. This versatility ensures that it can adapt to the specific requirements of different hardware architectures and model families.

The system’s architecture integrates several advanced technologies. A retrieval-augmented knowledge base injects relevant platform-specific documentation into the kernel generation process, allowing the underlying large language model (LLM) to generate code optimized for hardware it has never encountered before. This is particularly crucial for proprietary chips like the MTIA, where standard coding assistants lack the necessary context for optimization. KernelEvolve’s ability to draw on this knowledge dynamically ensures that the system remains flexible and capable of evolving alongside new hardware developments.

KernelEvolve operates through a closed-loop evaluation framework that rigorously tests each generated kernel for both correctness and performance. By employing a suite of profiling tools, the system doesn’t just determine which kernel is faster; it provides insights into why one kernel performs better than another, informing subsequent iterations of kernel generation. This comprehensive evaluation mechanism enables KernelEvolve to continuously improve its output by learning from the performance of previous candidates.

The impact of KernelEvolve has already been demonstrated through substantial performance gains in both benchmark testing and real-world applications. Achieving a 100% pass rate on the KernelBench suite, with all generated kernels outperforming their PyTorch reference implementations, exemplifies its effectiveness. In production settings, the improvement in throughput not only enhances Meta’s operational efficiency but also translates into better service delivery for billions of daily inference requests.

As Meta continues to expand its portfolio of AI models and hardware platforms, KernelEvolve represents a significant advancement in how the company approaches kernel optimization. This system allows Meta to keep pace with the rapid evolution of both AI technologies and hardware capabilities, ultimately fostering innovation in machine learning applications. The success of KernelEvolve underscores the potential of AI-driven automation in optimizing performance-critical tasks and sets a new standard for engineering efficiency in the tech industry.

AI Technology

OpenAI Secures $122 Billion Funding, Achieves $852 Billion Valuation Amid AI Costs Surge

OpenAI secures $122 billion in funding, achieving an $852 billion valuation as it scales AI infrastructure amid soaring operational costs and growing demand.

Staff5 hours ago

AI Generative

Microsoft Launches Three Advanced AI Foundational Models to Compete with Rivals

Microsoft boosts its AI leadership with three new models, including Copilot AI for coding, Insights AI for data analysis, and Conversational AI for enhanced...

Staff8 hours ago

AI Technology

Nvidia, Digital Realty, Credo Technology Set to Benefit from $700 Billion AI Capex Surge

Nvidia, Digital Realty, and Credo Technology are positioned to capitalize on a $700 billion AI infrastructure boom as major tech firms ramp up investments.

Staff8 hours ago

Oracle’s Stock Slips 0.9% as AI Expansion Raises Dividend Stability Concerns

Oracle's shares fall 0.9% to $138.40 as rising AI infrastructure costs raise concerns over long-term dividend sustainability amid negative free cash flow.

Staff9 hours ago

AI Generative

Identify AI-Generated Videos: 6 Key Signs Everyone Should Know Now

As AI-generated videos surge, platforms like Meta and YouTube enforce transparency with tagging and labeling to combat misinformation and enhance viewer discernment.

Staff12 hours ago

AIPRESSA.COM

AI Technology

Meta Unveils KernelEvolve, Boosting AI Model Throughput by 60% with Automated Optimization

Trending

Top Stories

Albania Appoints AI Bot Minister Diella Amid Corruption Concerns and EU Membership Goals

AI Cybersecurity

Endpoint Security Market to Reach $23.9B by 2030 with 7.2% CAGR Amid Rising Cyber Threats

AI Government

BigBear.ai Launches Biometric Platform at O’Hare, Acquires Generative AI Ask Sage for $250M

AI Business

Enterprise Architecture Shifts to Strategic Enabler in AI-Driven Business Models

AI Technology

AI Hardware Market Grows 30% in 2025, Driven by Generative AI and Edge Computing Demand

You May Also Like

AI Technology

OpenAI Secures $122 Billion Funding, Achieves $852 Billion Valuation Amid AI Costs Surge

AI Generative

Microsoft Launches Three Advanced AI Foundational Models to Compete with Rivals

AI Technology

Nvidia, Digital Realty, Credo Technology Set to Benefit from $700 Billion AI Capex Surge

Top Stories

Oracle’s Stock Slips 0.9% as AI Expansion Raises Dividend Stability Concerns

AI Generative

Identify AI-Generated Videos: 6 Key Signs Everyone Should Know Now

AI Technology

Nvidia Invests $2 Billion in Marvell to Enhance Custom AI Infrastructure Solutions

Top Stories

Meta Launches Elite AI Research Team for Enhanced Social App Recommendations

Top Stories

Malaysia Boosts AI Ecosystem with 900 Start-Ups and Stricter Governance Frameworks