Connect with us

Hi, what are you looking for?

AI Technology

Meta Unveils KernelEvolve, Boosting AI Model Throughput by 60% with Automated Optimization

Meta’s new KernelEvolve system automates kernel optimization, boosting AI model throughput by over 60%, revolutionizing performance across diverse hardware platforms.

Meta Platforms Inc. has unveiled a groundbreaking system aimed at optimizing the performance of artificial intelligence (AI) models across diverse hardware platforms, significantly enhancing the speed at which these models can operate. The new system, dubbed KernelEvolve, is part of the company’s ongoing effort to streamline its ads ranking capabilities and tackle the growing complexity of AI model deployment across its infrastructure. With the rise of heterogeneous hardware, including NVIDIA and AMD GPUs, along with its proprietary MTIA silicon chips, Meta faced a bottleneck in kernel optimization, which was previously reliant on expert engineering efforts spanning weeks.

KernelEvolve addresses these challenges by using a search-based approach to kernel optimization. By treating the kernel creation process as a structured search problem, the system autonomously generates and refines multiple kernel candidates in a fraction of the time it would take human experts. This automated process not only accelerates development but also enhances performance, achieving over a 60% increase in inference throughput for the Andromeda Ads model on NVIDIA GPUs and a more than 25% improvement for ads models on Meta’s MTIA silicon.

As the complexity of AI models grows, the challenge of optimizing kernels for various hardware configurations becomes increasingly daunting. KernelEvolve effectively scales the optimization process across a multitude of models and hardware types, generating kernels in high-level domain-specific languages (DSLs) and translating them into lower-level programming languages like CUDA and MTIA C++. This versatility ensures that it can adapt to the specific requirements of different hardware architectures and model families.

The system’s architecture integrates several advanced technologies. A retrieval-augmented knowledge base injects relevant platform-specific documentation into the kernel generation process, allowing the underlying large language model (LLM) to generate code optimized for hardware it has never encountered before. This is particularly crucial for proprietary chips like the MTIA, where standard coding assistants lack the necessary context for optimization. KernelEvolve’s ability to draw on this knowledge dynamically ensures that the system remains flexible and capable of evolving alongside new hardware developments.

KernelEvolve operates through a closed-loop evaluation framework that rigorously tests each generated kernel for both correctness and performance. By employing a suite of profiling tools, the system doesn’t just determine which kernel is faster; it provides insights into why one kernel performs better than another, informing subsequent iterations of kernel generation. This comprehensive evaluation mechanism enables KernelEvolve to continuously improve its output by learning from the performance of previous candidates.

The impact of KernelEvolve has already been demonstrated through substantial performance gains in both benchmark testing and real-world applications. Achieving a 100% pass rate on the KernelBench suite, with all generated kernels outperforming their PyTorch reference implementations, exemplifies its effectiveness. In production settings, the improvement in throughput not only enhances Meta’s operational efficiency but also translates into better service delivery for billions of daily inference requests.

As Meta continues to expand its portfolio of AI models and hardware platforms, KernelEvolve represents a significant advancement in how the company approaches kernel optimization. This system allows Meta to keep pace with the rapid evolution of both AI technologies and hardware capabilities, ultimately fostering innovation in machine learning applications. The success of KernelEvolve underscores the potential of AI-driven automation in optimizing performance-critical tasks and sets a new standard for engineering efficiency in the tech industry.

See also
Staff
Written By

The AiPressa Staff team brings you comprehensive coverage of the artificial intelligence industry, including breaking news, research developments, business trends, and policy updates. Our mission is to keep you informed about the rapidly evolving world of AI technology.

You May Also Like

AI Technology

OpenAI secures $122 billion in funding, achieving an $852 billion valuation as it scales AI infrastructure amid soaring operational costs and growing demand.

AI Generative

Microsoft boosts its AI leadership with three new models, including Copilot AI for coding, Insights AI for data analysis, and Conversational AI for enhanced...

AI Technology

Nvidia, Digital Realty, and Credo Technology are positioned to capitalize on a $700 billion AI infrastructure boom as major tech firms ramp up investments.

Top Stories

Oracle's shares fall 0.9% to $138.40 as rising AI infrastructure costs raise concerns over long-term dividend sustainability amid negative free cash flow.

AI Generative

As AI-generated videos surge, platforms like Meta and YouTube enforce transparency with tagging and labeling to combat misinformation and enhance viewer discernment.

AI Technology

Nvidia invests $2 billion in Marvell to create advanced AI infrastructure, enhancing custom silicon solutions amid a projected $630 billion industry push this year.

Top Stories

Meta invests $600 billion in AI by forming the elite MRS Research team, led by Yang Song, to enhance engagement across its social apps.

Top Stories

Malaysia targets 900 AI start-ups as it strengthens its governance framework, positioning itself as a regional digital hub amid global tech investments.

© 2025 AIPressa · Part of Buzzora Media · All rights reserved. This website provides general news and educational content for informational purposes only. While we strive for accuracy, we do not guarantee the completeness or reliability of the information presented. The content should not be considered professional advice of any kind. Readers are encouraged to verify facts and consult appropriate experts when needed. We are not responsible for any loss or inconvenience resulting from the use of information on this site. Some images used on this website are generated with artificial intelligence and are illustrative in nature. They may not accurately represent the products, people, or events described in the articles.