Leanstral Launches as First Open-Source Code Agent for Lean 4 with Superior Efficiency

Leanstral launches as the first open-source code agent for Lean 4, with 6 billion active parameters and an FLTEval pass@2 score of 26.3 at a cost of $36, ahead of Claude Sonnet at a fraction of the price.

Staff

Published

16 March, 2026

AI agents have increasingly become essential tools in code generation; however, their deployment in high-stakes domains such as advanced research mathematics and critical software development often reveals a significant hurdle: the necessity of human review. The extensive time and specialized expertise required to manually verify outputs have emerged as the primary bottleneck to engineering velocity.

In response to these challenges, a new generation of coding agents is being envisioned: agents that not only execute tasks but also formally prove their implementations against strict specifications. The aim is to let users spend their time defining requirements rather than debugging machine-generated logic. In this context, the launch of Leanstral, an open-source code agent specifically designed for Lean 4, marks a significant milestone.

Leanstral aims to address the limitations of existing proving systems, which typically either act as wrappers around vast generalist models or concentrate on isolated mathematical problems. With 6 billion active parameters, Leanstral is specifically optimized for efficiency and tailored for realistic formal repositories, positioning itself as a robust alternative in the domain of proof engineering.

Available under an Apache 2.0 license, Leanstral can be accessed in an agent mode via Mistral Vibe or through a free API endpoint. The developers also plan to release a comprehensive technical report detailing their training methods and the FLTEval evaluation suite, which aims to broaden the scope of assessments beyond competitive mathematics.

Leanstral’s architecture employs a highly sparse design optimized for proof engineering tasks. By leveraging parallel inference and using Lean as a reliable verifier, the model delivers strong performance at lower cost than existing closed-source competitors. It also supports arbitrary MCPs (Model Context Protocol servers) and is specifically designed for maximal performance with the frequently used lean-lsp-mcp.
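To make the idea of parallel inference with a verifier concrete, here is a minimal sketch. It is not Leanstral's actual pipeline: the function `first_verified` and the toy `toy_generate` and `toy_verify` stand-ins are hypothetical, and a real setup would call a model for generation and the Lean compiler for verification.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Optional


def first_verified(
    generate: Callable[[int], str],
    verify: Callable[[str], bool],
    k: int,
) -> Optional[str]:
    """Sample k candidates in parallel and return the first one
    (in sample order) that the verifier accepts, or None."""
    with ThreadPoolExecutor(max_workers=k) as pool:
        candidates = list(pool.map(generate, range(k)))
    for candidate in candidates:
        if verify(candidate):
            return candidate
    return None


# Toy stand-ins for illustration only (assumptions, not a real API).
def toy_generate(seed: int) -> str:
    # Pretend that only the third sample produces a complete proof.
    if seed == 2:
        return "theorem t : 1 + 1 = 2 := rfl"
    return "theorem t : 1 + 1 = 2 := sorry"


def toy_verify(proof: str) -> bool:
    # A real verifier would compile the file with Lean; here an
    # unfinished proof is simply one that still contains `sorry`.
    return "sorry" not in proof


if __name__ == "__main__":
    print(first_verified(toy_generate, toy_verify, k=4))
```

Because the verifier is mechanical, extra samples can only help: a wrong candidate is rejected rather than silently accepted, which is what makes pass@k-style scaling meaningful for proofs.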

Evaluation results point to Leanstral’s strength in practical proof engineering. The model is benchmarked on completing formal proofs and correctly defining new mathematical concepts in pull requests to the FLT project. Compared with leading coding agents and open-source models, Leanstral shows significant efficiency advantages. For instance, models like GLM5 and Kimi-K2.5 struggled to scale, with their FLTEval scores capped at approximately 16.6 and 20.1, respectively, whereas Leanstral scored 26.3 at pass@2, a higher score for less computational investment.

Against the Claude family of models, Leanstral is the cost-effective option. Its pass@2 score of 26.3 beats Sonnet by 2.6 points at a cost of $36, compared with Sonnet’s $549. At pass@16, Leanstral reaches 31.9, about 8 points ahead of Sonnet. Claude Opus 4.6 remains the benchmark for quality, but at $1,650 it costs roughly 46 times Leanstral’s $36 pass@2 run (the 92-times figure corresponds to a single Leanstral pass at about half that cost).
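The cost comparison can be checked with a few lines of arithmetic. The sketch below uses only the dollar figures quoted above; the single-pass cost is an assumption (half of the pass@2 cost), not a number stated in the article.

```python
# Cost figures quoted in the article (USD).
LEANSTRAL_PASS2_COST = 36.0
SONNET_COST = 549.0
OPUS_COST = 1650.0

# Assumption (not stated in the article): one pass costs half of pass@2.
LEANSTRAL_SINGLE_PASS_COST = LEANSTRAL_PASS2_COST / 2


def cost_ratio(other: float, base: float) -> float:
    """How many times more expensive `other` is than `base`."""
    return other / base


sonnet_vs_pass2 = cost_ratio(SONNET_COST, LEANSTRAL_PASS2_COST)
opus_vs_pass2 = cost_ratio(OPUS_COST, LEANSTRAL_PASS2_COST)
opus_vs_single = cost_ratio(OPUS_COST, LEANSTRAL_SINGLE_PASS_COST)

if __name__ == "__main__":
    print(f"Sonnet vs Leanstral pass@2:    {sonnet_vs_pass2:.2f}x")
    print(f"Opus vs Leanstral pass@2:      {opus_vs_pass2:.2f}x")
    print(f"Opus vs Leanstral single pass: {opus_vs_single:.2f}x")
```

Under that assumption, Opus comes out at about 46 times the pass@2 cost and about 92 times the single-pass cost, so the multiplier depends on which Leanstral configuration is used as the baseline.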

Real-world applications of Leanstral underscore its practical utility. In one case study, the model was tasked with resolving a compilation issue arising from a recent update in Lean. It not only diagnosed the problem but also proposed a straightforward fix, illustrating its capacity to assist users effectively.

Leanstral also translated program definitions into Lean, demonstrating its versatility. In one example, the model converted definitions from Rocq and then proved properties about the resulting programs, showing that it can reason about complex programming constructs.
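As an illustration of what such a translation involves, here is a small Lean 4 sketch. It is not the example from the announcement: the Rocq definition in the comment, the name `myAppend`, and the theorem are all hypothetical, and the proof assumes a recent Lean 4 toolchain where the `omega` tactic is available without extra imports.

```lean
-- Hypothetical Rocq source, shown for comparison:
--   Fixpoint my_append (xs ys : list nat) : list nat :=
--     match xs with
--     | nil => ys
--     | cons x xs' => cons x (my_append xs' ys)
--     end.

-- The same definition written in Lean 4.
def myAppend : List Nat → List Nat → List Nat
  | [], ys => ys
  | x :: xs, ys => x :: myAppend xs ys

-- A property about the program: appending adds the lengths.
theorem myAppend_length (xs ys : List Nat) :
    (myAppend xs ys).length = xs.length + ys.length := by
  induction xs with
  | nil => simp [myAppend]
  | cons x xs ih =>
    -- Unfold one step, rewrite with the induction hypothesis,
    -- then let `omega` finish the remaining linear arithmetic.
    simp only [myAppend, List.length_cons, ih]
    omega
```

Once the file compiles, Lean has checked the theorem, so a reviewer only needs to read the statement, not audit the proof steps.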

Leanstral is now available for public use and requires no extensive setup. Users can access the model through Mistral Vibe or, for a limited time, through the free Labs API endpoint, and their feedback will help refine future iterations. The model can also be downloaded under the Apache 2.0 license.

As the importance of verified code continues to grow, Leanstral’s innovative approach offers a promising avenue for enhancing the speed and accuracy of software development processes, potentially reshaping how coding agents are utilized across various industries.

AIPRESSA.COM
