Connect with us

Hi, what are you looking for?

Top Stories

Xiaomi Launches MiMo-V2.5 Series, Achieving 50% Token Efficiency Gain Over Competitors

Xiaomi’s MiMo-V2.5 series achieves a groundbreaking 50% token efficiency gain over competitors while introducing advanced models for intelligent agent applications.

Xiaomi has unveiled significant advancements to its MiMo large model series just one month after its last update, notably achieving a 42% reduction in token usage compared to its competitor, Kimi K2.6. The announcement was reported by Zhidx on April 23, highlighting the introduction of four new models: the flagship inference model MiMo-V2.5, the full-modal Agent model V2.5-Pro, which is currently in public testing and will soon be open-sourced, as well as the upcoming V2.5-TTS Series and V2.5-ASR.

Leading the MiMo project is Luo Fuli, a prominent industry figure previously associated with DeepSeek. Since the last major update of the MiMo-V2 series, Luo has indicated the intention to open-source the model once it demonstrates sufficient stability. The entire MiMo-V2.5 series is tailored for intelligent agent applications. The MiMo-V2.5-Pro model focuses on complex Agent tasks, while the standard MiMo-V2.5 addresses general agent scenarios.

Xiaomi has also provided a comprehensive usage guide, noting that MiMo-V2.5 features native full-modal capabilities, enabling it to process images, audio, and video. Compared to the Pro version, it offers faster inference speeds, making it more suitable for latency-sensitive tasks. The enhanced model not only boasts performance upgrades but also significantly improved token efficiency. In achieving comparable results on the ClawEval benchmark for intelligent agents, MiMo-V2.5-Pro outperformed Kimi K2.6 by saving 42% on tokens, while MiMo-V2.5 achieved a 50% reduction compared to Meta’s Muse Spark, a closed-source model released earlier this month.

Furthermore, Xiaomi has revamped its Token Plan, eliminating the previous 4x Credits billing method and removing the differentiation between 256k and 1M context for billing purposes. The new plan includes exclusive discounted rates for nighttime usage and an auto-renewal option, addressing prior user complaints regarding high costs and insufficient token allocations.

In a practical demonstration, Zhidx tasked MiMo-V2.5-Pro with creating a 3D side-scrolling fighting game. The model successfully generated 1,123 lines of code in a matter of minutes, resulting in a playable “Dragon and Tiger Fighting Game.” Although the interface included essential features like health bars and countdown timers, it was noted that character models were simplistic.

Earlier in March, Xiaomi’s MiMo-V2-Pro appeared on the OpenRouter platform under the moniker Hunter Alpha, leading to speculation about its relation to DeepSeek’s anticipated V4 model. This latest release aligns with expectations that DeepSeek V4 will also be announced this week.

Performance and Technical Capabilities

Xiaomi claims that MiMo-V2.5-Pro represents the pinnacle of its MiMo lineup, particularly in handling complex intelligent agent tasks. Internal tests indicate that it can execute extensive tasks, involving nearly a thousand tool calls, while demonstrating improved instruction-following capabilities. The model’s performance has drawn comparisons with leading global agents, scoring 73.7 on the MiMo Coding Bench, closely trailing Claude Opus 4.6, which scored 77.1.

In one notable test, a Twitter user queried MiMo-V2.5-Pro about whether to walk or drive to a car wash located 50 meters away, to which the model provided the accurate response. Xiaomi has also showcased several use cases for MiMo-V2.5-Pro, including the construction of a complete compiler in Rust, a task typically requiring weeks for undergraduate students, which MiMo-V2.5-Pro completed in just 4.3 hours. In another case, it developed a multi-functional video editing application in 11.5 hours, generating over 8,000 lines of code.

In the field of analog circuit design, the model successfully optimized a voltage regulator design, a task conventionally taking days, achieving results in under an hour through iterative simulations. These accomplishments underscore Xiaomi’s ambition to enhance the capabilities and applications of its AI models.

MiMo-V2.5 is distinguished as a native full-modal model, designed to facilitate simultaneous processing of visual, auditory, and textual information. This model surpassed its predecessor, MiMo-V2-Pro, in agent capabilities and demonstrated substantial improvements in multi-modal perception, achieving lower API costs. Benchmark tests revealed that MiMo-V2.5 competes closely with other leading models, including Claude Opus 4.6 and GPT-5.4, particularly in programming tasks.

As part of the broader evolution of its AI offerings, Xiaomi’s Token Plan presents an updated billing structure that eliminates previous complexities, further enhancing user accessibility. The new plan reflects the company’s commitment to balancing cost efficiency with advanced technological capabilities, making the MiMo series more appealing to developers and businesses alike.

With these advancements, Xiaomi positions itself as a significant player in the rapidly evolving field of AI, indicating a future where intelligent agents become integral to various applications across industries. The release of MiMo-V2.5 not only marks a technological leap for Xiaomi but also raises the competitive stakes among AI model providers.

See also
Staff
Written By

The AiPressa Staff team brings you comprehensive coverage of the artificial intelligence industry, including breaking news, research developments, business trends, and policy updates. Our mission is to keep you informed about the rapidly evolving world of AI technology.

You May Also Like

Top Stories

DeepSeek's V4 open-source model undercuts GPT-5.5 and Claude Opus 4.7 with costs of $1.74 per million tokens, promising a disruptive shift in AI pricing...

AI Technology

US lawmakers initiate a probe into PRC-developed AI systems, citing national security risks and potential exploitation of American innovations by companies like DeepSeek and...

AI Generative

DeepSeek unveils V4 AI model with advanced reasoning and agentic capabilities, outperforming OpenAI's GPT-5.2 while integrating Huawei chips for enhanced autonomy.

Top Stories

Anuma launches a privacy-first AI platform allowing users access to 10 leading models with a unique encrypted memory, enhancing data control and context retention.

Top Stories

DeepSeek's V4-Pro eclipses GPT-5 and Claude in key benchmarks, achieving a Codeforces rating of 3,206 while undercutting OpenAI's costs by 89% per million tokens.

AI Technology

DeepSeek unveils its 1.6 trillion parameter V4 model optimized for Huawei chips, priced at $3.48 per million tokens, amid U.S. IP theft allegations.

Top Stories

OpenAI slashes token prices to $5, pressuring Anthropic’s premium Claude Opus model as competition intensifies in the AI market.

Top Stories

DeepSeek's DeepSeek-V4 model, boasting 1.6 trillion parameters, outperforms Claude Opus 4.6, achieving top benchmarks with 1/3.7th the processing time.

© 2025 AIPressa · Part of Buzzora Media · All rights reserved. This website provides general news and educational content for informational purposes only. While we strive for accuracy, we do not guarantee the completeness or reliability of the information presented. The content should not be considered professional advice of any kind. Readers are encouraged to verify facts and consult appropriate experts when needed. We are not responsible for any loss or inconvenience resulting from the use of information on this site. Some images used on this website are generated with artificial intelligence and are illustrative in nature. They may not accurately represent the products, people, or events described in the articles.