Connect with us

Hi, what are you looking for?

AI Generative

Meta Launches Muse Spark, Outperforming GPT-5.4 in Health AI Benchmark by 2%

Meta launches Muse Spark, outperforming GPT-5.4 by over 2% in health AI benchmarks while cutting computational power by an order of magnitude.

Meta Platforms Inc. unveiled its latest reasoning model, Muse Spark, today, showcasing its capabilities in answering health-related inquiries and analyzing multimodal data. The company plans to integrate this advanced algorithm into its consumer-oriented Meta AI service over the coming weeks while also offering it to developers through a private preview of its application programming interface (API).

Muse Spark has demonstrated superior performance compared to competitors, including Claude 4.6 Opus, Gemini 3.1 Pro, and GPT 5.4, across multiple benchmarks. Notably, it exceeded GPT 5.4’s score by over 2% in the HealthBench Hard evaluation, which assesses AI models’ proficiency in addressing medical questions. This achievement is attributed to a clinical training dataset developed in collaboration with more than 1,000 physicians as part of a comprehensive overhaul of Meta’s AI development processes.

In a blog post, Meta emphasized that Muse Spark operates with significantly less computational power than its predecessor, Llama 4 Maverick, stating, “We can reach the same capabilities with over an order of magnitude less compute than our previous model.” This efficiency positions Muse Spark as a more sustainable alternative to existing baseline models.

Besides health-related inquiries, Muse Spark reportedly excels in scientific chart analysis, outpacing Opus 4.6 and other competitors on the CharXiv Reasoning benchmark, which evaluates technical graphs. This visual reasoning capability is applicable to other scenarios, such as allowing users of the Meta AI app to upload images of grocery store shelves for calorie count estimations of food items.

Meta’s assessments of Muse Spark spanned more than half a dozen benchmarks, showing that it was competitive with leading models like Opus 4.6, Gemini 3.1 Pro, and GPT 5.4. In various evaluations, Muse Spark outperformed at least one of its rivals, covering diverse use cases, including code generation, robot navigation, and tool utilization.

To enhance output quality, users can activate a feature known as Contemplating mode, which launches multiple AI agents to decompose tasks into smaller components executed in parallel. Meta claimed this innovation improved Muse Spark’s score on HLE, a challenging benchmark in the AI sector, by approximately 8%.

Muse Spark is the first of a planned series of multimodal reasoning models. In its blog post, Meta expressed optimism about further developments, stating, “We’re on a predictable and efficient scaling trajectory. We look forward to sharing increasingly capable models on the path to personal superintelligence soon.”

This introduction comes at a time when AI technologies are rapidly evolving, with companies across various sectors racing to improve their capabilities. As competition intensifies, innovations like Muse Spark may redefine standards for AI performance, especially in specialized fields such as healthcare and data analysis. The ability to tackle complex queries efficiently and effectively could significantly impact how consumers and professionals leverage AI in their daily lives.

See also
Staff
Written By

The AiPressa Staff team brings you comprehensive coverage of the artificial intelligence industry, including breaking news, research developments, business trends, and policy updates. Our mission is to keep you informed about the rapidly evolving world of AI technology.

You May Also Like

AI Generative

Moonshot AI releases Kimi-K2.6, an open-source LLM surpassing GPT-5.4 with 1 trillion parameters and achieving a benchmark score of 54 on the challenging HLE-Full...

Top Stories

Meta considers 8,000 job cuts to fund AI investments as its stock rises 5.86% year-to-date, aiming to redefine its business model and enhance efficiency

Top Stories

Meta's Muse Spark AI model launches with deep integration across Instagram, WhatsApp, and Facebook, boosting shares by 6% amid $72B investment in AI innovation.

Top Stories

MiniMax's M2.7 AI model achieves 56.22% on SWE-Pro benchmarks but restricts commercial use through new licensing, raising concerns among developers.

Top Stories

Meta’s Muse Spark launch boosts stock nearly 10% as the company aims to compete in the AI race, generating $201B revenue in 2025 with...

AI Generative

Google's Android Bench ranks OpenAI's GPT 5.4 and Gemini 3.1 Pro Preview at 72.4%, establishing them as top AI models for Android app development.

Top Stories

Meta unveils Muse Spark AI model to boost reasoning and multimodal functions, enhancing user interactions across platforms like Instagram and WhatsApp.

AI Generative

Meta unveils Muse Spark, its latest AI model, achieving a 58% score on Humanity's Last Exam while enhancing multimodal reasoning and multi-agent workflows.

© 2025 AIPressa · Part of Buzzora Media · All rights reserved. This website provides general news and educational content for informational purposes only. While we strive for accuracy, we do not guarantee the completeness or reliability of the information presented. The content should not be considered professional advice of any kind. Readers are encouraged to verify facts and consult appropriate experts when needed. We are not responsible for any loss or inconvenience resulting from the use of information on this site. Some images used on this website are generated with artificial intelligence and are illustrative in nature. They may not accurately represent the products, people, or events described in the articles.