Connect with us

Hi, what are you looking for?

AI Generative

Meta Launches Muse Spark, Outperforming GPT-5.4 in Health AI Benchmark by 2%

Meta launches Muse Spark, outperforming GPT-5.4 by over 2% in health AI benchmarks while cutting computational power by an order of magnitude.

Meta Platforms Inc. unveiled its latest reasoning model, Muse Spark, today, showcasing its capabilities in answering health-related inquiries and analyzing multimodal data. The company plans to integrate this advanced algorithm into its consumer-oriented Meta AI service over the coming weeks while also offering it to developers through a private preview of its application programming interface (API).

Muse Spark has demonstrated superior performance compared to competitors, including Claude 4.6 Opus, Gemini 3.1 Pro, and GPT 5.4, across multiple benchmarks. Notably, it exceeded GPT 5.4’s score by over 2% in the HealthBench Hard evaluation, which assesses AI models’ proficiency in addressing medical questions. This achievement is attributed to a clinical training dataset developed in collaboration with more than 1,000 physicians as part of a comprehensive overhaul of Meta’s AI development processes.

In a blog post, Meta emphasized that Muse Spark operates with significantly less computational power than its predecessor, Llama 4 Maverick, stating, “We can reach the same capabilities with over an order of magnitude less compute than our previous model.” This efficiency positions Muse Spark as a more sustainable alternative to existing baseline models.

Besides health-related inquiries, Muse Spark reportedly excels in scientific chart analysis, outpacing Opus 4.6 and other competitors on the CharXiv Reasoning benchmark, which evaluates technical graphs. This visual reasoning capability is applicable to other scenarios, such as allowing users of the Meta AI app to upload images of grocery store shelves for calorie count estimations of food items.

Meta’s assessments of Muse Spark spanned more than half a dozen benchmarks, showing that it was competitive with leading models like Opus 4.6, Gemini 3.1 Pro, and GPT 5.4. In various evaluations, Muse Spark outperformed at least one of its rivals, covering diverse use cases, including code generation, robot navigation, and tool utilization.

To enhance output quality, users can activate a feature known as Contemplating mode, which launches multiple AI agents to decompose tasks into smaller components executed in parallel. Meta claimed this innovation improved Muse Spark’s score on HLE, a challenging benchmark in the AI sector, by approximately 8%.

Muse Spark is the first of a planned series of multimodal reasoning models. In its blog post, Meta expressed optimism about further developments, stating, “We’re on a predictable and efficient scaling trajectory. We look forward to sharing increasingly capable models on the path to personal superintelligence soon.”

This introduction comes at a time when AI technologies are rapidly evolving, with companies across various sectors racing to improve their capabilities. As competition intensifies, innovations like Muse Spark may redefine standards for AI performance, especially in specialized fields such as healthcare and data analysis. The ability to tackle complex queries efficiently and effectively could significantly impact how consumers and professionals leverage AI in their daily lives.

See also
Staff
Written By

The AiPressa Staff team brings you comprehensive coverage of the artificial intelligence industry, including breaking news, research developments, business trends, and policy updates. Our mission is to keep you informed about the rapidly evolving world of AI technology.

You May Also Like

Top Stories

Meta Platforms, led by Alexandr Wang, pivots to a partial open-source AI model strategy to enhance user access while addressing safety concerns amidst fierce...

Top Stories

Meta cuts 200 jobs as part of a $10B investment in AI infrastructure, aiming to boost efficiency and reposition itself for long-term growth in...

AI Technology

Hyperscale giants like Google and AWS are transitioning to Arm CPUs, predicting a 90% adoption in custom AI servers by 2029, up from 25%...

Top Stories

Meta's upcoming Ray-Ban smart glasses will feature AI-driven food logging and advice, raising serious concerns over privacy and mental health impacts.

AI Technology

Meta's new KernelEvolve system automates kernel optimization, boosting AI model throughput by over 60%, revolutionizing performance across diverse hardware platforms.

Top Stories

Meta invests $600 billion in AI by forming the elite MRS Research team, led by Yang Song, to enhance engagement across its social apps.

AI Research

Meta assembles a top-tier AI team, led by VP Yang Song, to revolutionize Facebook and Instagram algorithms amid fierce competition for ad revenue.

AI Technology

A Quinnipiac poll reveals 55% of Americans fear AI will harm jobs and education, as tech giants invest $650 billion in AI infrastructure this...

© 2025 AIPressa · Part of Buzzora Media · All rights reserved. This website provides general news and educational content for informational purposes only. While we strive for accuracy, we do not guarantee the completeness or reliability of the information presented. The content should not be considered professional advice of any kind. Readers are encouraged to verify facts and consult appropriate experts when needed. We are not responsible for any loss or inconvenience resulting from the use of information on this site. Some images used on this website are generated with artificial intelligence and are illustrative in nature. They may not accurately represent the products, people, or events described in the articles.