Connect with us

Hi, what are you looking for?

AI Generative

Meta Launches Muse Spark, Outperforming GPT-5.4 in Health AI Benchmark by 2%

Meta launches Muse Spark, outperforming GPT-5.4 by over 2% in health AI benchmarks while cutting computational power by an order of magnitude.

Meta Platforms Inc. unveiled its latest reasoning model, Muse Spark, today, showcasing its capabilities in answering health-related inquiries and analyzing multimodal data. The company plans to integrate this advanced algorithm into its consumer-oriented Meta AI service over the coming weeks while also offering it to developers through a private preview of its application programming interface (API).

Muse Spark has demonstrated superior performance compared to competitors, including Claude 4.6 Opus, Gemini 3.1 Pro, and GPT 5.4, across multiple benchmarks. Notably, it exceeded GPT 5.4’s score by over 2% in the HealthBench Hard evaluation, which assesses AI models’ proficiency in addressing medical questions. This achievement is attributed to a clinical training dataset developed in collaboration with more than 1,000 physicians as part of a comprehensive overhaul of Meta’s AI development processes.

In a blog post, Meta emphasized that Muse Spark operates with significantly less computational power than its predecessor, Llama 4 Maverick, stating, “We can reach the same capabilities with over an order of magnitude less compute than our previous model.” This efficiency positions Muse Spark as a more sustainable alternative to existing baseline models.

Besides health-related inquiries, Muse Spark reportedly excels in scientific chart analysis, outpacing Opus 4.6 and other competitors on the CharXiv Reasoning benchmark, which evaluates technical graphs. This visual reasoning capability is applicable to other scenarios, such as allowing users of the Meta AI app to upload images of grocery store shelves for calorie count estimations of food items.

Meta’s assessments of Muse Spark spanned more than half a dozen benchmarks, showing that it was competitive with leading models like Opus 4.6, Gemini 3.1 Pro, and GPT 5.4. In various evaluations, Muse Spark outperformed at least one of its rivals, covering diverse use cases, including code generation, robot navigation, and tool utilization.

To enhance output quality, users can activate a feature known as Contemplating mode, which launches multiple AI agents to decompose tasks into smaller components executed in parallel. Meta claimed this innovation improved Muse Spark’s score on HLE, a challenging benchmark in the AI sector, by approximately 8%.

Muse Spark is the first of a planned series of multimodal reasoning models. In its blog post, Meta expressed optimism about further developments, stating, “We’re on a predictable and efficient scaling trajectory. We look forward to sharing increasingly capable models on the path to personal superintelligence soon.”

This introduction comes at a time when AI technologies are rapidly evolving, with companies across various sectors racing to improve their capabilities. As competition intensifies, innovations like Muse Spark may redefine standards for AI performance, especially in specialized fields such as healthcare and data analysis. The ability to tackle complex queries efficiently and effectively could significantly impact how consumers and professionals leverage AI in their daily lives.

See also
Staff
Written By

The AiPressa Staff team brings you comprehensive coverage of the artificial intelligence industry, including breaking news, research developments, business trends, and policy updates. Our mission is to keep you informed about the rapidly evolving world of AI technology.

You May Also Like

AI Generative

OpenAI unveils GPT Image 2, achieving a record 242-point lead over competitors, transforming the AI image generation landscape with native reasoning capabilities.

Top Stories

Google DeepMind's AI co-clinician outperformed GPT-5.4 in doctor tests, achieving 67 preferences in primary care queries and a remarkable 95% quality score in open-ended...

Top Stories

Meta enhances AI recommendations, driving a 10% increase in Instagram Reels engagement and an 8% rise in global Facebook video time in Q1 FY26.

AI Marketing

Meta expands its AI business assistant to major global markets, enhancing marketing campaign effectiveness with actionable insights and advanced analytics.

Top Stories

Meta's "Name Tag" feature for smart glasses raises significant privacy concerns, as ACLU warns it could endanger vulnerable communities by enabling covert surveillance.

AI Technology

DeepSeek unveils its 1.6 trillion parameter V4 model optimized for Huawei chips, priced at $3.48 per million tokens, amid U.S. IP theft allegations.

AI Generative

OpenAI unveils GPT-5.5 for paid subscribers, enhancing efficiency and accuracy with a 900 million weekly user base, just six weeks after GPT-5.4.

AI Generative

Moonshot AI releases Kimi-K2.6, an open-source LLM surpassing GPT-5.4 with 1 trillion parameters and achieving a benchmark score of 54 on the challenging HLE-Full...

© 2025 AIPressa · Part of Buzzora Media · All rights reserved. This website provides general news and educational content for informational purposes only. While we strive for accuracy, we do not guarantee the completeness or reliability of the information presented. The content should not be considered professional advice of any kind. Readers are encouraged to verify facts and consult appropriate experts when needed. We are not responsible for any loss or inconvenience resulting from the use of information on this site. Some images used on this website are generated with artificial intelligence and are illustrative in nature. They may not accurately represent the products, people, or events described in the articles.