AI Research

APOLLO AI Launches, Analyzing 25B Medical Events to Predict Future Diseases

Mass General Brigham unveils APOLLO, a transformative AI model trained on 25 billion medical events, achieving a 0.92 AUROC for predicting schizophrenia risks.

Staff

Published

4 days ago

In a significant advancement for healthcare analytics, researchers have unveiled “APOLLO,” a groundbreaking multimodal temporal foundation model trained on 25 billion medical events from 7.2 million patients, aimed at bridging the gap in utilizing vast healthcare data. The model, introduced in a recent arXiv preprint, integrates diverse medical modalities—28 in total—to provide insights into disease prediction, patient care, and long-hidden patterns within healthcare systems. This initiative arises from the troubling statistic that only 3% of the approximately 50 petabytes of annual healthcare data is used for clinical insights.

The research team, led by Faisal Mahmood from Mass General Brigham (MGB), developed APOLLO using the MGB-7M dataset, which encompasses 33 years of records from 17 institutions. This extensive dataset includes 1.4 billion laboratory tests, 158 million progress notes, and over 1.1 million medical images. The model employs a transformer-based architecture and utilizes a technique called tokenization to convert various medical events into a format suitable for analysis. This is crucial for capturing the complex interplay of patient data over time.

By leveraging this data, APOLLO has demonstrated superior performance across 322 clinical tasks, including predicting the onset of schizophrenia with an Area Under the Receiver Operating Characteristic curve (AUROC) of 0.92 and a balanced accuracy of 0.97 for forecasting in-hospital dialysis dependence. The model’s ability to analyze both structured and unstructured data positions it as a potential game-changer in healthcare, addressing a longstanding data silo challenge where patient histories are often split into separate categories, hindering comprehensive analysis.

APOLLO’s training method included Masked Token Modeling (MTM), allowing it to reconstruct parts of patient records while maintaining temporal context—a key factor in understanding chronic disease progression. To mitigate risks related to patient privacy, the model’s architecture is designed to isolate the transformer from raw patient data using modality-specific projectors, thereby reducing the chance of protected health information (PHI) leakage.

Evaluations of APOLLO’s predictive capabilities reveal its prowess in disease onset predictions, outperforming traditional statistical models in 74 of 95 tasks. For instance, it predicted a three-year risk of heart failure with an AUROC of 0.88, surpassing the baseline of 0.77, and it achieved an AUROC of 0.85 for predicting Type 2 diabetes risk, compared to 0.61 for traditional methods. Notably, in the oncology sector, the model improved survival prediction for trastuzumab therapy in HER2-positive breast cancer patients to an AUROC of 0.93, significantly exceeding the existing baseline of 0.66.

The study underscores the need for integrated multimodal approaches in clinical settings. APOLLO’s mean AUROC for overall cancer progression reached 0.735, outperforming existing AI implementations that rely on structured data alone or are limited to task-specific supervised training. Additionally, it has shown potential as a “medical search engine,” accurately retrieving similar patient cases based on queries related to pathology slides, even in instances where traditional diagnostic codes were absent from the data.

Despite its promising capabilities, researchers caution that APOLLO’s predictions are correlational rather than causal due to the observational nature of the EHR training dataset. Its analyses focus on stratifying risk within patients already receiving specific therapies rather than determining the most effective treatments for individual patients. Nonetheless, the model’s ability to condense entire clinical histories into unified digital signatures could facilitate precision trial matching and personalized prognostic stratification within healthcare systems.

As healthcare continues to evolve toward a more proactive model, APOLLO represents a significant step towards realizing the goals of computable medicine. With the potential to transform how patient data is leveraged for clinical insights, this model could pave the way for enhanced patient care and outcomes in the future.

AI Research

Friendly AI Chatbots 30% Less Accurate, 40% More Likely to Support Conspiracy Theories, Study Finds

Oxford researchers find friendly AI chatbots are 30% less accurate and 40% more likely to support conspiracy theories, raising concerns over reliability.

Staff6 days ago

Hugging Face Launches ML Intern, Outperforming Claude Code in Scientific Reasoning

Hugging Face launches ML Intern, an open-source AI agent that surpasses Claude Code in scientific reasoning with a 32% GPQA score, offering $1,000 in...

Staff23 April, 2026

AI Research

Google’s AMIE AI Achieves Doctor-Level Diagnostic Insights in Urgent Care Study

Google’s AMIE AI successfully conducted pre-visit medical interviews for 100 patients, achieving diagnostic insights comparable to human doctors, enhancing patient attitudes significantly.

Staff17 April, 2026

AI Generative

Study Reveals 26 LLM Routers Injecting Malicious Code, Draining ETH Wallets

UC Santa Barbara study finds 26 LLM routers injecting malicious code, with one draining Ethereum wallets, exposing developers to severe security risks.

Staff14 April, 2026

AI Research

Small Quantum Computers Achieve Exponential Memory Efficiency in Machine Learning Tasks

Caltech and Google Quantum AI researchers reveal that small quantum computers can achieve up to 6x memory efficiency over classical systems in machine learning...

Staff11 April, 2026

AI Research

Google Unveils TurboQuant: 6x LLM Cache Compression with No Accuracy Loss

Google's TurboQuant algorithm achieves 6x reduction in LLM cache memory with zero accuracy loss, revolutionizing AI efficiency for smaller labs and businesses.

Staff11 April, 2026

AI Technology

BTQ Technologies Reveals Quantum Bitcoin Mining Costs: 10^23 Qubits Required by 2025

BTQ Technologies reveals that quantum Bitcoin mining could require an astronomical 10^23 qubits and 10^25 watts by 2025, urging immediate action on security vulnerabilities.

Staff6 April, 2026

Intel Acquires $14.2B Ireland Fab; Mistral AI Raises $830M for New Data Center

Intel invests $14.2B to fully acquire its Ireland semiconductor facility, while Mistral AI raises $830M to build a new European data center.

Staff4 April, 2026

AIPRESSA.COM

AI Research

APOLLO AI Launches, Analyzing 25B Medical Events to Predict Future Diseases

Trending

Top Stories

Albania Appoints AI Bot Minister Diella Amid Corruption Concerns and EU Membership Goals

AI Government

BigBear.ai Launches Biometric Platform at O’Hare, Acquires Generative AI Ask Sage for $250M

AI Cybersecurity

Endpoint Security Market to Reach $23.9B by 2030 with 7.2% CAGR Amid Rising Cyber Threats

AI Business

Enterprise Architecture Shifts to Strategic Enabler in AI-Driven Business Models

AI Research

Amazon Awards 63 Research Grants to 41 Universities Across 8 Countries for AI Innovation

You May Also Like

AI Research

Friendly AI Chatbots 30% Less Accurate, 40% More Likely to Support Conspiracy Theories, Study Finds

Top Stories

Hugging Face Launches ML Intern, Outperforming Claude Code in Scientific Reasoning

AI Research

Google’s AMIE AI Achieves Doctor-Level Diagnostic Insights in Urgent Care Study

AI Generative

Study Reveals 26 LLM Routers Injecting Malicious Code, Draining ETH Wallets

AI Research

Small Quantum Computers Achieve Exponential Memory Efficiency in Machine Learning Tasks

AI Research

Google Unveils TurboQuant: 6x LLM Cache Compression with No Accuracy Loss

AI Technology

BTQ Technologies Reveals Quantum Bitcoin Mining Costs: 10^23 Qubits Required by 2025

Top Stories

Intel Acquires $14.2B Ireland Fab; Mistral AI Raises $830M for New Data Center