Northeastern Researchers Uncover AI Bias in Health Care Using Sparse Autoencoders

Northeastern researchers reveal AI bias in healthcare, showing that racial biases embedded in LLMs can harm patient care and underscoring the urgent need for transparency.

Artificial intelligence is increasingly being integrated into healthcare settings, aiding tasks from writing physicians’ notes to making patient recommendations. However, new research from Northeastern University highlights a critical concern: AI and large language models (LLMs) can perpetuate racial biases embedded in their training data, influencing outputs in ways that may not be immediately apparent to users. This study aims to decode the decision-making process of LLMs, shedding light on when race may be problematic in their recommendations.

Hiba Ahsan, a Ph.D. student and lead author of the study, emphasizes a significant finding: previous research indicates that Black patients are often less likely to receive pain medications despite reporting pain levels comparable to those of white patients. Ahsan warns that AI models could replicate this bias, making potentially harmful recommendations based on race.

In some medical contexts, considering race can be clinically important. For instance, gestational hypertension is more prevalent among individuals of African descent, while cystic fibrosis occurs more frequently in Northern Europeans, according to the Mayo Clinic. However, Ahsan notes that many biases in LLMs arise from irrelevant prejudices based on race, leading to outcomes that could adversely affect patient care.

The researchers used a tool called a sparse autoencoder to examine the intricate, often opaque inner workings of LLMs. Ahsan explains that these models condense complex inputs through a process called encoding, producing intermediate numerical representations known as “latents.” The model can work with these numbers, but to humans they are largely uninterpretable.
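For readers who want to see what such a tool looks like in practice, below is a minimal sketch of a sparse autoencoder written in PyTorch. It illustrates the general technique rather than the researchers’ own code: the layer sizes, the ReLU activation, and the L1 sparsity penalty are common choices assumed here for clarity.

```python
# A minimal sketch of a sparse autoencoder; sizes and penalty are assumptions.
import torch
import torch.nn as nn


class SparseAutoencoder(nn.Module):
    def __init__(self, hidden_dim: int, latent_dim: int):
        super().__init__()
        # Encoder maps an LLM hidden state to a (much wider) latent vector.
        self.encoder = nn.Linear(hidden_dim, latent_dim)
        # Decoder reconstructs the original hidden state from the latents.
        self.decoder = nn.Linear(latent_dim, hidden_dim)

    def forward(self, hidden_state: torch.Tensor):
        # ReLU keeps latents non-negative; the L1 penalty below pushes most of
        # them toward zero, so each active latent is easier to interpret.
        latents = torch.relu(self.encoder(hidden_state))
        reconstruction = self.decoder(latents)
        return reconstruction, latents


def sae_loss(x, reconstruction, latents, l1_weight=1e-3):
    # Reconstruction error plus an L1 sparsity penalty on the latents.
    mse = ((x - reconstruction) ** 2).mean()
    sparsity = latents.abs().mean()
    return mse + l1_weight * sparsity
```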

By employing the sparse autoencoder, Ahsan and her advisor Byron Wallace, a machine learning expert, aimed to translate these latents into comprehensible data. This tool can indicate whether a particular data point is associated with race or another identifiable characteristic. Ahsan describes the process: if the autoencoder detects a latent related to race, it will signal that race is influencing the model’s output.
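One way to make that idea concrete is to ask which latents behave differently for notes from Black and white patients. The toy example below uses made-up placeholder data and ranks latents by how far their average activations diverge between the two groups; the array shapes and the ranking procedure are assumptions for illustration, not the study’s actual method.

```python
import numpy as np

# Placeholder data: one row of sparse-autoencoder activations per clinical
# note, and a made-up race label per note. Real activations would come from
# the model; these are random stand-ins.
latents = np.random.rand(1000, 4096)
is_black = np.random.rand(1000) > 0.5

# Rank latents by how far their average activation diverges between groups.
gap = latents[is_black].mean(axis=0) - latents[~is_black].mean(axis=0)
race_latent_indices = np.argsort(-np.abs(gap))[:20]
print("Candidate race-associated latents:", race_latent_indices)
```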

The researchers analyzed clinical notes and discharge summaries from the publicly available MIMIC dataset, which anonymizes personal information. They focused on notes where patients identified as white or Black. After processing these notes through an LLM named Gemma-2, they employed the sparse autoencoder to identify latents corresponding to race.
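The overall pipeline can be sketched roughly as follows, using the Hugging Face transformers library. This is a hedged approximation: the specific Gemma-2 checkpoint, the layer index, the stand-in linear encoder, and the hypothetical list of race-associated latent indices are illustrative assumptions rather than details reported in the study.

```python
import torch
from transformers import AutoModel, AutoTokenizer

# Illustrative checkpoint; the study used Gemma-2, but the exact variant here
# is an assumption.
tokenizer = AutoTokenizer.from_pretrained("google/gemma-2-2b")
model = AutoModel.from_pretrained("google/gemma-2-2b", output_hidden_states=True)

note = "De-identified discharge summary text goes here."
inputs = tokenizer(note, return_tensors="pt")

with torch.no_grad():
    outputs = model(**inputs)
    # Pull one intermediate layer's hidden states (layer 12 is an arbitrary
    # illustrative choice) and average over tokens to get one vector per note.
    hidden = outputs.hidden_states[12].mean(dim=1)

# Stand-in for a trained sparse-autoencoder encoder; in a real pipeline the
# weights would have been learned beforehand.
encoder = torch.nn.Linear(model.config.hidden_size, 16384)
latents = torch.relu(encoder(hidden))

# Hypothetical indices of latents previously flagged as race-associated.
race_latent_indices = [101, 2048, 5731]
active = [i for i in race_latent_indices if latents[0, i] > 0]
if active:
    print(f"Race-associated latents fired on this note: {active}")
```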

The findings revealed that racial biases were indeed embedded in the LLM. The autoencoder found that latents associated with Black patients frequently activated alongside stigmatizing concepts such as “incarceration,” “gunshot,” and “cocaine use.” While racial bias in AI has been documented before, this research provides a rare glimpse into the internal features that shape LLM responses.

Ahsan emphasizes the challenges of interpretability in LLMs, describing them as “black boxes” that obscure the factors leading to specific decisions. Utilizing a sparse autoencoder to peer into these systems offers a pathway for physicians to discern when a patient’s race may be improperly factored into AI recommendations. Increased transparency could help doctors intervene and mitigate bias or seek alternative solutions, such as retraining the AI on more representative datasets.

Wallace acknowledges that while they did not invent sparse autoencoders, their application in clinical settings is pioneering. He asserts, “If we’re going to use these models in health care and want to do it safely, we probably need to improve the methods for interpreting them.” The sparse autoencoder method represents a significant step in addressing the interpretability challenges inherent in AI systems.

As AI continues to play a larger role in healthcare, the implications of this research are profound. With the ability to identify and understand biases in AI recommendations, healthcare providers can work towards ensuring more equitable treatment and decision-making, paving the way for advancements that prioritize patient welfare over algorithmic convenience.
