AI Research

AI Achieves 95% Agreement with Clinician Assessments in Medical Interviewing Study

Researchers from Juntendo University report that generative AI reached 95% agreement with clinicians when evaluating medical interviewing skills, a result that could make training more efficient.

Staff

Published

15 April, 2026

In a significant development for medical education, researchers from Juntendo University in Japan have demonstrated that generative artificial intelligence (AI) can effectively evaluate clinical interviewing skills, traditionally assessed by experienced clinicians. Published on February 17, 2026, in the journal JMIR Medical Education, the study led by Dr. Hiromizu Takahashi and Professor Toshio Naito examined the efficiency of AI-based assessment (ABA) compared to human-based assessment (HBA), addressing a critical challenge in medical training.

Clinical interviewing is foundational for accurate diagnosis and effective patient care. However, evaluating it is labor-intensive, requiring repeated observations and detailed feedback. As medical education expands, the assessment burden on faculty has grown. “Our central message is that AI may help make medical training fairer, faster, and more scalable,” Professor Naito stated.

The research team designed a cross-sectional validation study involving seven participants (medical students, resident physicians, and attending physicians) who conducted clinical interviews with an AI-simulated patient presenting with bilateral leg weakness. These interactions were recorded and transcribed, and the transcripts were evaluated using the Master Interview Rating Scale, a standardized tool that assesses aspects of communication including information gathering and empathy. The transcripts were scored by two AI models, GPT-o1 Pro and GPT-5 Pro, and were also reviewed by five experienced clinical instructors.

Results indicated strong agreement between the AI evaluations and those of the clinicians, reported at 95%, with only minimal score discrepancies. Notably, the AI’s assessments were more consistent across repeated evaluations, and the time spent evaluating each transcript fell by more than half. “Rather than replacing teachers, this research suggests a practical ‘AI-first, faculty-verified’ model in which AI handles the first pass and educators focus their time on coaching, judgment, and high-stakes decisions,” Dr. Takahashi explained.

This advancement has significant implications for medical education. In many training programs, delays in feedback limit opportunities for students to refine their communication skills. Rapid, consistent evaluations could make repeated practice more accessible, especially where faculty resources are constrained. “Students could interview an AI-simulated patient and receive feedback almost immediately instead of waiting days or weeks,” Professor Naito added, emphasizing the potential to improve the learning experience.

However, the researchers caution that AI must be employed judiciously. While the technology performed well in their study, the findings rest on a small participant pool and a single clinical scenario. Furthermore, transcript-based evaluations cannot capture nonverbal cues, tone, or cultural nuances that play a vital role in real-world patient interactions. “AI should be used with human oversight, because text-only scoring can miss nuances such as tone, nonverbal communication, and cultural context,” the researchers noted.

This study highlights the increasing role of AI in medical education, suggesting that integrating the speed and consistency of AI with the expertise of clinicians could create more efficient and scalable training systems. As demand for high-quality medical education rises, such approaches may be essential in ensuring that future medical professionals receive optimal training while alleviating the workload for educators.

Source: Takahashi, H., et al. (2026). AI- vs Human-Based Assessment of Medical Interview Transcripts in a Generative AI–Simulated Patient System: Cross-Sectional Validation Study. JMIR Medical Education. DOI: 10.2196/81673. https://mededu.jmir.org/2026/1/e81673
