Connect with us

Hi, what are you looking for?

AI Research

AI Achieves 95% Agreement with Clinician Assessments in Medical Interviewing Study

Researchers from Juntendo University reveal that generative AI achieves 95% agreement with clinicians in evaluating medical interviewing skills, enhancing training efficiency.

In a significant development for medical education, researchers from Juntendo University in Japan have demonstrated that generative artificial intelligence (AI) can effectively evaluate clinical interviewing skills, traditionally assessed by experienced clinicians. Published on February 17, 2026, in the journal JMIR Medical Education, the study led by Dr. Hiromizu Takahashi and Professor Toshio Naito examined the efficiency of AI-based assessment (ABA) compared to human-based assessment (HBA), addressing a critical challenge in medical training.

Clinical interviewing is foundational for accurate diagnosis and effective patient care. However, the evaluation process is often labor-intensive, requiring repeated observations and detailed feedback. As medical education expands, the burden of assessment has become increasingly challenging. “Our central message is that AI may help make medical training fairer, faster, and more scalable,” Professor Naito stated.

The research team designed a cross-sectional validation study involving seven participants—medical students, resident physicians, and attending physicians—who conducted clinical interviews with an AI-simulated patient presenting with bilateral leg weakness. These interactions were recorded and converted into transcripts, which were evaluated using the Master Interview Rating Scale, a standardized tool assessing various communication aspects, including information gathering and empathy. The transcripts were analyzed through AI models, specifically GPT-o1 Pro and GPT-5 Pro, and concurrently reviewed by five experienced clinical instructors.

Results indicated a strong agreement between the AI evaluations and those of the clinicians, with only minimal score discrepancies. Notably, the AI’s assessments were more consistent across repeated evaluations, and the time spent on evaluating each transcript was reduced by over half. “Rather than replacing teachers, this research suggests a practical ‘AI-first, faculty-verified’ model in which AI handles the first pass and educators focus their time on coaching, judgment, and high-stakes decisions,” Dr. Takahashi explained.

This advancement has significant implications for medical education. In many training programs, delays in feedback limit opportunities for students to refine their communication skills. The ability for students to receive rapid, consistent evaluations could make repeated practice more accessible, especially in environments with constrained faculty resources. “Students could interview an AI-simulated patient and receive feedback almost immediately instead of waiting days or weeks,” Professor Naito added, emphasizing the potential for enhancing learning experiences.

However, the researchers caution that AI must be employed judiciously. While the technology performed well in their study, it was based on a limited participant pool and a singular clinical scenario. Furthermore, transcript-based evaluations cannot capture critical nonverbal cues, tone, or cultural nuances that play a vital role in real-world patient interactions. “AI should be used with human oversight, because text-only scoring can miss nuances such as tone, nonverbal communication, and cultural context,” both researchers noted.

This study highlights the increasing role of AI in medical education, suggesting that integrating the speed and consistency of AI with the expertise of clinicians could create more efficient and scalable training systems. As demand for high-quality medical education rises, such approaches may be essential in ensuring that future medical professionals receive optimal training while alleviating the workload for educators.

Source: Takahashi, H., et al. (2025). AI- vs Human-Based Assessment of Medical Interview Transcripts in a Generative AI–Simulated Patient System: Cross-Sectional Validation Study. JMIR Medical Education. DOI: 10.2196/81673. https://mededu.jmir.org/2026/1/e81673

See also
Staff
Written By

The AiPressa Staff team brings you comprehensive coverage of the artificial intelligence industry, including breaking news, research developments, business trends, and policy updates. Our mission is to keep you informed about the rapidly evolving world of AI technology.

You May Also Like

AI Finance

Mizuho Bank merges with Mizuho Research & Technologies to enhance operations and leverage Oracle Autonomous AI, aiming for improved efficiency and security.

Top Stories

Anthropic expands Claude Mythos AI into Japan amid U.S. government scrutiny over potential national security risks and AI misuse concerns.

AI Cybersecurity

Japan forms a task force to counter cyber threats from Anthropic's Claude Mythos AI, emphasizing urgent risks to financial infrastructure and national security.

AI Technology

Japan’s draft AI IP code could deter innovation by imposing impractical disclosure requirements on firms, risking regulatory misalignment with U.S. standards.

AI Cybersecurity

Japan's Finance Minister Satsuki Katayama announces a task force to tackle cybersecurity risks from Anthropic's Mythos AI, citing severe threats to financial stability.

AI Technology

NEC collaborates with Anthropic to empower 30,000 employees with AI model Claude, targeting secure, industry-specific solutions for Japan's finance and manufacturing sectors.

AI Cybersecurity

Japan forms a task force to combat cybersecurity threats from Anthropic's Mythos AI, which has already identified thousands of high-severity software vulnerabilities.

AI Regulation

Japan's LDP proposes AI White Paper 2.0 to impose penalties on malicious AI misuse, urging 1 trillion yen investment to enhance copyright and national...

© 2025 AIPressa · Part of Buzzora Media · All rights reserved. This website provides general news and educational content for informational purposes only. While we strive for accuracy, we do not guarantee the completeness or reliability of the information presented. The content should not be considered professional advice of any kind. Readers are encouraged to verify facts and consult appropriate experts when needed. We are not responsible for any loss or inconvenience resulting from the use of information on this site. Some images used on this website are generated with artificial intelligence and are illustrative in nature. They may not accurately represent the products, people, or events described in the articles.