AI · Neutral · arXiv – CS AI · 6h ago · 6/10
Are LLMs Ready to Assist Physicians? PhysAssistBench for Interactive Doctor-Patient-EHR Assistance
Researchers introduce PhysAssistBench, a new evaluation framework for testing large language models in real-world clinical settings where physicians, patients, and electronic health records (EHRs) interact simultaneously. The benchmark shows that current leading LLMs struggle to coordinate medical knowledge, patient communication, and precise system interactions at the same time, exposing a critical gap between improvements in isolated capabilities and practical clinical assistance.