AINeutralarXiv – CS AI · 10h ago7/10
🧠
MEDLAYXPLAIN: Benchmarking the Expert-Lay Gap in Medical Vision-Language Models
Researchers introduce MedLayXPlain, a large-scale benchmark and dataset for evaluating medical vision-language models' ability to generate patient-accessible descriptions of diagnostic imaging. The study reveals a systematic gap between expert-level medical AI performance and lay-person comprehension, with medical VLMs excelling at technical accuracy but failing at accessibility, while general-purpose models prioritize clarity over clinical precision.