AINeutralarXiv – CS AI · 18h ago6/10
🧠
Beyond English benchmarks: clinical llm evaluation in Brazilian Portuguese
Researchers introduce ClinicalBr, the first bilingual clinical benchmark using 2,892 real Brazilian Portuguese-English case reports to evaluate large language models. The study reveals that English-language advantages in clinical AI are task-dependent, with Portuguese performing comparably in differential diagnosis, exam recommendations, and treatment planning.