AI · Bearish · arXiv CS AI · 4h ago
Beyond Accuracy: Risk-Sensitive Evaluation of Hallucinated Medical Advice
Researchers propose a new risk-sensitive framework for evaluating AI hallucinations in medical advice that considers potential harm rather than just factual accuracy. The study reveals that AI models with similar performance show vastly different risk profiles when generating medical recommendations, highlighting critical safety gaps in current evaluation methods.