AI · Neutral · arXiv – CS AI · 9h ago · 6/10
Beyond Confidence: Rethinking Self-Assessments for Performance Prediction in LLMs
Researchers propose using multidimensional self-assessment based on cognitive appraisal theory to predict LLM failures more reliably than confidence alone. Testing across 12 models and 38 tasks, they find effort and ability dimensions consistently outperform confidence, with task type determining which dimension proves most predictive.