AIBearisharXiv โ CS AI ยท 7h ago7/10
๐ง
Large language models show fragile cognitive reasoning about human emotions
Researchers introduced CoRE, a benchmark testing whether large language models can reason about human emotions through cognitive dimensions rather than just labels. The study found that while LLMs capture systematic relations between cognitive appraisals and emotions, they show misalignment with human judgments and instability across different contexts.