y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#core-benchmark News & Analysis

1 article tagged with #core-benchmark. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

1 articles
AIBearisharXiv โ€“ CS AI ยท Mar 167/10
๐Ÿง 

Large language models show fragile cognitive reasoning about human emotions

Researchers introduced CoRE, a benchmark testing whether large language models can reason about human emotions through cognitive dimensions rather than just labels. The study found that while LLMs capture systematic relations between cognitive appraisals and emotions, they show misalignment with human judgments and instability across different contexts.