y0news
AnalyticsDigestsSourcesRSSAICrypto
#disinformation-risk1 article
1 articles
AIBearisharXiv โ€“ CS AI ยท 7h ago7/10
๐Ÿง 

Beyond Surface Judgments: Human-Grounded Risk Evaluation of LLM-Generated Disinformation

A new study challenges the validity of using LLM judges as proxies for human evaluation of AI-generated disinformation, finding that eight frontier LLM judges systematically diverge from human reader responses in their scoring, ranking, and reliance on textual signals. The research demonstrates that while LLMs agree strongly with each other, this internal coherence masks fundamental misalignment with actual human perception, raising critical questions about the reliability of automated content moderation at scale.