AINeutralarXiv – CS AI · 15h ago6/10
🧠
TADDLE: A Tool-Augmented Agent for Detecting Deficient LLM-Generated Peer Reviews
Researchers introduce TADDLE, an AI system that detects quality deficiencies in LLM-generated peer reviews by decomposing analysis into specialized tools and multi-label classification. The work addresses a growing problem in academic publishing where AI-written reviews are fluent but potentially flawed, backed by the first expert-annotated benchmark of 1,800 reviews across six defect categories.