AIBullisharXiv – CS AI · 5h ago7/10
🧠
Insights Generator: Systematic Corpus-Level Trace Diagnostics for LLM Agents
Researchers introduce Insights Generator (IG), a multi-agent system that automates the diagnosis of failures in large language model agents by analyzing execution trace corpora at scale. IG produces evidence-backed natural language insights about systematic behavioral patterns, demonstrating 30.4 percentage point performance improvements when human experts implement its recommendations.