AINeutralarXiv – CS AI · 7h ago6/10
🧠
TraceGraph: Shared Decision Landscapes for Diagnosing and Improving Agent Trajectories
TraceGraph is a new graph-based framework that analyzes multi-model agent trajectories to create shared decision landscapes, revealing how different AI models navigate tasks differently. The tool identifies failure regions and trap states, enabling targeted improvements that increased resolved rates on SWE-bench by 3-4.8%, demonstrating that aggregate benchmark scores mask critical performance divergences.