AINeutralarXiv – CS AI · 6h ago6/10
🧠
FALAT: Tracing Failures in LLM Agent Trajectories via Dependency-Guided Search
Researchers introduce FALAT, a diagnostic framework that traces failures in LLM-based agent systems by analyzing dependencies across multi-step trajectories. The system identifies which agent caused a failure and which specific step introduced the decisive error, achieving 46% accuracy on algorithm-generated test cases.