AIBullisharXiv – CS AI · Mar 166/10
🧠
AI Planning Framework for LLM-Based Web Agents
Researchers introduce a formal planning framework that maps LLM-based web agents to traditional search algorithms, enabling better diagnosis of failures in autonomous web tasks. The study compares different agent architectures using novel evaluation metrics and a dataset of 794 human-labeled trajectories from WebArena benchmark.