AIBullisharXiv – CS AI · 8h ago7/10
🧠
Plan, Watch, Recover: A Benchmark and Architectures for Proactive Procedural Assistance
Researchers introduce EgoProactive, a large-scale egocentric dataset and unified benchmark (Pro²Bench) for training AI systems to provide real-time procedural guidance while detecting and recovering from user deviations. The proposed decoupled planner-interaction architecture outperforms proprietary AI models (GPT, Claude, Gemini) on intervention quality and off-plan recovery tasks across six diverse datasets.
🧠 Claude🧠 Gemini🧠 Llama