Plan, Watch, Recover: A Benchmark and Architectures for Proactive Procedural Assistance
Researchers introduce EgoProactive, a large-scale egocentric dataset and unified benchmark (Pro²Bench) for training AI systems to provide real-time procedural guidance while detecting and recovering from user deviations. The proposed decoupled planner-interaction architecture outperforms proprietary AI models (GPT, Claude, Gemini) on intervention quality and off-plan recovery tasks across six diverse datasets.