#failure-recovery News & Analysis

7 articles tagged with #failure-recovery. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

7 articles

AIBullisharXiv – CS AI · Jun 27/10

🧠

Self-Healing Agentic Orchestrators for Reliable Tool-Augmented Large Language Model Systems

Researchers present a self-healing orchestration framework for tool-augmented large language models that treats reliability as a bounded runtime control problem, achieving 98.8% task success by mapping failure signals to recovery actions and verifying results. The approach outperforms retry-only and full-replanning baselines across multiple benchmarks, particularly excelling when recovery budgets are constrained.

AIBullisharXiv – CS AI · May 287/10

🧠

ReflexGrad: Within-Episode Failure Recovery in LLM Agents via Progress-Gated Dual-Process Routing

ReflexGrad introduces a dual-process architecture enabling LLM agents to recover from failures within a single episode without requiring demonstrations. The system combines fast continuous refinement with slow causal diagnosis, achieving significant performance improvements on benchmark tasks with smaller models matching larger model performance through architectural innovation rather than scale.

🧠 GPT-5

AIBullisharXiv – CS AI · May 127/10

🧠

Debugging the Debuggers: Failure-Anchored Structured Recovery for Software Engineering Agents

Researchers present PROBE, a framework that improves how AI software engineering agents recover from failures by converting runtime telemetry into structured diagnoses and bounded recovery guidance. The system achieves 65% diagnosis accuracy and 21.8% recovery rates on previously unresolved cases, with a prototype deployed at Microsoft showing practical viability without disrupting existing workflows.

AIBullisharXiv – CS AI · May 127/10

🧠

RePO-VLA: Recovery-Driven Policy Optimization for Vision-Language-Action Models

Researchers introduce RePO-VLA, a policy optimization framework that improves Vision-Language-Action models' ability to recover from failures in complex manipulation tasks. The method increases adversarial robustness from 20% to 75% by learning from recovery trajectories rather than discarding failed attempts, with validation on both simulated and real-world robotic tasks.

AINeutralarXiv – CS AI · 4d ago6/10

🧠

PlanBench-XL: Evaluating Long-Horizon Planning of LLM Tool-Use Agents in Large-Scale Tool Ecosystems

Researchers introduced PlanBench-XL, a benchmark testing how LLM agents plan and execute tasks across 1,665 tools in realistic scenarios. The study reveals significant vulnerabilities in current AI systems, with performance dropping from 51.9% to 11.36% accuracy when tools fail or behave unexpectedly, exposing critical gaps in adaptive planning capabilities.

🧠 GPT-5

AINeutralarXiv – CS AI · Jun 96/10

🧠

ReCoVLA: VLM-Guided Reward Compilation for Failure Recovery in Vision-Language-Action Policies

ReCoVLA introduces a framework that enhances vision-language-action (VLA) policies by using external vision-language models to identify failures and guide residual policy training for recovery. The approach freezes pretrained VLA policies and compiles structured rewards for correction, achieving 66.7% success in simulation and 61.7% in zero-shot real-world deployment compared to 36.7% for baseline methods.

AINeutralarXiv – CS AI · May 126/10

🧠

Route by State, Recover from Trace: STAR with Failure-Aware Markov Routing for Multi-Agent Spatiotemporal Reasoning

Researchers present STAR, a failure-aware routing framework for multi-agent AI systems that handles spatiotemporal reasoning tasks by intelligently routing between specialist agents based on typed failure states rather than generic success/failure signals. The system learns recovery transitions from execution traces and demonstrates improved performance across multiple benchmarks, suggesting that explicit failure-aware routing is more effective than implicit language-based decision-making in complex reasoning tasks.