AIBullisharXiv – CS AI · 3h ago7/10
🧠
ReflexGrad: Within-Episode Failure Recovery in LLM Agents via Progress-Gated Dual-Process Routing
ReflexGrad introduces a dual-process architecture enabling LLM agents to recover from failures within a single episode without requiring demonstrations. The system combines fast continuous refinement with slow causal diagnosis, achieving significant performance improvements on benchmark tasks with smaller models matching larger model performance through architectural innovation rather than scale.
🧠 GPT-5