AINeutralarXiv – CS AI · 6h ago6/10
🧠
Regimes: An Auditable, Held-Out-Gated Improvement Loop Demonstrated on LongMemEval with ActiveGraph
Researchers introduce Regimes, an auditable autonomous improvement loop built on the ActiveGraph event-sourced runtime that enables transparent, reproducible AI agent optimization. The system diagnoses failures, proposes repairs, and validates them through multiple gates before promotion, demonstrating 5-10% held-out accuracy improvements on long-context reading comprehension tasks.