Honest Lying: Understanding Memory Confabulation in Reflexive Agents
Researchers discovered that reflexive AI agents systematically store confident but false interpretations of tasks in their memory, a phenomenon called memory confabulation, causing them to repeat incorrect behaviors even when environments reset. The study introduces a metric to detect this failure mode and proposes programmatic solutions that significantly improve agent performance and reduce reliance on false reflective content.