A-MBER: Affective Memory Benchmark for Emotion Recognition
Researchers introduce A-MBER, a benchmark dataset designed to evaluate AI assistants' ability to recognize emotions based on long-term interaction history rather than immediate context. The benchmark tests whether models can retrieve relevant past interactions, infer current emotional states, and provide grounded explanationsβrevealing that memory's value lies in selective, context-aware interpretation rather than simple historical volume.