AIBullisharXiv – CS AI · 18h ago7/10
🧠
TAME: A Trustworthy Test-Time Evolution of Agent Memory with Systematic Benchmarking
Researchers introduce TAME, a trust-aware memory evolution framework that addresses the vulnerability of AI agents to safety misalignment during test-time learning. The system uses paired Executor and Evaluator components to selectively reinforce and reuse agent memories, demonstrating 14.6 percentage point accuracy improvements on mathematical benchmarks while maintaining trustworthiness.
🧠 GPT-5