y0news
#long-horizon-agents1 article
1 articles
AIBullisharXiv โ€“ CS AI ยท 4h ago2
๐Ÿง 

MemPO: Self-Memory Policy Optimization for Long-Horizon Agents

Researchers propose MemPO (Self-Memory Policy Optimization), a new algorithm that enables AI agents to autonomously manage their memory during long-horizon tasks. The method achieves significant performance improvements with 25.98% F1 score gains over base models while reducing token usage by 67.58%.