AI · Neutral · arXiv – CS AI · 3h ago · 6/10
Learning Query-Aware Budget-Tier Routing for Runtime Agent Memory
Researchers introduce BudgetMem, a runtime memory framework for LLM agents that uses query-aware routing to dynamically allocate computational resources across memory modules at three cost tiers. The system employs reinforcement learning to optimize the performance-cost trade-off, demonstrating improvements over static memory approaches across multiple benchmark datasets.
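
To make the routing idea concrete, here is a minimal sketch of query-aware tier selection under a performance-cost reward. It is not BudgetMem's implementation: the tier names, the relative costs, the `cost_weight` parameter, the word-count query feature, and the epsilon-greedy bandit (a simple stand-in for the reinforcement-learned policy the summary mentions) are all assumptions made for illustration.

```python
import random

# Hypothetical tier names and relative costs; the summary only says there are three tiers.
TIERS = ("low", "mid", "high")
TIER_COST = {"low": 1.0, "mid": 3.0, "high": 10.0}


def reward(quality: float, tier: str, cost_weight: float = 0.05) -> float:
    """Performance-cost trade-off: task quality minus a weighted cost penalty."""
    return quality - cost_weight * TIER_COST[tier]


class TierRouter:
    """Epsilon-greedy bandit keyed by a coarse query feature (illustrative only)."""

    def __init__(self, epsilon: float = 0.1, seed: int = 0):
        self.epsilon = epsilon
        self.rng = random.Random(seed)
        self.value = {}  # (bucket, tier) -> running mean reward
        self.count = {}  # (bucket, tier) -> number of updates

    @staticmethod
    def bucket(query: str) -> str:
        # Assumed query feature: word count as a crude proxy for difficulty.
        return "long" if len(query.split()) > 8 else "short"

    def choose(self, query: str) -> str:
        b = self.bucket(query)
        if self.rng.random() < self.epsilon:
            return self.rng.choice(TIERS)  # explore a random tier
        # Exploit: pick the tier with the best estimated reward for this bucket.
        return max(TIERS, key=lambda t: self.value.get((b, t), 0.0))

    def update(self, query: str, tier: str, r: float) -> None:
        key = (self.bucket(query), tier)
        n = self.count.get(key, 0) + 1
        self.count[key] = n
        old = self.value.get(key, 0.0)
        self.value[key] = old + (r - old) / n  # incremental mean
```

In use, an agent would call `choose(query)` before a memory lookup, run the memory module at the returned tier, score the answer, and feed `reward(quality, tier)` back through `update`. Raising `cost_weight` pushes the router toward cheaper tiers.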