AINeutralarXiv – CS AI · 7h ago6/10
🧠
BAGEN: Are LLM Agents Budget-Aware?
Researchers introduce BAGEN, a framework for evaluating whether large language model agents properly manage computational budgets during execution. The study reveals that frontier AI models consistently fail to predict remaining costs and continue spending resources on unlikely-to-succeed tasks, though budget-aware training can reduce token waste by 28-64% on failed trajectories.