Learning When Not to Act: Mitigating Tool Abuse in Agentic Reinforcement Learning
Researchers propose EAPO, a reinforcement learning framework that teaches AI agents to use external tools selectively rather than excessively. The method improves accuracy while reducing redundant tool calls by 18-25% across multiple language models, demonstrating that agents can learn optimal tool-use patterns without compromising reasoning capabilities.