AI · Neutral · arXiv – CS AI · 8h ago · 6/10
From Knowing to Acting: Benchmarking Self-Awareness Capability of LLM Agents
Researchers introduce KAPRO, a framework for evaluating whether LLM agents can accurately decide when to call external tools and when to rely on internal knowledge. The study finds that open-source models overuse tools, a behavior it attributes to pattern matching, while proprietary models show better self-awareness. The results highlight a critical gap in current AI agent capabilities.
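
To make "tool overuse" concrete, here is a minimal, hypothetical sketch of how such a rate could be computed from labeled agent runs. It is not KAPRO's actual metric or code: the `Episode` record, its fields, and the `tool_use_rates` helper are all assumptions introduced for illustration.

```python
from dataclasses import dataclass
from typing import Iterable, Tuple


@dataclass
class Episode:
    """One agent run on one question (hypothetical record, not from KAPRO)."""
    needs_tool: bool  # assumed ground-truth label: is a tool actually required?
    used_tool: bool   # what the agent did: did it call a tool?


def tool_use_rates(episodes: Iterable[Episode]) -> Tuple[float, float]:
    """Return (overuse_rate, underuse_rate).

    overuse_rate:  share of tool-free questions where the agent still called a tool.
    underuse_rate: share of tool-requiring questions where the agent skipped the tool.
    Either rate is 0.0 when its group is empty.
    """
    episodes = list(episodes)
    no_need = [e for e in episodes if not e.needs_tool]
    need = [e for e in episodes if e.needs_tool]

    overuse = sum(e.used_tool for e in no_need) / len(no_need) if no_need else 0.0
    underuse = sum(not e.used_tool for e in need) / len(need) if need else 0.0
    return overuse, underuse
```

Under these assumptions, a model that calls a tool on most questions it could have answered from internal knowledge would show a high overuse rate, which is the pattern the summary reports for open-source models.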