AINeutralarXiv – CS AI · 3h ago6/10
🧠
AsyncTool: Evaluating the Asynchronous Function Calling Capability under Multi-Task Scenarios
Researchers introduce AsyncTool, a benchmark for evaluating how well LLM-based agents handle multiple concurrent tasks with realistic tool response delays. The study reveals that current AI agents struggle significantly with asynchronous multitasking, experiencing substantial performance degradation when tool feedback is delayed, highlighting a critical gap in real-world applicability.