AI · Bullish · arXiv – CS AI · 14h ago · 7/10
UniToolCall: Unifying Tool-Use Representation, Data, and Evaluation for LLM Agents
UniToolCall introduces a standardized framework unifying tool-use representation, training data, and evaluation for LLM agents. The framework combines 22k+ tools and 390k+ training instances with a unified evaluation methodology, enabling fine-tuned models such as Qwen3-8B to achieve 93% precision, surpassing GPT, Gemini, and Claude on specific benchmarks.