AINeutralarXiv – CS AI · 9h ago6/10
🧠
CollabBench: Benchmarking and Unleashing Collaborative Ability of LLMs with Diverse Players via Proactive Engagement
Researchers introduce CollabBench, a benchmark for evaluating LLM-based agents' ability to collaborate with diverse human partners in cooperative game environments. The framework uses simulated player profiles and a hybrid training approach that balances task efficiency with emotional adaptation, achieving 19.5% higher efficiency and 24.4% improved affective performance compared to base models.