AI · Bullish · arXiv – CS AI · 5h ago
Adaptive Social Learning via Mode Policy Optimization for Language Agents
Researchers propose an Adaptive Social Learning (ASL) framework with an Adaptive Mode Policy Optimization (AMPO) algorithm to improve the reasoning abilities of language agents in social interactions. The system dynamically adjusts its reasoning depth based on context and reportedly achieves 15.6% higher performance than GPT-4o while using 32.8% shorter reasoning chains.