AIBullisharXiv โ CS AI ยท 1d ago6/10
๐ง
Towards Effective Experiential Learning: Dual Guidance for Utilization and Internalization
Researchers propose Dual Guidance Optimization (DGO), a new framework that improves large language model training by combining external experience banks with internal knowledge to better mimic human learning patterns. The approach shows consistent improvements over existing reinforcement learning methods for reasoning tasks.