AIBullisharXiv – CS AI · 6h ago6/10
🧠
SIRI: Self-Internalizing Reinforcement Learning with Intrinsic Skills for LLM Agent Training
Researchers introduce SIRI, a three-phase reinforcement learning framework that enables LLM agents to autonomously discover, validate, and internalize reusable skills without external skill generators or inference-time skill banks. Testing on ALFWorld and WebShop benchmarks shows meaningful performance improvements over baseline methods while reducing deployment complexity and latency.