AINeutralarXiv – CS AI · 6h ago5/10
🧠
Skill Availability and Presentation Granularity in Large-Language-Model Agents: A Controlled SkillsBench Study
A controlled study examines how large-language-model agents perform with different skill documentation formats using SkillsBench, finding that skill availability dramatically improves task success (18-36 percentage points) while variations in presentation granularity produce minimal and uncertain effects across models.
🧠 GPT-5