AIBullisharXiv – CS AI · 18h ago7/10
🧠
CUA-Gym: Scaling Verifiable Training Environments and Tasks for Computer-Use Agents
Researchers introduce CUA-Gym, a scalable pipeline for generating verified training data for computer-use agents through co-generation of task instructions, environment states, and reward functions. The resulting dataset of 32,112 verified training tuples across 110 environments enables AI agents to achieve 62.1-72.6% performance on benchmarks, significantly advancing verifiable reinforcement learning for autonomous computer interaction.