AIBullisharXiv โ CS AI ยท 5d ago7/104
๐ง
EnterpriseBench Corecraft: Training Generalizable Agents on High-Fidelity RL Environments
Surge AI introduces CoreCraft, the first environment in EnterpriseBench for training AI agents on realistic enterprise workflows. Training GLM 4.6 on this high-fidelity customer support simulation improved task performance from 25% to 37% and showed positive transfer to other benchmarks, demonstrating that quality training environments enable generalizable AI capabilities.