y0news
AnalyticsDigestsSourcesRSSAICrypto
#surge-ai1 article
1 articles
AIBullisharXiv โ€“ CS AI ยท 5d ago7/104
๐Ÿง 

EnterpriseBench Corecraft: Training Generalizable Agents on High-Fidelity RL Environments

Surge AI introduces CoreCraft, the first environment in EnterpriseBench for training AI agents on realistic enterprise workflows. Training GLM 4.6 on this high-fidelity customer support simulation improved task performance from 25% to 37% and showed positive transfer to other benchmarks, demonstrating that quality training environments enable generalizable AI capabilities.