AI · Bullish · arXiv – CS AI · Mar 5 · 7/10
Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play
Researchers introduce Vision-Zero, a self-improving AI framework that trains vision-language models through competitive games without requiring human-labeled data. The system uses strategic self-play and can work with arbitrary images, achieving state-of-the-art performance on reasoning and visual understanding tasks while reducing training costs.