AI · Bullish · arXiv - CS AI · 2d ago · 7/10
Are Video Reasoning Models Ready to Go Outside?
Researchers propose ROVA, a new training framework that makes vision-language models more robust under real-world conditions, with accuracy gains of up to 24%. The framework targets the performance degradation caused by weather, occlusion, and camera motion, which can cut the accuracy of current models by up to 35%.