AIBullisharXiv – CS AI · 9h ago7/10
🧠
Video Understanding Reward Modeling: A Robust Benchmark and Performant Reward Models
Researchers introduce Video Understanding Reward Bench (VURB), a comprehensive benchmark with 2,100 preference pairs for evaluating video reward models, alongside VUP-35K, a large-scale dataset of 35,000 preference examples. Two new models, VideoDRM and VideoGRM, achieve state-of-the-art performance on video understanding tasks, advancing multimodal AI capabilities beyond text and images.