AINeutralarXiv โ CS AI ยท 7h ago4/10
๐ง
Geometry-Guided Camera Motion Understanding in VideoLLMs
Researchers developed a framework to improve video-language models' understanding of camera motion through geometric analysis. The study introduces CameraMotionDataset and CameraMotionVQA benchmark, revealing that current VideoLLMs struggle with camera motion recognition and proposing a lightweight solution using 3D foundation models.