AIBullisharXiv – CS AI · 7h ago7/10
🧠
Video Reasoning without Training
Researchers introduce V-Reason, an inference-time optimization method for video reasoning in Large Multimodal Models that eliminates the need for costly reinforcement learning or supervised fine-tuning. By analyzing entropy patterns in model outputs, the method achieves near-RL performance while using 58.6% fewer tokens, offering significant efficiency gains for AI systems.