AI · Neutral · arXiv – CS AI · 4h ago · 6/10
Omni-Perception Policy Optimization for Multimodal Emotion Reasoning
Researchers introduce OPPO, a reinforcement learning framework designed to improve how multimodal AI systems (Omni-MLLMs) understand emotion by better integrating visual, acoustic, and textual information. The method targets two failure modes: hallucinating cross-modal information and failing to fully use the available data. It achieves state-of-the-art results on emotion recognition benchmarks.