AI · Bullish · arXiv – CS AI · 7h ago · 6/10
MSUE: Multi-Modal Soccer Understanding Expert
Researchers developed MSUE, a multi-expert question-answering system that reached 0.95 accuracy in the 2026 SoccerNet VQA Challenge. The system combines vision-language models, large language models, and specialized experts, and uses an LLM router to dispatch each question dynamically to a text, image, or video processing expert. The result points to progress in multi-modal AI for domain-specific tasks.
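The router-and-experts pattern described above can be illustrated with a minimal sketch. This is not the MSUE implementation: the summary says routing is done by an LLM, whereas the `route` function here is a simple rule-based stand-in that picks an expert from the attached modality. The names `Question`, `route`, `EXPERTS`, and the three expert functions are all hypothetical, and the experts return placeholder strings instead of calling real models.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Optional


@dataclass
class Question:
    """A question with optional media attached (hypothetical structure)."""
    text: str
    image_path: Optional[str] = None
    video_path: Optional[str] = None


def route(question: Question) -> str:
    """Rule-based stand-in for the LLM router: choose an expert by modality."""
    if question.video_path is not None:
        return "video"
    if question.image_path is not None:
        return "image"
    return "text"


# Placeholder experts. A real system would call an LLM or a
# vision-language model here; these only tag the question.
def text_expert(q: Question) -> str:
    return f"[text expert] {q.text}"


def image_expert(q: Question) -> str:
    return f"[image expert] {q.text} ({q.image_path})"


def video_expert(q: Question) -> str:
    return f"[video expert] {q.text} ({q.video_path})"


EXPERTS: Dict[str, Callable[[Question], str]] = {
    "text": text_expert,
    "image": image_expert,
    "video": video_expert,
}


def answer(question: Question) -> str:
    """Dispatch the question to the expert selected by the router."""
    return EXPERTS[route(question)](question)
```

Calling `answer(Question("Which foul was committed?", video_path="clip.mp4"))` would send the question to the video expert. Keeping the experts in a dictionary keyed by name means the rule-based `route` could be swapped for a model-backed router that returns the same keys, without touching the dispatch code.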