MVCL-DAF++: Enhancing Multimodal Intent Recognition via Prototype-Aware Contrastive Alignment and Coarse-to-Fine Dynamic Attention Fusion
Researchers introduce MVCL-DAF++, an advanced multimodal intent recognition system that combines prototype-aware contrastive alignment with coarse-to-fine dynamic attention fusion to improve semantic understanding and robustness. The model achieves state-of-the-art performance on benchmark datasets, with notable improvements in rare-class recognition accuracy.