🧠 AI · ⚪ Neutral · Importance 6/10
ReLope: KL-Regularized LoRA Probes for Multimodal LLM Routing
🤖AI Summary
Researchers introduce ReLope, a new routing method for multimodal large language models that uses KL-regularized LoRA probes and attention mechanisms to improve cost-performance balance. The method addresses the challenge of degraded probe performance when visual inputs are added to text-only LLMs.
Key Takeaways
- Standard probe routing methods that work well for text-only LLMs perform poorly when applied to multimodal LLMs with visual inputs.
- Visual inputs weaken the separability of correctness signals in hidden states, making routing decisions more difficult.
- The Attention Probe aggregates hidden states from preceding layers based on attention scores to recover distributed correctness signals.
- ReLope uses lightweight LoRA adapters with KL regularization to learn routing-aware representations for better model selection.
- Comprehensive experiments show the new methods consistently outperform existing baselines for multimodal LLM routing.
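The two mechanisms in the takeaways above can be sketched in a few lines. This is a minimal illustrative sketch, not the authors' implementation: the paper's actual architecture, layer counts, and loss weighting are not given in this summary, so all function names, shapes, and the 0.1 penalty weight below are assumptions.

```python
import numpy as np

def softmax(x, axis=-1):
    z = x - x.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def attention_probe(layer_states, w_query, w_head):
    """Attention-style probe (sketch): pool hidden states from all
    preceding layers with learned attention scores, then score the
    probability that the small model's answer is correct.
    layer_states: (num_layers, d) one hidden state per layer
    w_query, w_head: (d,) learned parameters (hypothetical)"""
    scores = layer_states @ w_query           # (num_layers,) per-layer score
    attn = softmax(scores)                    # attention over layers
    pooled = attn @ layer_states              # (d,) weighted combination
    logit = pooled @ w_head
    return 1.0 / (1.0 + np.exp(-logit))      # P(answer correct)

def kl_divergence(p, q, eps=1e-9):
    """KL(p || q) between categorical distributions; a penalty like this
    can keep a LoRA-adapted model's outputs close to the frozen base."""
    p, q = np.asarray(p, float) + eps, np.asarray(q, float) + eps
    p, q = p / p.sum(), q / q.sum()
    return float(np.sum(p * np.log(p / q)))

# Routing-style objective (sketch): probe loss plus KL regularizer.
rng = np.random.default_rng(0)
states = rng.normal(size=(12, 16))            # e.g. 12 layers, hidden dim 16
wq, wh = rng.normal(size=16), rng.normal(size=16)
p_correct = attention_probe(states, wq, wh)
reg = kl_divergence([0.7, 0.2, 0.1], [0.6, 0.3, 0.1])
loss = -np.log(p_correct) + 0.1 * reg         # 0.1 is an assumed weight
```

At inference, a router would compare `p_correct` against a threshold: queries the small multimodal model is likely to answer correctly stay cheap, while the rest escalate to the large model.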
#multimodal-llm #model-routing #lora #attention-mechanisms #machine-learning #cost-optimization #model-efficiency
Read Original → via arXiv – CS AI