AINeutralarXiv – CS AI · 6h ago6/10
🧠
SARA: Unlocking Multilingual Knowledge in Mixture-of-Experts via Semantically Anchored Routing Alignment
Researchers introduce SARA, a framework that improves multilingual performance in Mixture-of-Experts language models by aligning routing patterns between low-resource and high-resource languages. The method uses semantic anchoring and Jensen-Shannon divergence constraints to enable better expert sharing across languages, demonstrating measurable improvements on benchmark tests.