AI · Neutral · arXiv – CS AI · 7h ago · 6/10
Learning When to Translate for Multilingual Reasoning
Researchers introduce Luar, a reinforcement learning framework that trains reasoning language models to translate non-English inputs into English only when translation is needed for reliable reasoning. The approach outperforms standard baselines on multilingual reasoning, particularly for low-resource languages, while avoiding unnecessary translation overhead.