AI · Bullish · arXiv – CS AI · 9h ago · 7/10
MAGNIFIED: RL Fine-tuning of Multimodal Large Language Models for Motion Planning
Researchers propose MAGNIFIED, a reinforcement learning fine-tuning approach for multimodal large language models that optimizes autonomous driving planning by learning from planning-specific rewards rather than token prediction alone. Tests on the Waymo Open Motion Dataset show substantial improvements over supervised fine-tuning baselines, including a 10.5% reduction in trajectory overlap and a 38.9% reduction in off-road violations.