AINeutralarXiv – CS AI · 8h ago6/10
🧠
Med-Scout: Curing MLLMs' Geometric Blindness in Medical Perception via Geometry-Aware RL Post-Training
Researchers introduce Med-Scout, a reinforcement learning framework that addresses a critical flaw in multimodal large language models (MLLMs) used for medical diagnosis: geometric blindness, or the inability to ground outputs in objective spatial constraints. The system uses unlabeled medical images with three proxy tasks to derive supervision signals, achieving 40% performance improvements on a new Med-Scout-Bench benchmark while generalizing to broader medical understanding tasks.