AIBullisharXiv – CS AI · 6h ago7/10
🧠
SDR: Set-Distance Rewards for Radiology Report Generation
Researchers introduce Set-Distance Rewards (SDR), a novel reinforcement learning approach for chest X-ray report generation that treats medical reports as unordered sets rather than causal chains. The method achieves 4-8% improvements over supervised fine-tuning across multiple vision-language models and enables efficient test-time scaling by pruning low-quality candidates mid-generation.
🧠 GPT-4🧠 Gemini