AINeutralarXiv – CS AI · 6h ago6/10
🧠
SPOT-E: Test-Time Entropy Shaping with Visual Spotlights for Frozen VLMs
Researchers introduce SPOT-E, a test-time method that improves vision-language models' performance on evidence-intensive tasks by using entropy-shaping to identify and highlight critical visual information. The technique works without retraining frozen VLMs and demonstrates consistent improvements across benchmarks while maintaining robustness under visual corruption.