AIBullisharXiv – CS AI · 6h ago7/10
🧠
SPARC: Separating Perception And Reasoning Circuits for Test-time Scaling of VLMs
Researchers introduce SPARC, a modular framework that decouples visual perception from reasoning in vision-language models to improve test-time scaling efficiency. By separating tasks into explicit visual search and conditional reasoning stages, SPARC achieves significant performance gains on visual reasoning benchmarks while reducing computational token requirements by up to 200×.