AINeutralarXiv – CS AI · 5h ago6/10
🧠
Do vision-language models search like humans? Reasoning tokens as a reaction-time analog in classic visual-search paradigms
Researchers test whether vision-language models exhibit human-like visual search behaviors using reasoning tokens as a proxy for cognitive effort. The study finds VLMs reproduce some human signatures—like increased effort in conjunction search—but diverge significantly in others, suggesting reasoning tokens offer a novel lens for understanding machine visual cognition.