AINeutralarXiv โ CS AI ยท 8h ago7/10
๐ง
Sparse Visual Thought Circuits in Vision-Language Models
Research reveals that sparse autoencoder (SAE) features in vision-language models often fail to compose modularly for reasoning tasks. The study finds that combining task-selective feature sets frequently causes output drift and accuracy degradation, challenging assumptions used in AI model steering methods.