AINeutralarXiv – CS AI · 7h ago6/10
🧠
MLLM-Microscope: Unlocking Hidden Structure Within Multimodal Large Language Models
Researchers introduce MLLM-Microscope, a novel analytical system that examines the internal representations of multimodal large language models (MLLMs) by measuring linearity, intrinsic dimension, and anisotropy across transformer layers. Testing on LLaVA-NeXT and OmniFusion reveals that modality fusion approaches significantly influence how embeddings behave within the model architecture, with OmniFusion demonstrating more consistent dimensional properties across layers.