Zamba2-VL Technical Report
Zyphra released Zamba2-VL, a suite of vision-language models combining Mamba2 state-space layers with transformer blocks, achieving competitive performance with leading VLMs while delivering 10x faster time-to-first-token speeds. The three released models (1.2B, 2.7B, 7B parameters) represent a significant efficiency breakthrough for edge and on-device deployment.