AINeutralGoogle DeepMind Blog · 8h ago6/10
🧠
Introducing Gemma 4 12B: a unified, encoder-free multimodal model
Google introduces Gemma 4 12B, a unified multimodal AI model that combines text and image understanding without separate encoders, advancing efficiency in lightweight language models. The encoder-free architecture represents a technical shift toward more streamlined multimodal AI systems accessible to developers and researchers.