AI · Bullish · arXiv – CS AI · 15h ago · 7/10
The MiniMax-M2 Series: Mini Activations Unleashing Max Real-World Intelligence
MiniMax introduces the M2 series of Mixture-of-Experts language models, with 229.9B total parameters of which only 9.8B are activated per token. The series reaches frontier-tier performance on agentic tasks through agent-driven data pipelines and a custom reinforcement learning system called Forge. The M2.7 checkpoint shows early self-evolution capabilities, autonomously debugging and modifying its own training scaffold.
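To put the two parameter counts in perspective, here is a minimal arithmetic sketch of the sparsity they imply. Only the 229.9B and 9.8B figures come from the summary; the function names `active_fraction` and `sparsity_ratio` are hypothetical helpers introduced for illustration, and nothing here reflects how MiniMax actually routes tokens to experts.

```python
# Illustrative arithmetic only: the two constants are the figures quoted in
# the summary; the helper names are hypothetical, not part of any MiniMax API.

TOTAL_PARAMS_B = 229.9   # total parameters, in billions (from the summary)
ACTIVE_PARAMS_B = 9.8    # parameters activated per token, in billions (from the summary)


def active_fraction(total_b: float, active_b: float) -> float:
    """Return the share of parameters used for a single token."""
    if total_b <= 0:
        raise ValueError("total_b must be positive")
    return active_b / total_b


def sparsity_ratio(total_b: float, active_b: float) -> float:
    """Return how many times larger the full model is than its active slice."""
    if active_b <= 0:
        raise ValueError("active_b must be positive")
    return total_b / active_b


if __name__ == "__main__":
    frac = active_fraction(TOTAL_PARAMS_B, ACTIVE_PARAMS_B)
    ratio = sparsity_ratio(TOTAL_PARAMS_B, ACTIVE_PARAMS_B)
    print(f"Active share per token: {frac:.2%}")
    print(f"Total-to-active ratio:  {ratio:.1f}x")
```

Under these figures, each token touches a little over 4% of the weights, so the full model is roughly 23 times larger than the slice active for any one token.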