AIBullishMicrosoft Research Blog Β· 4h ago1
π§
Phi-4-reasoning-vision and the lessons of training a multimodal reasoning model
Microsoft Research announces Phi-4-reasoning-vision-15B, a 15 billion parameter open-weight multimodal reasoning model. The model is designed for vision-language tasks including image captioning and is available through Microsoft Foundry, HuggingFace, and GitHub.