AINeutralarXiv โ CS AI ยท 4h ago4/10
๐ง
Moondream Segmentation: From Words to Masks
Researchers present Moondream Segmentation, an AI vision-language model that can segment specific objects in images based on text descriptions. The model achieves strong performance with 80.2% cIoU on RefCOCO validation and uses reinforcement learning to improve mask quality through iterative refinement.
$MATIC