AINeutralarXiv – CS AI · 7h ago6/10
🧠
GeoSAM-3D: Geodesic Prompt Propagation for Open-Vocabulary 3D Scene Segmentation from Monocular Video
GeoSAM-3D introduces a novel approach to 3D scene segmentation from monocular video by combining foundation models with Gaussian Splatting and geodesic propagation, enabling users to segment objects with simple clicks or text prompts without requiring RGB-D cameras or pre-reconstructed meshes.