AI · Neutral · arXiv – CS AI · 3h ago · 6/10
SSR3D-LLM: Structured Spatial Reasoning via Latent Steps for Fine-Grained Grounding in Unified 3D-LLMs
SSR3D-LLM introduces a structured spatial reasoning approach for 3D object grounding in unified large language models, enabling fine-grained localization of objects in 3D scenes through sequential reasoning steps rather than single-pointer decisions. The method achieves state-of-the-art results across multiple benchmarks while maintaining compatibility with existing 3D-LLM architectures.