🧠 AI🟢 BullishImportance 6/10

NavCrafter: Exploring 3D Scenes from a Single Image

arXiv – CS AI|Hongbo Duan, Peiyu Zhuang, Yi Liu, Zhengyang Zhang, Yuxin Zhang, Pengting Luo, Fangming Liu, Xueqian Wang|April 6, 2026 at 04:00 AM

🤖AI Summary

NavCrafter is a new AI framework that creates flexible 3D scenes from a single image by generating novel-view video sequences with controllable camera movement. The system uses video diffusion models and enhanced 3D Gaussian Splatting to achieve superior 3D reconstruction and novel-view synthesis under large viewpoint changes.

Key Takeaways

→NavCrafter introduces a novel framework for exploring 3D scenes from single images, addressing costly direct 3D data acquisition challenges.
→The system leverages video diffusion models with geometry-aware expansion to progressively extend scene coverage.
→Multi-stage camera control enables controllable multi-view synthesis through dual-branch camera injection and attention modulation.
→Enhanced 3D Gaussian Splatting pipeline includes depth-aligned supervision and structural regularization for improved reconstruction.
→Experimental results show state-of-the-art performance in novel-view synthesis under large viewpoint shifts.