AINeutralarXiv – CS AI · 8h ago6/10
🧠
Co-Fusion4D: Spatio-temporal Collaborative Fusion for Robust 3D Object Detection
Co-Fusion4D is a new framework for 3D object detection in autonomous driving that addresses spatiotemporal inconsistencies in Bird's Eye View (BEV) detectors by using current-frame-centric fusion with historical frame alignment. The approach achieves state-of-the-art performance on the nuScenes benchmark (74.9% mAP, 75.6% NDS) through a Dual Attention Fusion module that enhances temporal stability without test-time augmentation.