y0news
#vision-language-model
1 article
AI · Bullish · arXiv – CS AI · 6h ago

Less is More: Lean yet Powerful Vision-Language Model for Autonomous Driving

Researchers introduce Max-V1, a vision-language model framework that treats autonomous driving as a language problem, predicting trajectories directly from camera input. The model achieves a performance improvement of over 30% on the nuScenes dataset and demonstrates strong cross-vehicle adaptability.