AIBullisharXiv โ CS AI ยท 7h ago7/10
๐ง
Can VLMs Unlock Semantic Anomaly Detection? A Framework for Structured Reasoning
Researchers introduce SAVANT, a model-agnostic framework that improves Vision Language Models' ability to detect semantic anomalies in autonomous driving scenarios by 18.5% through structured reasoning instead of ad hoc prompting. The team used this approach to label 10,000 real-world images and fine-tuned an open-source 7B model achieving 90.8% recall, demonstrating practical deployment feasibility without proprietary model dependency.