AIBearisharXiv – CS AI · 7h ago7/10
🧠
A Structured Benchmark for Text-Guided Anomaly Detection: When Language Stops Conditioning the Decision
Researchers introduce TGAD, a new benchmark for evaluating text-guided anomaly detection systems, revealing that current multimodal vision-language models do not actually use language instructions to condition their decisions as claimed. Testing shows that removing object nouns causes performance to collapse, and component-level instructions fail to constrain defect detection, suggesting these systems rely primarily on visual features rather than genuine language guidance.