AINeutralarXiv – CS AI · 7h ago6/10
🧠
LASA: A Weak Supervision Method for Open-Vocabulary Scene Sketch Semantic Segmentation
Researchers introduce LASA, a weak supervision method for open-vocabulary sketch semantic segmentation that aggregates multi-layer Vision Transformer attention maps to capture complementary spatial cues. The approach achieves significant improvements over baselines without requiring pixel-level annotations, advancing computer vision capabilities for sparse line drawing interpretation.