y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#fine-grained-perception News & Analysis

2 articles tagged with #fine-grained-perception. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

2 articles
AIBearisharXiv – CS AI · 4d ago7/10
🧠

VisualNeedle: Benchmarking Active Visual Search in Information-Dense Scenes

Researchers introduce VisualNeedle, a benchmark that exposes limitations in multimodal large language models' ability to perform genuine fine-grained visual search in information-dense scenes. Despite frontier MLLMs reporting over 90% accuracy on existing benchmarks, VisualNeedle reveals that these models struggle significantly when critical evidence is spatially constrained to minute regions, with the best model achieving only 56% accuracy versus 63% human performance.

AIBullisharXiv – CS AI · Apr 157/10
🧠

CropVLM: Learning to Zoom for Fine-Grained Vision-Language Perception

Researchers introduce CropVLM, a reinforcement learning-based method that enables Vision-Language Models to dynamically focus on relevant image regions for improved fine-grained understanding tasks. The approach works with existing VLMs without modification and demonstrates significant performance gains on text recognition and document analysis without requiring human-labeled training data.