y0news
#decoding-technique
1 article
AI · Neutral · arXiv – CS AI · 1d ago · 6/10

Revealing Multi-View Hallucination in Large Vision-Language Models

Researchers identify "multi-view hallucination" as a major problem in large vision-language models (LVLMs): the models confuse visual information from different viewpoints or instances. They created the MVH-Bench benchmark and developed Reference Shift Contrastive Decoding (RSCD), a decoding technique that improved performance by up to 34.6 points without requiring model retraining.
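
The summary does not say how RSCD builds or uses its contrast, so the sketch below is not the authors' method. It only illustrates the general contrastive decoding idea that such techniques belong to: at each decoding step, the logits from a target pass are compared with the logits from a second, reference pass, and tokens that both passes favor are penalized. The function names, the `alpha` weight, and the toy logits are all assumptions made for illustration.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def contrastive_logits(target_logits, reference_logits, alpha=1.0):
    """Generic contrastive decoding step (hypothetical helper, not RSCD itself).

    Amplifies what the target pass prefers and subtracts what the
    reference pass also prefers: (1 + alpha) * target - alpha * reference.
    """
    if len(target_logits) != len(reference_logits):
        raise ValueError("logit vectors must have the same length")
    return [
        (1 + alpha) * t - alpha * r
        for t, r in zip(target_logits, reference_logits)
    ]


# Toy vocabulary of three candidate tokens (values are made up).
target = [2.0, 1.9, 0.1]     # pass conditioned on the intended view
reference = [0.5, 1.9, 0.1]  # pass conditioned on a contrasting input

adjusted = contrastive_logits(target, reference, alpha=1.0)
probs = softmax(adjusted)
best = max(range(len(probs)), key=probs.__getitem__)
print(best, [round(p, 3) for p in probs])
```

In this toy case, token 1 scores almost as high as token 0 in the target pass, but the reference pass likes it just as much, so the contrast widens the gap in favor of token 0. Because the adjustment happens only on logits at decoding time, an approach of this kind needs no change to model weights, which is consistent with the summary's note that no retraining is required.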