AINeutralarXiv โ CS AI ยท 14h ago6/10
๐ง
Belief-Aware VLM Model for Human-like Reasoning
Researchers propose a belief-aware Vision Language Model framework that enhances human-like reasoning by integrating retrieval-based memory and reinforcement learning. The approach addresses limitations in current VLMs and VLAs by approximating belief states through vector-based memory, demonstrating improved performance on vision-question-answering tasks compared to zero-shot baselines.