AI · Bullish · arXiv - CS AI · 5h ago · 6/10
Persistent Visual Memory: Sustaining Perception for Deep Generation in LVLMs
Researchers propose Persistent Visual Memory (PVM), a lightweight module that addresses visual signal degradation in Large Vision-Language Models by maintaining consistent visual perception during long text generation. Integrated into Qwen3-VL models, PVM demonstrates measurable accuracy improvements with minimal computational overhead, particularly benefiting complex reasoning tasks.