AI · Bullish · arXiv – CS AI · May 4 · 6/10
Persistent Visual Memory: Sustaining Perception for Deep Generation in LVLMs
Researchers propose Persistent Visual Memory (PVM), a lightweight module that addresses visual signal degradation in Large Vision-Language Models by maintaining consistent visual perception during long text generation. Integrated into Qwen3-VL models, PVM demonstrates measurable accuracy improvements with minimal computational overhead, particularly benefiting complex reasoning tasks.