AIBullisharXiv – CS AI · 18h ago6/10
🧠
OmniMem: Perturbation-aware Memory Compression for Streaming Audio-Visual LLMs
OmniMem is a new memory compression framework for audio-visual large language models that enables efficient long-form video understanding by using modality-aware memory allocation and perturbation-aware token selection. The approach achieves 2-4% accuracy improvements over existing compression methods while reducing memory requirements, with potential applications in real-time video AI systems.