AIBullisharXiv โ CS AI ยท 17h ago6/10
๐ง
Cut to the Chase: Training-free Multimodal Summarization via Chain-of-Events
Researchers introduce CoE, a training-free multimodal summarization framework that uses a Chain-of-Events approach with Hierarchical Event Graph to better understand and summarize content across videos, transcripts, and images. The system achieves significant performance improvements over existing methods, showing average gains of +3.04 ROUGE, +9.51 CIDEr, and +1.88 BERTScore across eight datasets.