AIBullisharXiv โ CS AI ยท 5d ago6/103
๐ง
A High-Quality Dataset and Reliable Evaluation for Interleaved Image-Text Generation
Researchers introduced InterSyn, a 1.8M sample dataset designed to improve Large Multimodal Models' ability to generate interleaved image-text content. The dataset includes a new evaluation framework called SynJudge that measures four key performance metrics, with experiments showing significant improvements even with smaller 25K-50K sample subsets.