AIBullisharXiv โ CS AI ยท 5d ago6/104
๐ง
VINCIE: Unlocking In-context Image Editing from Video
Researchers introduce VINCIE, a novel approach that learns in-context image editing directly from videos without requiring specialized models or curated training data. The method uses a block-causal diffusion transformer trained on video sequences and achieves state-of-the-art results on multi-turn image editing benchmarks.