AIBullisharXiv โ CS AI ยท 5d ago6/104
๐ง
TTOM: Test-Time Optimization and Memorization for Compositional Video Generation
Researchers introduce TTOM (Test-Time Optimization and Memorization), a training-free framework that improves compositional video generation in Video Foundation Models during inference. The system uses layout-attention optimization and parametric memory to better align text prompts with generated video outputs, showing strong transferability across different scenarios.