AI · Neutral · arXiv – CS AI · 14h ago · 6/10
LoCoT2V-Bench: Benchmarking Long-Form and Complex Text-to-Video Generation
Researchers introduce LoCoT2V-Bench, a new benchmark for evaluating long-form video generation from complex text prompts, along with LoCoT2V-Eval, a multi-dimensional evaluation framework. Testing 17 models reveals that while perceptual quality is strong, fine-grained text alignment and character consistency remain major technical challenges in the field.