AINeutralarXiv – CS AI · 18h ago6/10
🧠
ViMax: Agentic Video Generation
ViMax introduces an agentic multi-agent framework for long-form video generation that maintains narrative coherence and visual consistency across extended scenes. The system uses hierarchical narrative planning, retrieval-augmented generation, and VLM-guided agents to coordinate specialized components that negotiate storytelling decisions while tracking character and environmental states.