AINeutralarXiv – CS AI · 8h ago6/10
🧠
Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation
Researchers introduce Avatar Forcing, a new framework for generating interactive talking head avatars that respond to user inputs like speech and motion in real-time with approximately 500ms latency. The system uses diffusion forcing to enable multimodal interaction and a preference optimization method that learns expressive reactions without additional labeled data, achieving 80% preference over baseline models.