AINeutralarXiv – CS AI · 7h ago6/10
🧠
Temporally-Aligned Evaluation for Audio-Driven Talking Head Generation
Researchers propose a new evaluation framework for audio-driven talking head generation that uses sequence-level alignment instead of frame-by-frame comparison. The method accounts for natural timing variations in speech-driven facial motion, providing more accurate assessment of generative model quality across different datasets and speaking styles.