#audio-visual-synthesis News & Analysis

3 articles tagged with #audio-visual-synthesis. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

3 articles

AINeutralarXiv – CS AI · May 286/10

🧠

From Talking to Singing: A New Challenge for Audio-Visual Deepfake Detection

Researchers have developed a new deepfake detection framework called T-AVFD that addresses a critical gap in audio-visual forgery detection by handling singing scenarios, where traditional cross-modal inconsistency methods fail. The study introduces the SHDF dataset and demonstrates improved detection performance across both talking and singing deepfakes through text-guided pattern learning.

AINeutralarXiv – CS AI · May 125/10

🧠

ChladniSonify: A Visual-Acoustic Mapping Method for Chladni Patterns in New Media Art Creation

ChladniSonify presents a real-time system that maps visual Chladni patterns to acoustic frequencies using deep learning and plate theory, achieving 99.33% classification accuracy with sub-50ms latency. The engineering prototype bridges audio-visual art creation by automating the traditionally subjective mapping between vibration patterns and sound, addressing technical barriers in new media art workflows.

AIBullisharXiv – CS AI · Mar 96/10

🧠

TempoSyncDiff: Distilled Temporally-Consistent Diffusion for Low-Latency Audio-Driven Talking Head Generation

Researchers introduce TempoSyncDiff, a new AI framework that uses distilled diffusion models to generate realistic talking head videos from audio with significantly reduced computational latency. The system addresses key challenges in AI-driven video synthesis including temporal instability, identity drift, and audio-visual alignment while enabling deployment on edge computing devices.