Eroding Trust in Real Speech: A Large-Scale Study of Human Audio Deepfake Perception
A listening study with 1,768 participants finds that humans remain about as accurate as before at detecting fake audio (71.2%), but their trust in authentic speech has eroded significantly: relative to 2021 baselines, accuracy on real samples fell from 72.7% to 64.1%. Modern commercial and language-model-generated deepfakes are the hardest for human listeners to detect, whereas ML detectors maintain over 94.5% accuracy across all conditions.
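To make the size of the drop on real samples concrete, the short sketch below works through the arithmetic using only the two figures quoted above. The variable names are illustrative and not taken from the study, and the relative-change figure is a derived quantity that the summary itself does not report.

```python
# Accuracy on real (authentic) samples, in percent, as quoted in the summary.
real_acc_2021 = 72.7
real_acc_now = 64.1

# Absolute change, expressed in percentage points.
drop_pp = real_acc_2021 - real_acc_now

# Relative change, as a fraction of the 2021 baseline (derived, not reported).
relative_drop = drop_pp / real_acc_2021

print(f"Absolute drop: {drop_pp:.1f} percentage points")
print(f"Relative drop: {relative_drop:.1%} of the 2021 baseline")
```

The absolute drop is 8.6 percentage points, which corresponds to roughly 11.8% of the 2021 baseline accuracy on real samples.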