Linguistically Augmented Audio Speech Data (LinguAS)
Researchers introduce LinguAS, a dataset of more than 800 audio samples annotated with linguistic features, built to improve detection of deepfaked and spoofed speech. Models trained on this linguistically augmented data significantly outperform existing deepfake detection baselines, helping to close a critical gap in audio forensics.