Ethical and Technical Limits of Deepfake Speech Datasets
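Researchers auditing 39 deepfake speech detection datasets found critical flaws that undermine both fairness claims and generalization metrics. Most datasets lack demographic metadata, so fairness across speaker groups cannot be checked. Widespread overlap in underlying training sources also means that evaluating a detector on a second dataset may not be an independent test, creating an illusion of robustness: reported performance may not transfer to real-world scenarios.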
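The overlap concern can be made concrete with a small sketch. It is not the auditors' method: the function name `source_overlap`, the dataset names, and the corpus names are all hypothetical placeholders, and the Jaccard index is one assumed way to quantify shared sources between two datasets.

```python
from itertools import combinations


def source_overlap(sources_by_dataset):
    """Return the Jaccard overlap of source corpora for every dataset pair.

    `sources_by_dataset` maps a dataset name to the collection of source
    corpora it was built from. A value near 1.0 means two datasets share
    most of their sources, so testing on one after training on the other
    is not an independent check.
    """
    overlaps = {}
    for a, b in combinations(sorted(sources_by_dataset), 2):
        sa, sb = set(sources_by_dataset[a]), set(sources_by_dataset[b])
        union = sa | sb
        # Two datasets with no recorded sources are treated as non-overlapping.
        overlaps[(a, b)] = len(sa & sb) / len(union) if union else 0.0
    return overlaps


# Hypothetical inventory; these names are illustrative, not real datasets.
inventory = {
    "dataset_a": {"corpus_1", "corpus_2"},
    "dataset_b": {"corpus_2", "corpus_3"},
    "dataset_c": {"corpus_4"},
}

overlaps = source_overlap(inventory)
for pair, score in overlaps.items():
    print(pair, round(score, 2))
```

In this made-up inventory, `dataset_a` and `dataset_b` share one of three distinct corpora, so a detector trained on one and tested on the other has already seen part of the test distribution. Only `dataset_c` is a fully independent benchmark for either.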