AIBearisharXiv โ CS AI ยท 6h ago1
๐ง
Turning Black Box into White Box: Dataset Distillation Leaks
Researchers discovered that dataset distillation, a technique for compressing large datasets into smaller synthetic ones, has serious privacy vulnerabilities. The study introduces an Information Revelation Attack (IRA) that can extract sensitive information from synthetic datasets, including predicting the distillation algorithm, model architecture, and recovering original training samples.