AINeutralarXiv – CS AI · 6h ago6/10
🧠
Advancing the State-of-the-Art in Empirical Privacy Auditing
Researchers propose a new empirical privacy auditing framework for fine-tuned large language models that uses synthetic canaries generated via high-temperature sampling to detect data leakage. The method also introduces a novel audit for synthetic data generated from privacy-sensitive models, revealing how model capacity and training data characteristics affect memorization risks.