AINeutralarXiv – CS AI · 7h ago6/10
🧠
idSCD: Identifying Training Datasets through Semantic Correlation Descriptors
Researchers have developed a new method called Semantic Correlation Descriptors (SCDs) to identify whether a specific dataset was used to train a machine learning model by analyzing the spurious correlations embedded in its learned structure. This white-box approach outperforms existing black-box membership inference techniques, achieving up to 60% higher accuracy in detecting dataset membership across natural language and medical text classification tasks.