y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#membership-inference News & Analysis

13 articles tagged with #membership-inference. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

13 articles
AINeutralarXiv – CS AI · May 287/10
🧠

RULER: Representation-Level Verification of Machine Unlearning

Researchers introduce RULER, a verification framework that detects machine unlearning failures at the representation level rather than just output metrics. The study reveals that popular unlearning methods pass traditional evaluation tests yet still retain encoded information about forgotten data in their internal representations, highlighting a critical gap in current verification protocols.

AIBearisharXiv – CS AI · May 277/10
🧠

Pretraining Data Exposure in Large Language Models: A Survey of Membership Inference, Data Contamination, and Security Implications

A comprehensive survey examines Pretraining Data Exposure (PDE) in large language models, unifying two previously isolated research areas—membership inference and data contamination—to assess whether specific data appeared in LLM training datasets. The work formalizes exposure levels, reviews attack and defense mechanisms, and highlights privacy and evaluation integrity risks as model sizes and training data scales continue to grow.

AIBearisharXiv – CS AI · May 117/10
🧠

On Privacy Leakage in Tabular Diffusion Models: Influential Factors, Attacker Knowledge, and Metrics

Researchers demonstrate significant privacy vulnerabilities in tabular diffusion models (TDMs), which are increasingly used to generate synthetic data as privacy-preserving alternatives. Through membership inference attacks in both black-box and white-box settings, the study reveals that attackers can successfully breach these systems without perfect knowledge of training data or massive computational resources, while also exposing flaws in commonly-used privacy metrics.

AIBearisharXiv – CS AI · May 77/10
🧠

Beyond Public Access in LLM Pre-Training Data

Researchers using copyrighted O'Reilly Media books conducted membership inference attacks on OpenAI's language models, finding that GPT-4o exhibits patterns suggesting recognition of pay-walled content (AUROC 0.82) while GPT-4o Mini shows minimal recognition (AUROC 0.56). The findings highlight gaps in corporate transparency around AI training data sources and underscore the need for formal licensing frameworks.

🏢 OpenAI🧠 GPT-4
AIBearisharXiv – CS AI · Apr 207/10
🧠

Noise Aggregation Analysis Driven by Small-Noise Injection: Efficient Membership Inference for Diffusion Models

Researchers have developed a novel membership inference attack against diffusion models that uses noise aggregation analysis and small-noise injection to determine whether specific data samples were included in training datasets. The method significantly reduces computational costs while improving accuracy compared to existing approaches, highlighting emerging privacy vulnerabilities in widely-deployed generative AI systems like Stable Diffusion.

🧠 Stable Diffusion
AIBearisharXiv – CS AI · Apr 147/10
🧠

Powerful Training-Free Membership Inference Against Autoregressive Language Models

Researchers have developed EZ-MIA, a training-free membership inference attack that dramatically improves detection of memorized data in fine-tuned language models by analyzing probability shifts at error positions. The method achieves 3.8x higher detection rates than previous approaches on GPT-2 and demonstrates that privacy risks in fine-tuned models are substantially greater than previously understood.

🧠 Llama
AINeutralarXiv – CS AI · Mar 177/10
🧠

Membership Inference for Contrastive Pre-training Models with Text-only PII Queries

Researchers developed UMID, a new text-only auditing framework to detect if personally identifiable information was memorized during training of multimodal AI models like CLIP and CLAP. The method significantly improves efficiency and effectiveness of membership inference attacks while maintaining privacy constraints.

AIBullisharXiv – CS AI · Mar 167/10
🧠

Learnability and Privacy Vulnerability are Entangled in a Few Critical Weights

Researchers discovered that privacy vulnerabilities in neural networks exist in only a small fraction of weights, but these same weights are critical for model performance. They developed a new approach that preserves privacy by rewinding and fine-tuning only these critical weights instead of retraining entire networks, maintaining utility while defending against membership inference attacks.

AIBearisharXiv – CS AI · Mar 97/10
🧠

Window-based Membership Inference Attacks Against Fine-tuned Large Language Models

Researchers developed WBC (Window-Based Comparison), a new membership inference attack method that significantly outperforms existing approaches by analyzing localized patterns in Large Language Models rather than global signals. The technique achieves 2-3 times better detection rates and exposes critical privacy vulnerabilities in fine-tuned LLMs through sliding window analysis and binary voting mechanisms.

AINeutralarXiv – CS AI · Jun 16/10
🧠

idSCD: Identifying Training Datasets through Semantic Correlation Descriptors

Researchers have developed a new method called Semantic Correlation Descriptors (SCDs) to identify whether a specific dataset was used to train a machine learning model by analyzing the spurious correlations embedded in its learned structure. This white-box approach outperforms existing black-box membership inference techniques, achieving up to 60% higher accuracy in detecting dataset membership across natural language and medical text classification tasks.

AINeutralarXiv – CS AI · May 276/10
🧠

Assessing Per-Sample Membership Inference Vulnerability without Retraining

Researchers propose a novel method to assess individual training data vulnerability to membership inference attacks without requiring shadow models. The approach combines theoretical analysis in linear settings with a practical surrogate score for deep networks, using only geometry and loss information from a single trained model.

AIBullisharXiv – CS AI · May 96/10
🧠

PACZero: PAC-Private Fine-Tuning of Language Models via Sign Quantization

PACZero introduces a novel PAC-private fine-tuning mechanism for large language models that achieves usable utility while maintaining zero mutual information leakage, surpassing traditional differential privacy approaches. Using sign quantization of zeroth-order gradients, the method exploits moments of unanimous agreement across candidate subsets to eliminate privacy costs, demonstrating competitive performance on benchmark tasks like SST-2 and SQuAD.

AINeutralarXiv – CS AI · May 96/10
🧠

SMI: Statistical Membership Inference for Reliable Unlearned Model Auditing

Researchers propose Statistical Membership Inference (SMI), a new training-free auditing method that challenges the reliability of existing Membership Inference Attacks (MIAs) for verifying machine unlearning. The framework addresses a fundamental flaw in current auditing approaches by reformulating the problem as estimating non-member proportions in feature distributions, eliminating the need for computationally expensive shadow model training.