y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#privacy-preservation News & Analysis

16 articles tagged with #privacy-preservation. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

16 articles
AIBullisharXiv – CS AI · 1d ago7/10
🧠

Vision Language Model Helps Private Information De-Identification in Vision Data

Researchers introduce VisShield, a privacy-enhancing framework for Vision Language Models that uses specialized instruction-tuning and the OPTIC dataset to detect and mask sensitive information like Protected Health Information in images. The approach combines OCR-focused prompts with tailored training to enable VLMs to recognize privacy-sensitive text and output precise bounding boxes for effective de-identification.

AINeutralarXiv – CS AI · May 287/10
🧠

RULER: Representation-Level Verification of Machine Unlearning

Researchers introduce RULER, a verification framework that detects machine unlearning failures at the representation level rather than just output metrics. The study reveals that popular unlearning methods pass traditional evaluation tests yet still retain encoded information about forgotten data in their internal representations, highlighting a critical gap in current verification protocols.

AIBullisharXiv – CS AI · May 97/10
🧠

FIT to Forget: Robust Continual Unlearning for Large Language Models

Researchers introduce FIT, a continual unlearning framework enabling large language models to efficiently forget privacy-sensitive, copyrighted, and harmful content across sequential deletion requests. The method addresses critical limitations of existing single-shot unlearning approaches by preventing catastrophic forgetting while maintaining model utility, demonstrated across models up to 14B parameters.

AIBullisharXiv – CS AI · Apr 147/10
🧠

Private Seeds, Public LLMs: Realistic and Privacy-Preserving Synthetic Data Generation

Researchers propose RPSG, a novel method for generating synthetic data from private text using large language models while maintaining differential privacy protections. The approach uses private seeds and formal privacy mechanisms during candidate selection, achieving high fidelity synthetic data with stronger privacy guarantees than existing methods.

AIBullisharXiv – CS AI · Mar 167/10
🧠

Learnability and Privacy Vulnerability are Entangled in a Few Critical Weights

Researchers discovered that privacy vulnerabilities in neural networks exist in only a small fraction of weights, but these same weights are critical for model performance. They developed a new approach that preserves privacy by rewinding and fine-tuning only these critical weights instead of retraining entire networks, maintaining utility while defending against membership inference attacks.

AIBullisharXiv – CS AI · 1d ago6/10
🧠

A Comparative Study of Student Perspectives on Technical Writing Feedback Quality: Evaluating LLMs, SLMs, and Humans in Computer Science Topics

A research study compares feedback quality from locally-hosted small language models (SLMs), commercial LLMs like GPT-4, and human instructors across computer science courses. The findings show that quantized Llama-3.1 matched commercial LLM performance while offering privacy and cost advantages, though human feedback remained superior for specialized writing tasks.

🧠 GPT-4🧠 Llama
AINeutralarXiv – CS AI · 2d ago6/10
🧠

Towards Efficient and Exact Forgetting Services in Pre-Trained-Model-based Continual Learning

Researchers propose Analytic Continual Unlearning (ACU), a gradient-free method enabling efficient removal of specific knowledge from pre-trained models during continuous learning phases while preserving privacy. The approach uses closed-form solutions to handle sequential forgetting requests, addressing gaps in existing unlearning techniques that struggle with privacy violations and adversarial request patterns.

AINeutralarXiv – CS AI · 6d ago6/10
🧠

ZeroUnlearn: Few-Shot Knowledge Unlearning in Large Language Models

Researchers introduce ZeroUnlearn, a novel machine unlearning framework that efficiently removes sensitive information from large language models through knowledge re-mapping and representational orthogonality, rather than expensive retraining. The method preserves overall model utility while selectively unlearning harmful data in few-shot settings, addressing critical privacy and safety concerns in LLMs.

AINeutralarXiv – CS AI · May 276/10
🧠

Shadow Unlearning: A Neuro-Semantic Approach to Fidelity-Preserving Faceless Forgetting in LLMs

Researchers introduce Shadow Unlearning, a privacy-preserving machine unlearning method that removes training data influence from LLMs without exposing sensitive information to attacks. The Neuro-Semantic Projector Unlearning (NSPU) framework achieves this while maintaining model performance and is 10x more computationally efficient than existing approaches.

AINeutralarXiv – CS AI · May 126/10
🧠

diffGHOST: Diffusion based Generative Hedged Oblivious Synthetic Trajectories

diffGHOST is a new conditional diffusion model that synthesizes mobility trajectories while preserving privacy through latent space segmentation. The approach addresses a critical gap in existing generative models that lack formal privacy guarantees despite handling sensitive personal movement data.

AINeutralarXiv – CS AI · May 116/10
🧠

INO-SGD: Addressing Utility Imbalance under Individualized Differential Privacy

Researchers propose INO-SGD, a novel algorithm addressing the utility imbalance problem in individualized differential privacy (IDP) machine learning systems. The algorithm strategically down-weights sensitive data batches to prevent underrepresentation of privacy-protected subsets, improving model performance for high-privacy users while maintaining differential privacy guarantees.

AINeutralarXiv – CS AI · May 96/10
🧠

ICU-Bench:Benchmarking Continual Unlearning in Multimodal Large Language Models

Researchers introduce ICU-Bench, a new benchmark for testing machine unlearning in multimodal AI models, addressing privacy concerns from large-scale training datasets. The benchmark reveals that current unlearning methods struggle with continuous privacy deletion requests, highlighting a critical gap between theoretical approaches and real-world deployment needs.

AINeutralarXiv – CS AI · Apr 206/10
🧠

Harmonizing Multi-Objective LLM Unlearning via Unified Domain Representation and Bidirectional Logit Distillation

Researchers propose a multi-objective unlearning framework for Large Language Models that simultaneously removes hazardous information, preserves general utility, avoids over-refusal, and resists adversarial attacks. The method uses unified domain representation and bidirectional logit distillation to harmonize competing optimization goals, achieving state-of-the-art performance across diverse unlearning requirements.

AINeutralarXiv – CS AI · Apr 106/10
🧠

AdaProb: Efficient Machine Unlearning via Adaptive Probability

Researchers propose AdaProb, a machine unlearning method that enables trained AI models to efficiently forget specific data while preserving privacy and complying with regulations like GDPR. The approach uses adaptive probability distributions and demonstrates 20% improvement in forgetting effectiveness with 50% less computational overhead compared to existing methods.

AIBullisharXiv – CS AI · Mar 176/10
🧠

Computation and Communication Efficient Federated Unlearning via On-server Gradient Conflict Mitigation and Expression

Researchers propose FOUL (Federated On-server Unlearning), a new framework for efficiently removing specific participants' data from federated learning models without accessing client data. The approach reduces computational and communication costs while maintaining privacy compliance through a two-stage process that performs unlearning operations on the server side.

AIBullisharXiv – CS AI · Mar 26/1017
🧠

Controllable Reasoning Models Are Private Thinkers

Researchers developed a method to train AI reasoning models to follow privacy instructions in their internal reasoning traces, not just final answers. The approach uses separate LoRA adapters and achieves up to 51.9% improvement on privacy benchmarks, though with some trade-offs in task performance.