y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#data-protection News & Analysis

42 articles tagged with #data-protection. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

42 articles
AINeutralarXiv – CS AI · 3d ago7/10
🧠

RULER: Representation-Level Verification of Machine Unlearning

Researchers introduce RULER, a verification framework that detects machine unlearning failures at the representation level rather than just output metrics. The study reveals that popular unlearning methods pass traditional evaluation tests yet still retain encoded information about forgotten data in their internal representations, highlighting a critical gap in current verification protocols.

AIBearisharXiv – CS AI · 4d ago7/10
🧠

Pretraining Data Exposure in Large Language Models: A Survey of Membership Inference, Data Contamination, and Security Implications

A comprehensive survey examines Pretraining Data Exposure (PDE) in large language models, unifying two previously isolated research areas—membership inference and data contamination—to assess whether specific data appeared in LLM training datasets. The work formalizes exposure levels, reviews attack and defense mechanisms, and highlights privacy and evaluation integrity risks as model sizes and training data scales continue to grow.

AI × CryptoBullishBankless · May 237/10
🤖

Venice AI is Applied Cypherpunk

Venice AI applies cypherpunk principles to artificial intelligence inference, building privacy protections into AI systems rather than treating it as an afterthought. The project draws philosophical parallels to the cypherpunk movement's core belief that privacy must be architecturally embedded, not granted by benevolent actors.

Venice AI is Applied Cypherpunk
GeneralBearishFortune Crypto · May 117/10
📰

‘It’s here’: Google issues dire warning after catching hackers using AI to break into computers

Google has detected hackers actively using AI to enhance cyberattacks and breach computer systems, confirming long-standing security concerns about weaponized artificial intelligence. The discovery signals that the predicted convergence of AI capabilities with malicious intent has moved from theoretical risk to operational reality, potentially expanding the threat landscape for individuals and organizations worldwide.

‘It’s here’: Google issues dire warning after catching hackers using AI to break into computers
AIBearishDecrypt – AI · May 77/10
🧠

Chrome Deletes Its Own Privacy Promise for Sneaky On-Device AI

Google Chrome has quietly installed a 4GB on-device AI model while simultaneously removing privacy disclosures that previously promised to keep user data off Google's servers. This move raises significant concerns about transparency and the erosion of privacy protections in mainstream browsers.

Chrome Deletes Its Own Privacy Promise for Sneaky On-Device AI
AIBullisharXiv – CS AI · Apr 147/10
🧠

Private Seeds, Public LLMs: Realistic and Privacy-Preserving Synthetic Data Generation

Researchers propose RPSG, a novel method for generating synthetic data from private text using large language models while maintaining differential privacy protections. The approach uses private seeds and formal privacy mechanisms during candidate selection, achieving high fidelity synthetic data with stronger privacy guarantees than existing methods.

AI × CryptoBullisharXiv – CS AI · Apr 147/10
🤖

Hardening x402: PII-Safe Agentic Payments via Pre-Execution Metadata Filtering

Researchers have developed presidio-hardened-x402, an open-source middleware that filters personally identifiable information from AI agent payment requests using the x402 protocol before data reaches payment servers or centralized APIs. The tool achieves 97.2% precision in detecting PII with minimal latency, addressing a critical privacy gap where payment metadata is currently transmitted without data processing agreements.

AIBullisharXiv – CS AI · Apr 67/10
🧠

Opal: Private Memory for Personal AI

Researchers present Opal, a private memory system for personal AI that uses trusted hardware enclaves and oblivious RAM to protect user data privacy while maintaining query accuracy. The system achieves 13 percentage point improvement in retrieval accuracy over semantic search and 29x higher throughput with 15x lower costs than secure baselines.

AINeutralarXiv – CS AI · Mar 47/102
🧠

WARP: Weight Teleportation for Attack-Resilient Unlearning Protocols

Researchers introduce WARP, a new defense mechanism for machine unlearning protocols that protects against privacy attacks where adversaries can exploit differences between pre- and post-unlearning AI models. The technique reduces attack success rates by up to 92% while maintaining model accuracy on retained data.

AIBullisharXiv – CS AI · Mar 37/102
🧠

Model Collapse Is Not a Bug but a Feature in Machine Unlearning for LLMs

Researchers propose Partial Model Collapse (PMC), a novel machine unlearning method for large language models that removes private information without directly training on sensitive data. The approach leverages model collapse - where models degrade when trained on their own outputs - as a feature to deliberately forget targeted information while preserving general utility.

AIBearisharXiv – CS AI · Mar 37/103
🧠

Multi-PA: A Multi-perspective Benchmark on Privacy Assessment for Large Vision-Language Models

Researchers introduce Multi-PA, a comprehensive benchmark for evaluating privacy risks in Large Vision-Language Models (LVLMs), covering 26 personal privacy categories, 15 trade secrets, and 18 state secrets across 31,962 samples. Testing 21 open-source and 2 closed-source LVLMs revealed significant privacy vulnerabilities, with models generally posing high risks of facilitating privacy breaches across different privacy categories.

AIBearisharXiv – CS AI · Mar 37/104
🧠

AudAgent: Automated Auditing of Privacy Policy Compliance in AI Agents

Researchers have developed AudAgent, an automated tool that monitors AI agents in real-time to ensure they comply with their stated privacy policies. The tool revealed that many AI agents powered by major providers like Claude, Gemini, and DeepSeek fail to protect highly sensitive data like SSNs and violate their own privacy policies.

$LINK
AIBearisharXiv – CS AI · Feb 277/107
🧠

Large-scale online deanonymization with LLMs

Researchers demonstrate that large language models can successfully deanonymize pseudonymous users across online platforms at scale, achieving up to 68% recall at 90% precision. The study shows LLMs can match users between platforms like Hacker News and LinkedIn, or across Reddit communities, using only unstructured text data.

$NEAR
AINeutralOpenAI News · Nov 127/106
🧠

Fighting the New York Times’ invasion of user privacy

OpenAI is resisting the New York Times' request for access to 20 million private ChatGPT conversations, while simultaneously implementing enhanced security and privacy protections for user data. This legal dispute highlights growing tensions over data privacy and corporate access to AI conversation logs.

AIBullishGoogle DeepMind Blog · Oct 237/104
🧠

VaultGemma: The world's most capable differentially private LLM

VaultGemma represents a breakthrough as the most capable large language model trained from scratch using differential privacy techniques. This development advances privacy-preserving AI by demonstrating that sophisticated models can be built while maintaining strong data protection guarantees.

AI × CryptoBullishHugging Face Blog · Aug 27/106
🤖

Towards Encrypted Large Language Models with FHE

The article discusses the development of encrypted large language models using Fully Homomorphic Encryption (FHE) technology. This approach would allow AI models to process data while keeping it encrypted, potentially addressing privacy concerns in AI applications.

AINeutralarXiv – CS AI · 4d ago6/10
🧠

Shadow Unlearning: A Neuro-Semantic Approach to Fidelity-Preserving Faceless Forgetting in LLMs

Researchers introduce Shadow Unlearning, a privacy-preserving machine unlearning method that removes training data influence from LLMs without exposing sensitive information to attacks. The Neuro-Semantic Projector Unlearning (NSPU) framework achieves this while maintaining model performance and is 10x more computationally efficient than existing approaches.

AINeutralarXiv – CS AI · May 126/10
🧠

diffGHOST: Diffusion based Generative Hedged Oblivious Synthetic Trajectories

diffGHOST is a new conditional diffusion model that synthesizes mobility trajectories while preserving privacy through latent space segmentation. The approach addresses a critical gap in existing generative models that lack formal privacy guarantees despite handling sensitive personal movement data.

AINeutralarXiv – CS AI · May 116/10
🧠

INO-SGD: Addressing Utility Imbalance under Individualized Differential Privacy

Researchers propose INO-SGD, a novel algorithm addressing the utility imbalance problem in individualized differential privacy (IDP) machine learning systems. The algorithm strategically down-weights sensitive data batches to prevent underrepresentation of privacy-protected subsets, improving model performance for high-privacy users while maintaining differential privacy guarantees.

AIBullisharXiv – CS AI · Apr 206/10
🧠

Privacy-Preserving LLMs Routing

Researchers propose PPRoute, a privacy-preserving framework for LLM routing that uses Secure Multi-Party Computation (MPC) to protect user data while dynamically selecting between model providers. The system achieves 20x speedup over naive MPC implementations through optimized encoder inference, multi-step model training, and an efficient Top-k algorithm, maintaining routing quality without sacrificing privacy.

AINeutralarXiv – CS AI · Apr 206/10
🧠

From Vulnerable Data Subjects to Vulnerabilizing Data Practices: Navigating the Protection Paradox in AI-Based Analyses of Platformized Lives

This academic paper examines how AI and data science practices can paradoxically increase vulnerability of subjects they aim to protect, using a case study of computer vision analysis of children in monetized YouTube content. The authors develop an ethics protocol identifying four critical decision points—dataset design, operationalization, inference, and dissemination—where technical choices create vulnerabilizing factors including exposure, monetization, narrative fixing, and algorithmic optimization.

AIBullishAI News · Apr 156/10
🧠

Commvault launches a ‘Ctrl-Z’ for cloud AI workloads

Commvault has launched AI Protect, a governance solution that provides rollback capabilities for autonomous AI agents operating in cloud environments. The platform addresses critical risks posed by AI systems that can independently delete files, access databases, modify infrastructure, and alter security policies without adequate oversight or recovery mechanisms.

Page 1 of 2Next →