#data-protection News & Analysis

61 articles tagged with #data-protection. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

61 articles

AINeutralarXiv – CS AI · Jun 197/10

🧠

TRAP: Benchmark for Task-completion and Resistance to Active Privacy-extraction

Researchers introduce TRAP, a benchmark evaluating AI agents' ability to complete document-intensive tasks using private information while resisting extraction attempts. Testing 22 models reveals all exhibit privacy leakage, with instruction-following ability correlating to higher exposure risk, though a proposed structural isolation method using hash keys shows promise in mitigating the fundamental trade-off between task accuracy and privacy protection.

AIBearishCrypto Briefing · Jun 107/10

🧠

CrowdStrike warns of rising cyberattacks from China targeting AI

CrowdStrike has issued a warning about escalating cyberattacks originating from China that specifically target AI infrastructure and assets. The threat underscores the critical vulnerability of AI systems to state-sponsored cyber operations and highlights the urgent need for robust cybersecurity defenses across the AI industry.

AIBearisharXiv – CS AI · Jun 107/10

🧠

IDP-Bench: Benchmarking ability of LLMs to protect personal information in interdependent privacy contexts

Researchers introduced IDP-Bench, the first benchmark evaluating how well large language models protect interdependent privacy—where one person's data can be revealed by others without consent. Testing eight open-source LLMs revealed strong performance in recognizing data co-ownership but significant weaknesses in understanding contextual integrity parameters and judging sharing appropriateness, with smaller models showing particular vulnerability to prompt sensitivity.

AIBearishCrypto Briefing · Jun 37/10

🧠

Meta’s AI chatbot breach exposes security flaws, impacts high-profile accounts

Meta's AI chatbot experienced a significant security breach that exposed high-profile Instagram accounts, revealing critical vulnerabilities in authentication mechanisms for large-scale AI systems. The incident underscores the urgent need for more robust security protocols as AI deployments expand across consumer-facing platforms.

AINeutralarXiv – CS AI · May 287/10

🧠

RULER: Representation-Level Verification of Machine Unlearning

Researchers introduce RULER, a verification framework that detects machine unlearning failures at the representation level rather than just output metrics. The study reveals that popular unlearning methods pass traditional evaluation tests yet still retain encoded information about forgotten data in their internal representations, highlighting a critical gap in current verification protocols.

AIBearisharXiv – CS AI · May 277/10

🧠

Pretraining Data Exposure in Large Language Models: A Survey of Membership Inference, Data Contamination, and Security Implications

A comprehensive survey examines Pretraining Data Exposure (PDE) in large language models, unifying two previously isolated research areas—membership inference and data contamination—to assess whether specific data appeared in LLM training datasets. The work formalizes exposure levels, reviews attack and defense mechanisms, and highlights privacy and evaluation integrity risks as model sizes and training data scales continue to grow.

AI × CryptoBullishBankless · May 237/10

🤖

Venice AI is Applied Cypherpunk

Venice AI applies cypherpunk principles to artificial intelligence inference, building privacy protections into AI systems rather than treating it as an afterthought. The project draws philosophical parallels to the cypherpunk movement's core belief that privacy must be architecturally embedded, not granted by benevolent actors.

GeneralBearishFortune Crypto · May 117/10

📰

‘It’s here’: Google issues dire warning after catching hackers using AI to break into computers

Google has detected hackers actively using AI to enhance cyberattacks and breach computer systems, confirming long-standing security concerns about weaponized artificial intelligence. The discovery signals that the predicted convergence of AI capabilities with malicious intent has moved from theoretical risk to operational reality, potentially expanding the threat landscape for individuals and organizations worldwide.

AIBearishDecrypt – AI · May 77/10

🧠

Chrome Deletes Its Own Privacy Promise for Sneaky On-Device AI

Google Chrome has quietly installed a 4GB on-device AI model while simultaneously removing privacy disclosures that previously promised to keep user data off Google's servers. This move raises significant concerns about transparency and the erosion of privacy protections in mainstream browsers.

AIBullisharXiv – CS AI · Apr 147/10

🧠

Private Seeds, Public LLMs: Realistic and Privacy-Preserving Synthetic Data Generation

Researchers propose RPSG, a novel method for generating synthetic data from private text using large language models while maintaining differential privacy protections. The approach uses private seeds and formal privacy mechanisms during candidate selection, achieving high fidelity synthetic data with stronger privacy guarantees than existing methods.

AI × CryptoBullisharXiv – CS AI · Apr 147/10

🤖

Hardening x402: PII-Safe Agentic Payments via Pre-Execution Metadata Filtering

Researchers have developed presidio-hardened-x402, an open-source middleware that filters personally identifiable information from AI agent payment requests using the x402 protocol before data reaches payment servers or centralized APIs. The tool achieves 97.2% precision in detecting PII with minimal latency, addressing a critical privacy gap where payment metadata is currently transmitted without data processing agreements.

AIBullisharXiv – CS AI · Apr 67/10

🧠

Opal: Private Memory for Personal AI

Researchers present Opal, a private memory system for personal AI that uses trusted hardware enclaves and oblivious RAM to protect user data privacy while maintaining query accuracy. The system achieves 13 percentage point improvement in retrieval accuracy over semantic search and 29x higher throughput with 15x lower costs than secure baselines.

AIBearishArs Technica – AI · Mar 57/10

🧠

Workers report watching Ray-Ban Meta-shot footage of people using the bathroom

Meta faces accusations of concealing privacy facts about Ray-Ban smart glasses after workers reported viewing footage of people in bathrooms. The allegations raise serious concerns about user privacy and data handling practices for wearable AI devices.

AIBearishTechCrunch – AI · Mar 57/10

🧠

Meta sued over AI smart glasses’ privacy concerns, after workers reviewed nudity, sex, and other footage

Meta faces a lawsuit over privacy concerns regarding its AI smart glasses, with allegations that the company's marketing promised user control while subcontractors were actually reviewing customer footage including sensitive content. The legal action centers on discrepancies between Meta's privacy promises and actual data handling practices.

AINeutralarXiv – CS AI · Mar 47/102

🧠

WARP: Weight Teleportation for Attack-Resilient Unlearning Protocols

Researchers introduce WARP, a new defense mechanism for machine unlearning protocols that protects against privacy attacks where adversaries can exploit differences between pre- and post-unlearning AI models. The technique reduces attack success rates by up to 92% while maintaining model accuracy on retained data.

AIBearisharXiv – CS AI · Mar 37/104

🧠

AudAgent: Automated Auditing of Privacy Policy Compliance in AI Agents

Researchers have developed AudAgent, an automated tool that monitors AI agents in real-time to ensure they comply with their stated privacy policies. The tool revealed that many AI agents powered by major providers like Claude, Gemini, and DeepSeek fail to protect highly sensitive data like SSNs and violate their own privacy policies.

$LINK

AIBearisharXiv – CS AI · Mar 37/103

🧠

Multi-PA: A Multi-perspective Benchmark on Privacy Assessment for Large Vision-Language Models

Researchers introduce Multi-PA, a comprehensive benchmark for evaluating privacy risks in Large Vision-Language Models (LVLMs), covering 26 personal privacy categories, 15 trade secrets, and 18 state secrets across 31,962 samples. Testing 21 open-source and 2 closed-source LVLMs revealed significant privacy vulnerabilities, with models generally posing high risks of facilitating privacy breaches across different privacy categories.

AIBullisharXiv – CS AI · Mar 37/102

🧠

Model Collapse Is Not a Bug but a Feature in Machine Unlearning for LLMs

Researchers propose Partial Model Collapse (PMC), a novel machine unlearning method for large language models that removes private information without directly training on sensitive data. The approach leverages model collapse - where models degrade when trained on their own outputs - as a feature to deliberately forget targeted information while preserving general utility.

AIBearisharXiv – CS AI · Feb 277/107

🧠

Large-scale online deanonymization with LLMs

Researchers demonstrate that large language models can successfully deanonymize pseudonymous users across online platforms at scale, achieving up to 68% recall at 90% precision. The study shows LLMs can match users between platforms like Hacker News and LinkedIn, or across Reddit communities, using only unstructured text data.

$NEAR

AINeutralOpenAI News · Nov 127/106

🧠

Fighting the New York Times’ invasion of user privacy

OpenAI is resisting the New York Times' request for access to 20 million private ChatGPT conversations, while simultaneously implementing enhanced security and privacy protections for user data. This legal dispute highlights growing tensions over data privacy and corporate access to AI conversation logs.

AIBullishGoogle DeepMind Blog · Oct 237/104

🧠

VaultGemma: The world's most capable differentially private LLM

VaultGemma represents a breakthrough as the most capable large language model trained from scratch using differential privacy techniques. This development advances privacy-preserving AI by demonstrating that sophisticated models can be built while maintaining strong data protection guarantees.

AINeutralOpenAI News · Jun 57/106

🧠

How we’re responding to The New York Times’ data demands in order to protect user privacy

OpenAI is challenging a court order from The New York Times that would require indefinite retention of ChatGPT and API user data. The company is fighting the demands to protect user privacy while addressing legal requirements and maintaining data protection commitments.

AI × CryptoBullishHugging Face Blog · Aug 27/106

🤖

Towards Encrypted Large Language Models with FHE

The article discusses the development of encrypted large language models using Fully Homomorphic Encryption (FHE) technology. This approach would allow AI models to process data while keeping it encrypted, potentially addressing privacy concerns in AI applications.

CryptoBearishCrypto Briefing · Jun 256/10

⛓️

Bithumb fined 210M won for user data transfer violations

South Korean cryptocurrency exchange Bithumb has been fined 210 million won for violating user data transfer regulations, underscoring intensifying regulatory oversight of crypto platforms. The penalty reflects broader compliance pressures facing exchanges globally as regulators strengthen data protection requirements.

AINeutralarXiv – CS AI · Jun 236/10

🧠

$\pi$-RAG: Oblivious Retrieval via Semantic Quantization and Transcendental Addressing for Large Language Models

Researchers introduce π-RAG, a novel retrieval architecture that protects sensitive data in Large Language Models by using the digits of pi as an oblivious indirection layer, eliminating direct exposure of vector embeddings to inversion attacks. The system combines semantic quantization with cryptographic salting to enable privacy-preserving retrieval for compliance-heavy sectors like finance and healthcare.

Page 1 of 3Next →