y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#data-security News & Analysis

16 articles tagged with #data-security. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

16 articles
AIBearisharXiv – CS AI · 5d ago7/10
🧠

Cross-modal linkage risk in clinical vision-language models

Researchers discovered that vision-language models trained on paired chest X-rays and medical reports can re-link de-identified images to their original reports through embedding similarity, creating a privacy vulnerability. The team demonstrated this risk scales with model specialization and developed a differential privacy technique that reduces re-linkage by 62% while preserving diagnostic utility.

AI × CryptoBullishBlockonomi · May 307/10
🤖

AI Agents Are Leaking Alpha: Here is How Crypto Infrastructure Is Closing the Privacy Gap

Centralized AI inference systems create security vulnerabilities by logging and retaining user prompts containing valuable trading signals and proprietary information. Crypto infrastructure projects like NEAR, Phala, and Nillion are addressing this privacy gap through trusted execution environments (TEEs) and multi-party computation (MPC), enabling encrypted AI inference with minimal performance degradation.

$NEAR
AIBullishOpenAI News · May 187/10
🧠

OpenAI and Dell partner to bring Codex to hybrid and on-premise enterprise environments

OpenAI and Dell have partnered to deploy Codex, OpenAI's AI coding model, in enterprise hybrid and on-premise environments, enabling organizations to implement AI-powered coding agents while maintaining data security and control. This collaboration addresses enterprise demand for deploying advanced AI capabilities within existing infrastructure rather than relying solely on cloud-based solutions.

🏢 OpenAI
AIBearisharXiv – CS AI · May 117/10
🧠

On Privacy Leakage in Tabular Diffusion Models: Influential Factors, Attacker Knowledge, and Metrics

Researchers demonstrate significant privacy vulnerabilities in tabular diffusion models (TDMs), which are increasingly used to generate synthetic data as privacy-preserving alternatives. Through membership inference attacks in both black-box and white-box settings, the study reveals that attackers can successfully breach these systems without perfect knowledge of training data or massive computational resources, while also exposing flaws in commonly-used privacy metrics.

AINeutralarXiv – CS AI · Mar 277/10
🧠

AI Security in the Foundation Model Era: A Comprehensive Survey from a Unified Perspective

Researchers propose a unified framework for AI security threats that categorizes attacks based on four directional interactions between data and models. The comprehensive taxonomy addresses vulnerabilities in foundation models through four categories: data-to-data, data-to-model, model-to-data, and model-to-model attacks.

AINeutralarXiv – CS AI · 5d ago6/10
🧠

Catch-Only-One: Non-Transferable Examples for Model-Specific Authorization

Researchers introduce non-transferable examples (NTEs), a novel data encoding technique that restricts unauthorized model access while preserving utility for authorized applications. The method leverages model-specific low-sensitivity subspaces to act as cryptographic-like controls on AI data usage, addressing regulatory demands for purpose limitation without requiring model retraining or deployment control.

GeneralBearishFortune Crypto · May 296/10
📰

California sues 23andMe over alleged ‘lax’ data security that failed to protect nearly 7 million users’ data in 2023 breach

California has sued 23andMe over inadequate data security that allowed hackers to access personal information from nearly 7 million users in a 2023 breach. The company agreed to a $50 million settlement in the resulting class-action lawsuit, highlighting growing regulatory scrutiny of genetic testing companies' cybersecurity practices.

California sues 23andMe over alleged ‘lax’ data security that failed to protect nearly 7 million users’ data in 2023 breach
AINeutralarXiv – CS AI · May 296/10
🧠

Combating Data Laundering in LLM Training

Researchers have developed Synthesis Data Reversion (SDR), a technique to detect unauthorized LLM training data even when that data has been deliberately obfuscated through stylistic transformation. The method works by inferring laundering patterns and generating synthetic queries that mimic the transformed data, effectively countering data laundering practices that previously evaded detection.

🧠 Llama
GeneralBullishGoogle Research Blog · May 276/10
📰

Private analytics via zero-trust aggregation

Zero-trust aggregation enables private analytics by aggregating sensitive data without exposing individual records, combining security protocols with privacy-preserving computation. This approach addresses the growing tension between data utility and user privacy, allowing organizations to extract insights while maintaining cryptographic guarantees against unauthorized access or data breaches.

Private analytics via zero-trust aggregation
AINeutralarXiv – CS AI · May 96/10
🧠

Channel-Level Semantic Perturbations: Unlearnable Examples for Diverse Training Paradigms

Researchers have developed a new technique called Shallow Semantic Camouflage (SSC) to protect personal data from unauthorized use in AI model training. The work addresses a critical gap where existing data protection methods fail under modern pretraining-finetuning paradigms, demonstrating that frozen pretrained weights significantly weaken previous unlearnable example approaches.

AINeutralMIT News – AI · Jan 56/104
🧠

MIT scientists investigate memorization risk in the age of clinical AI

MIT researchers have developed methods to test AI models used in clinical settings to prevent them from inadvertently revealing anonymized patient health data through memorization. This research addresses a critical privacy and security concern as healthcare AI systems become more prevalent.

AIBullishOpenAI News · Jul 186/105
🧠

New compliance and administrative tools for ChatGPT Enterprise

OpenAI has announced new administrative and compliance tools for ChatGPT Enterprise, including API integrations, SCIM provisioning, and GPT controls. These features are designed to help organizations manage compliance programs, enhance data security, and scale user access management more effectively.

AI × CryptoBullishHugging Face Blog · Nov 176/107
🤖

Sentiment Analysis on Encrypted Data with Homomorphic Encryption

The article discusses techniques for performing sentiment analysis on encrypted data using homomorphic encryption. This approach allows analysis of sensitive data while maintaining privacy, potentially enabling new applications in finance and other sectors requiring data confidentiality.

AINeutralGoogle Research Blog · Aug 204/108
🧠

Securing private data at scale with differentially private partition selection

The article discusses differentially private partition selection, a technique for securing private data at scale. This represents an advancement in privacy-preserving algorithms that can protect sensitive information while still allowing for data analysis and processing.

AINeutralHugging Face Blog · Jul 104/107
🧠

Experimenting with Automatic PII Detection on the Hub using Presidio

The article title suggests content about implementing automatic Personally Identifiable Information (PII) detection on a platform hub using Microsoft's Presidio tool. However, the article body appears to be empty or unavailable, preventing detailed analysis of the implementation details or implications.