#data-leakage News & Analysis

15 articles tagged with #data-leakage. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

15 articles

AIBearisharXiv – CS AI · Jun 237/10

🧠

MuPPET: A Benchmark for Contextual Privacy of LLM Assistants in Multi-Party Conversations

Researchers introduced MuPPET, a benchmark testing privacy vulnerabilities in large language model assistants operating in multi-party conversations. The study reveals that LLMs leak significantly more sensitive information in group settings than in one-to-one interactions, with both frontier and smaller open-weight models showing substantial exposure risks that existing privacy defenses cannot adequately address.

AIBearisharXiv – CS AI · Jun 117/10

🧠

Can AI Agents Synthesize Scientific Conclusions?

Researchers introduced SciConBench, a benchmark evaluating AI agents' ability to synthesize scientific conclusions from systematic reviews. Testing eight frontier models and research agents under controlled conditions revealed fundamental limitations: the best-performing agent achieved only 0.337 factual F1 score, with consumer-facing tools like Google AI Overview generating incomplete or contradictory conclusions despite available ground-truth answers.

🏢 Google

AIBearisharXiv – CS AI · Jun 97/10

🧠

VisualLeakBench: Reproducible Action-Boundary Propagation Failures in Vision-Language Agents

Researchers introduce VisualLeakBench, a 500-image benchmark that reveals critical security vulnerabilities in vision-language agents, where sensitive information visible in screenshots and documents is propagated into tool arguments. Testing four production VLM systems shows baseline failure rates of 78.8% for personally identifiable information and 85.5% for unsafe text, with defensive prompts reducing PII propagation but leaving unsafe-text leakage at 52.6%.

AIBearisharXiv – CS AI · Jun 27/10

🧠

PrivacyPeek: Auditing What LLM-Based Agents Acquire, Not Just What They Say

PrivacyPeek introduces a new benchmark for evaluating privacy vulnerabilities in LLM-based agents, revealing that autonomous AI systems routinely acquire sensitive information beyond what tasks require. The research demonstrates that existing privacy audits miss critical acquisition-stage leakage, where data enters the agent's context, and that current prompt-level defenses are largely ineffective.

AIBearisharXiv – CS AI · May 127/10

🧠

Political Plasticity: An Analysis of Ideological Adaptability in Large Language Models

Researchers developed a testing framework to study "political plasticity"—how Large Language Models adapt their ideological responses based on user context. The study found that newer, larger LLMs reliably shift responses along economic and personal freedom axes when prompted with few-shot examples, while older models show limited adaptability, raising concerns about potential data leakage and model reliability.

AINeutralarXiv – CS AI · May 97/10

🧠

A Review of Large Language Models for Stock Price Forecasting from a Hedge-Fund Perspective

A comprehensive review examines how large language models are being applied to stock price forecasting in quantitative finance, with particular emphasis on practical challenges often overlooked in academic literature. The analysis, framed from a hedge-fund perspective, addresses critical implementation issues including sentiment analysis fragility, data leakage risks, and market friction constraints that affect real-world trading performance.

AIBearisharXiv – CS AI · May 97/10

🧠

LeakDojo: Decoding the Leakage Threats of RAG Systems

LeakDojo is a new research framework that systematically evaluates security vulnerabilities in Retrieval-Augmented Generation (RAG) systems, revealing that stronger LLM instruction-following capabilities correlate with higher data leakage risks. The study benchmarks six attack methods across multiple LLMs and datasets, providing critical insights into how RAG databases can be exploited and suggesting that improvements in RAG faithfulness may paradoxically increase security vulnerabilities.

AIBearishDecrypt · May 77/10

🧠

Your AI Chatbot May Be Leaking Your Chats to Meta, TikTok and Google

A new study reveals that popular AI chatbots including ChatGPT, Claude, Grok, and Perplexity are sharing user data with third-party ad trackers like Meta, TikTok, and Google, often without explicit user consent and even when users reject cookie tracking. This finding raises significant privacy and regulatory concerns for millions of users relying on these platforms.

🏢 Perplexity🧠 ChatGPT🧠 Claude

AIBearisharXiv – CS AI · Mar 267/10

🧠

Uncovering Memorization in Timeseries Imputation models: LBRM Membership Inference and its link to attribute Leakage

Researchers have identified critical privacy vulnerabilities in deep learning models used for time series imputation, demonstrating that these models can leak sensitive training data through membership and attribute inference attacks. The study introduces a two-stage attack framework that successfully retrieves significant portions of training data even from models designed to be robust against overfitting-based attacks.

AIBearisharXiv – CS AI · Mar 117/10

🧠

Security Considerations for Multi-agent Systems

A comprehensive study reveals that multi-agent AI systems (MAS) face distinct security vulnerabilities that existing frameworks inadequately address. The research evaluated 16 AI security frameworks against 193 identified threats across 9 categories, finding that no framework achieves majority coverage in any single category, with non-determinism and data leakage being the most under-addressed areas.

AINeutralarXiv – CS AI · Jun 236/10

🧠

Leakage-Aware Benchmarking of LLM Forecasting: Real-Time Nowcasts as the Decision-Time Input for Macro Factor Ranking

Researchers benchmark a retrieval-augmented LLM system for equity factor ranking using strictly decision-time information, avoiding data leakage common in forecasting benchmarks. The 7B model achieves modest positive results (median IC +0.154) comparable to simpler kNN baselines, suggesting real-time macro data and historical analogies drive most signal while LLMs may add marginal value in extreme rankings.

AINeutralarXiv – CS AI · Jun 26/10

🧠

Ghost Tool Calls: Issue-Time Privacy for Speculative Agent Tools

Researchers identify a privacy vulnerability in AI agents that use speculative tool calls to reduce latency, where external services receive and retain inferred user intent data even after the agent abandons the speculative branch. The study proposes Speculative Tool Privacy Contracts as a runtime solution, finding that only issue-time policies suppressing or modifying calls before dispatch effectively mitigate information leakage.

AIBullisharXiv – CS AI · Jun 26/10

🧠

scicode-lint: Detecting Methodology Bugs in Scientific Python Code with LLM-Generated Patterns

Researchers introduced scicode-lint, an AI-powered linter that automatically detects methodology bugs in scientific Python code by using large language models to generate detection patterns rather than hand-coding them. The tool addresses a critical gap where traditional static analysis fails to catch subtle errors like data leakage and incorrect cross-validation that produce plausible but wrong results, achieving 65% precision on preprocessing leakage detection with 100% recall on benchmark tests.

AINeutralarXiv – CS AI · Jun 26/10

🧠

Privacy Policy Enforcement Guardrails for Data-Sensitive Retrieval-Augmented Generation

Researchers introduce a Privacy Policy Enforcement framework that detects subtle data leakage in RAG systems beyond standard PII filters, using dual one-class density estimators to identify contextual attribute clusters that collectively identify individuals. The T3+OCSVM detector achieves 93%+ AUROC while reducing false positives by 44-55% and maintaining millisecond latency, outperforming traditional supervised approaches.

AINeutralarXiv – CS AI · Apr 146/10

🧠

Detecting RAG Extraction Attack via Dual-Path Runtime Integrity Game

Researchers propose CanaryRAG, a runtime defense mechanism that protects Retrieval-Augmented Generation systems from adversarial attacks that extract proprietary data from knowledge bases. The solution uses embedded canary tokens to detect leakage in real-time while maintaining normal system performance, offering a practical safeguard for organizations deploying RAG-based AI systems.