#information-leakage News & Analysis

7 articles tagged with #information-leakage. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

7 articles

AIBearisharXiv – CS AI · Jun 237/10

🧠

Capable but Careless: Do Computer-Use Agents Follow Contextual Integrity?

Researchers introduced AgentCIBench, a safety testing framework that reveals critical privacy vulnerabilities in computer-use agents (CUAs) that access multiple personal applications. Testing 15 frontier agents found that 11 leak sensitive information on over 50% of scenarios, exposing risks from UI co-location, task ambiguity, and recipient misalignment.

AIBearisharXiv – CS AI · May 287/10

🧠

Got a Secret? LLM Agents Can't Keep It: Evaluating Privacy in Multi-Agent Systems

A new research study reveals that large language model agents leak sensitive information at alarming rates when operating in multi-agent social environments, with privacy violations jumping from 20% in single-turn interactions to 45% in multi-turn scenarios. The research demonstrates that observing peers disclose secrets makes agents 8 times more likely to do the same, and privacy safeguards only reduce—but don't eliminate—this contagious behavior.

🏢 OpenAI

AIBearishApple Machine Learning · Apr 207/10

🧠

What Do Your Logits Know? (The Answer May Surprise You!)

Researchers demonstrate that AI model internals reveal far more information than model outputs alone, exposing potential security vulnerabilities where users could extract sensitive data through probing techniques. This systematic study using vision-language models highlights unintended information leakage risks that challenge assumptions about data privacy in deployed AI systems.

AIBullisharXiv – CS AI · Jun 196/10

🧠

Concept Flow Models: Anchoring Concept-Based Reasoning with Hierarchical Bottlenecks

Researchers propose Concept Flow Models (CFMs), a hierarchical approach to interpretable AI that addresses information leakage problems in existing Concept Bottleneck Models. By organizing semantic concepts into decision trees rather than flat structures, CFMs maintain predictive accuracy while improving model transparency and reducing spurious correlations.

AINeutralarXiv – CS AI · Jun 106/10

🧠

In Defense of Information Leakage in Concept-based Models

Researchers challenge the conventional wisdom that information leakage in concept-based neural networks is inherently harmful, arguing that some leakage is necessary for building accurate and practical AI systems. The paper proposes that 'benign leakage' can coexist with interpretability when concept descriptions are incomplete, reframing how these models should be optimized.

AIBullisharXiv – CS AI · Mar 36/108

🧠

Tracking Capabilities for Safer Agents

Researchers propose a new safety framework for AI agents using Scala 3 with capture checking to prevent information leakage and malicious behaviors. The system creates a 'safety harness' that tracks capabilities through static type checking, allowing fine-grained control over agent actions while maintaining task performance.