y0news

#information-leakage News & Analysis

4 articles tagged with #information-leakage. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

AI · Bearish · arXiv – CS AI · 4d ago · 7/10

Got a Secret? LLM Agents Can't Keep It: Evaluating Privacy in Multi-Agent Systems

A new research study reveals that large language model agents leak sensitive information at alarming rates when operating in multi-agent social environments, with privacy violations jumping from 20% in single-turn interactions to 45% in multi-turn scenarios. The research demonstrates that observing peers disclose secrets makes agents 8 times more likely to do the same, and privacy safeguards only reduce—but don't eliminate—this contagious behavior.

🏢 OpenAI

AI · Bearish · Apple Machine Learning · Apr 20 · 7/10

What Do Your Logits Know? (The Answer May Surprise You!)

Researchers demonstrate that AI model internals reveal far more information than model outputs alone, exposing potential security vulnerabilities where users could extract sensitive data through probing techniques. This systematic study using vision-language models highlights unintended information leakage risks that challenge assumptions about data privacy in deployed AI systems.

AI · Bullish · arXiv – CS AI · Mar 3 · 6/10 · 8

Tracking Capabilities for Safer Agents

Researchers propose a new safety framework for AI agents using Scala 3 with capture checking to prevent information leakage and malicious behaviors. The system creates a 'safety harness' that tracks capabilities through static type checking, allowing fine-grained control over agent actions while maintaining task performance.