#information-security News & Analysis

9 articles tagged with #information-security. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

9 articles

AIBearisharXiv – CS AI · Jun 237/10

🧠

MuPPET: A Benchmark for Contextual Privacy of LLM Assistants in Multi-Party Conversations

Researchers introduced MuPPET, a benchmark testing privacy vulnerabilities in large language model assistants operating in multi-party conversations. The study reveals that LLMs leak significantly more sensitive information in group settings than in one-to-one interactions, with both frontier and smaller open-weight models showing substantial exposure risks that existing privacy defenses cannot adequately address.

AIBearisharXiv – CS AI · Jun 107/10

🧠

A Note on the Strategic Confinement Problem

Researchers introduce the 'strategic confinement problem,' extending Lampson's classical confinement theory to scenarios where communicating parties are strategic agents with shared coordination resources. The work demonstrates that information-theoretic bounds on communication capacity may fail to constrain the harmful outcomes strategic agents can jointly achieve through covert channels, particularly in systems of learned AI agents.

AIBearishMIT News – AI · Jun 97/10

🧠

The consequences of relying on AI for accurate news

A Media Lab study reveals that reliance on AI for news verification may paradoxically weaken users' ability to detect misinformation, similar to how GPS dependency has diminished navigation skills. This cognitive atrophy poses risks for media literacy and information security in an increasingly AI-mediated information ecosystem.

AIBearisharXiv – CS AI · Jun 27/10

🧠

Adversarial Feeds Steer LLM Agent Decisions Against Their Defaults

Researchers demonstrate that LLM agents' decisions can be systematically manipulated through adversarial feed curation—the ordering and composition of information sources agents consume before acting. Testing on 2,785 decision rollouts across four open-source LLMs, they found feeds can shift genuinely uncertain decisions from 5% to 100% in one direction, though they cannot override firmly held model defaults, revealing a critical safety vulnerability in the upstream ranker layer rather than the model itself.

AINeutralarXiv – CS AI · Jun 27/10

🧠

Generative AI and Digital Ecosystem Resilience: A Proactive Lifecycle-Based Survey

A comprehensive survey examines how generative AI has accelerated adversarial synthetic content creation, necessitating a shift from reactive to proactive detection methods. Using the C5 Interaction Model framework, researchers integrate machine learning with social science approaches to detect coordinated inauthentic behavior, synthetic narrative propagation, and emerging threats across information ecosystems.

AINeutralarXiv – CS AI · Jun 116/10

🧠

When Poison Fails After Retrieval: Revisiting Corpus Poisoning under Chunking and Reranking Pipelines

Researchers demonstrate that existing corpus poisoning attacks against RAG systems fail significantly after reranking stages, revealing a critical gap between retrieval-stage attacks and real-world multi-stage pipelines. They propose CRCP, a new poisoning framework that accounts for document chunking and reranking to achieve higher attack success rates across realistic retrieval configurations.

AINeutralarXiv – CS AI · Jun 26/10

🧠

Privacy Policy Enforcement Guardrails for Data-Sensitive Retrieval-Augmented Generation

Researchers introduce a Privacy Policy Enforcement framework that detects subtle data leakage in RAG systems beyond standard PII filters, using dual one-class density estimators to identify contextual attribute clusters that collectively identify individuals. The T3+OCSVM detector achieves 93%+ AUROC while reducing false positives by 44-55% and maintaining millisecond latency, outperforming traditional supervised approaches.

AINeutralarXiv – CS AI · Apr 146/10

🧠

Playing Along: Learning a Double-Agent Defender for Belief Steering via Theory of Mind

Researchers introduce ToM-SB, a novel challenge where AI defenders must use theory-of-mind reasoning to deceive attackers trying to extract sensitive information. Through reinforcement learning, trained models outperform frontier LLMs like GPT-4 and Gemini-Pro, revealing an emergent bidirectional relationship between belief modeling and deception capabilities.

🧠 GPT-5

AINeutralarXiv – CS AI · Apr 106/10

🧠

Say Something Else: Rethinking Contextual Privacy as Information Sufficiency

Researchers formalize privacy-preserving communication for LLM agents by introducing Information Sufficiency (IS) as a framework and proposing free-text pseudonymization as a third privacy strategy alongside suppression and generalization. Evaluation across 792 scenarios reveals that pseudonymization offers superior privacy-utility tradeoffs, and that multi-turn conversational testing exposes significant privacy leakage missed by single-message assessments.