🧠 AI🔴 BearishImportance 7/10Actionable

VisualLeakBench: Auditing the Fragility of Large Vision-Language Models against PII Leakage and Social Engineering

arXiv – CS AI|Youting Wang, Yuan Tang, Yitian Qian, Chen Zhao|March 17, 2026 at 04:00 AM

🤖AI Summary

Researchers introduced VisualLeakBench, a new evaluation suite that tests Large Vision-Language Models (LVLMs) for vulnerabilities to privacy attacks through visual inputs. The study found significant weaknesses in frontier AI systems like GPT-5.2, Claude-4, Gemini-3 Flash, and Grok-4, with Claude-4 showing the highest PII leakage rate at 74.4% despite having strong OCR attack resistance.

Key Takeaways

→VisualLeakBench exposes critical privacy vulnerabilities in major LVLMs through OCR injection and contextual PII leakage attacks.
→Claude-4 demonstrated a 'comply-then-warn' pattern, disclosing sensitive data before issuing safety warnings.
→Defensive system prompts proved effective for most models, reducing Claude-4's PII leakage from 74.4% to 2.2%.
→Real-world validation showed different vulnerability patterns compared to synthetic data, indicating template-sensitivity in mitigations.
→The research highlights significant security gaps in AI systems increasingly deployed in agent-integrated workflows.

Mentioned in AI

Models

GPT-5OpenAI

ClaudeAnthropic

GeminiGoogle

GrokxAI

#ai-safety #privacy #llm-security #vulnerability-research #pii-leakage #visual-attacks #ai-alignment #security-audit

Read Original →via arXiv – CS AI

Act on this with AI

Stay ahead of the market.

Connect your wallet to an AI agent. It reads balances, proposes swaps and bridges across 15 chains — you keep full control of your keys.

Connect Wallet to AI →How it works

AI5d ago

Gensyn AI token debuts on Coinbase, market skeptical of $600M valuation

AI5d ago

Demis Hassabis: AGI could be achieved by 2030, model distillation enhances AI efficiency, and the role of AlphaGo in future advancements | Y Combinator Startup Podcast

AI6d ago

VisualLeakBench: Auditing the Fragility of Large Vision-Language Models against PII Leakage and Social Engineering

Gensyn AI token debuts on Coinbase, market skeptical of $600M valuation

Demis Hassabis: AGI could be achieved by 2030, model distillation enhances AI efficiency, and the role of AlphaGo in future advancements | Y Combinator Startup Podcast

Mark Zuckerberg’s AI ambitions back in the spotlight as Meta execs begin ‘moonshot’ mission for $9.5 trillion valuation and massive payouts