Invisible to Humans, Triggered by Agents: Stealthy Jailbreak Attacks on Mobile Vision-Language Agents
Researchers have discovered a new class of jailbreak vulnerability in mobile vision-language agents: malicious prompts that remain invisible to human users but are triggered during autonomous agent interactions. Using an optimization method called HG-IDA*, attackers achieve 82.5% planning and 75.0% execution hijack rates against GPT-4o by exploiting the absence of touch signals during agent operation, exposing a critical security gap in deployed mobile AI systems.
This research exposes a fundamental asymmetry in how vision-language models serve humans versus autonomous agents on mobile devices. The attack exploits the fact that automated agents generate near-zero touch-contact signals, creating an invisible window in which malicious visual prompts can execute undetected. Traditional jailbreak attempts require persistent visual manipulations that users would notice; this new paradigm separates agent perception from human perception entirely, making detection significantly harder.
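The paper's rendering details are not reproduced here, but the perception gap it exploits can be illustrated with a toy model: a payload encoded as pixel offsets far below human contrast thresholds is invisible on screen, yet an agent consuming the raw screenshot buffer recovers it exactly. All names, values, and the just-noticeable-difference threshold below are illustrative assumptions, not the paper's method.

```python
# Toy model of the human/agent perception gap: a "stealth" overlay perturbs
# background pixels by an amount below human contrast sensitivity, yet an
# agent reading raw screenshot pixels recovers the payload losslessly.
# BACKGROUND, STEALTH_DELTA, and jnd are illustrative values.

BACKGROUND = 250          # near-white UI background (0-255 grayscale)
STEALTH_DELTA = 2         # tiny offset: exact in the pixel buffer

def embed_stealth_bits(bits):
    """Encode prompt bits as tiny pixel offsets on the background."""
    return [BACKGROUND - STEALTH_DELTA * b for b in bits]

def agent_reads(pixels):
    """An agent consuming raw pixels recovers the payload exactly."""
    return [1 if p < BACKGROUND else 0 for p in pixels]

def human_perceives(pixels, jnd=8):
    """Humans cannot distinguish offsets below a just-noticeable difference."""
    return [0 if abs(p - BACKGROUND) < jnd else 1 for p in pixels]

payload = [1, 0, 1, 1, 0, 0, 1]
screen = embed_stealth_bits(payload)
assert agent_reads(screen) == payload          # agent sees the payload
assert human_perceives(screen) == [0] * 7      # human sees blank background
```

The same asymmetry applies to any channel where the agent's input pipeline has higher fidelity than human perception, which is why content-level filtering alone does not close the gap.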
The vulnerability stems from the rapid deployment of large vision-language models (LVLMs) as mobile agents without adequate consideration of realistic threat models. As AI systems move beyond controlled lab environments onto personal devices that handle sensitive user data and cross-app actions, the interaction surface expands dramatically. HG-IDA*, a one-shot optimization method, demonstrates how attackers can systematically bypass safety filters through prompt engineering tailored specifically for agent exploitation.
The 82.5% planning hijack rate represents a severe gap: agents can be reliably tricked into planning unauthorized actions before execution even begins. For users and developers, this suggests that current safety mechanisms in vision-language models are fundamentally inadequate for autonomous mobile scenarios. The attack succeeds because existing defenses focus on visible content manipulation rather than on the interaction patterns themselves.
Moving forward, the security community must prioritize interaction-level signals as a defense mechanism: systems need to detect when an agent perceives and responds to visual input differently from a human. This finding will likely accelerate industry discussions around agent sandboxing, permission models, and behavioral verification systems that validate whether agent actions align with authentic user intent.
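One concrete form such an interaction-level defense could take is a monitor that flags UI actions not preceded by a genuine touch-contact event. The event schema, field names, and the 500 ms window below are hypothetical assumptions for illustration, not an API from the paper or from any mobile OS.

```python
# Hypothetical sketch of an interaction-level defense: flag UI actions that
# lack a recent human touch-contact event, i.e. are likely agent-driven.
# Event kinds, fields, and WINDOW_MS are illustrative assumptions.

WINDOW_MS = 500  # max gap between a human touch and the action it authorizes

def flag_synthetic_actions(events):
    """Return names of actions not authorized by a recent touch-down.

    `events` is a time-ordered list of dicts such as
    {"t": 1200, "kind": "touch_down"} or
    {"t": 1300, "kind": "action", "name": "tap_send"}.
    """
    last_touch = None
    flagged = []
    for ev in events:
        if ev["kind"] == "touch_down":
            last_touch = ev["t"]
        elif ev["kind"] == "action":
            if last_touch is None or ev["t"] - last_touch > WINDOW_MS:
                flagged.append(ev["name"])
    return flagged

trace = [
    {"t": 0, "kind": "touch_down"},
    {"t": 120, "kind": "action", "name": "open_app"},       # human-authorized
    {"t": 4000, "kind": "action", "name": "send_payment"},  # no recent touch
]
assert flag_synthetic_actions(trace) == ["send_payment"]
```

A real deployment would need to distinguish legitimately delegated agent actions from hijacked ones, so a flag like this would gate sensitive actions behind re-confirmation rather than block them outright.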
- Mobile vision-language agents can be hijacked through invisible jailbreak prompts that never appear to human users, achieving up to 82.5% attack success rates
- The vulnerability exploits the lack of touch-contact signals during autonomous agent interactions, creating a detection-free attack window
- HG-IDA* optimization enables efficient one-shot jailbreak prompt construction that evades current LVLM safety filters
- Current AI safety mechanisms prioritize visible content manipulation over interaction-level anomalies, leaving agents exposed
- Cross-app action hijacking shows the threat extends beyond single-application compromise to system-wide unauthorized access
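The behavioral-verification idea raised above can be sketched as a pre-execution check that compares an agent's plan against the scope the user actually granted, which would also contain the cross-app hijacking noted in the last point. The intent schema and action names here are hypothetical, not a real permission API.

```python
# Hypothetical sketch of behavioral verification: before executing a planned
# sequence, reject any step outside the apps/actions the user authorized.
# The intent schema and step fields are illustrative assumptions.

def verify_plan(plan, user_intent):
    """Return (ok, violations) for a planned action sequence."""
    allowed_apps = set(user_intent["apps"])
    allowed_actions = set(user_intent["actions"])
    violations = [
        step for step in plan
        if step["app"] not in allowed_apps
        or step["action"] not in allowed_actions
    ]
    return (len(violations) == 0, violations)

intent = {"apps": {"calendar"}, "actions": {"read", "create_event"}}
plan = [
    {"app": "calendar", "action": "create_event"},  # within granted scope
    {"app": "banking", "action": "transfer"},       # cross-app hijack attempt
]
ok, bad = verify_plan(plan, intent)
assert not ok
assert bad == [{"app": "banking", "action": "transfer"}]
```

Because the check runs on the plan rather than on pixels, it is insensitive to how the injected prompt was hidden, addressing the planning-stage hijack directly.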