y0news
AnalyticsDigestsSourcesRSSAICrypto
#lean-41 article
1 articles
AINeutralarXiv โ€“ CS AI ยท 7h ago7/10
๐Ÿง 

The Defense Trilemma: Why Prompt Injection Defense Wrappers Fail?

Researchers prove mathematically that no continuous input-preprocessing defense can simultaneously maintain utility, preserve model functionality, and guarantee safety against prompt injection attacks in language models with connected prompt spaces. The findings establish a fundamental trilemma showing that defenses must inevitably fail at some threshold inputs, with results verified in Lean 4 and validated empirically across three LLMs.