y0news
AnalyticsDigestsSourcesRSSAICrypto
#defense-framework2 articles
2 articles
AIBullisharXiv โ€“ CS AI ยท Feb 277/104
๐Ÿง 

AgentSentry: Mitigating Indirect Prompt Injection in LLM Agents via Temporal Causal Diagnostics and Context Purification

Researchers have developed AgentSentry, a novel defense framework that protects AI agents from indirect prompt injection attacks by detecting and mitigating malicious control attempts in real-time. The system achieved 74.55% utility under attack, significantly outperforming existing defenses by 20-33 percentage points while maintaining benign performance.

AINeutralGoogle DeepMind Blog ยท Apr 26/105
๐Ÿง 

Evaluating potential cybersecurity threats of advanced AI

A new framework has been developed to help cybersecurity experts evaluate and prioritize defenses against potential threats from advanced AI systems. The framework aims to enable organizations to systematically identify necessary security measures and allocate resources effectively.