#execution-safety News & Analysis

4 articles tagged with #execution-safety. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

4 articles

AIBullisharXiv – CS AI · Mar 177/10

🧠

ILION: Deterministic Pre-Execution Safety Gates for Agentic AI Systems

Researchers introduce ILION, a deterministic safety system for autonomous AI agents that can execute real-world actions like financial transactions and API calls. The system achieves 91% precision with sub-millisecond latency, significantly outperforming existing text-safety infrastructure that wasn't designed for agent execution safety.

🏢 OpenAI🧠 Llama

AI × CryptoNeutralarXiv – CS AI · Mar 127/10

🤖

Execution Is the New Attack Surface: Survivability-Aware Agentic Crypto Trading with OpenClaw-Style Local Executors

Researchers propose Survivability-Aware Execution (SAE), a new security framework for AI-powered crypto trading systems that prevents execution-induced losses from compromised AI agents or malicious prompts. The system implements middleware protection between AI strategy engines and exchange executors, reducing maximum drawdown by 93.1% and attack success rates by 27.2% in testing.

AINeutralarXiv – CS AI · Jun 106/10

🧠

SkillResolve-Bench: Measuring and Resolving Same-Capability Ambiguity in Agent Skill Retrieval

Researchers introduce SkillResolve-Bench, a benchmark for evaluating agent skill retrieval systems that addresses the critical problem of selecting the correct skill variant when multiple capabilities are semantically similar. The benchmark includes 661 helper/risky skill pairs and proposes SkillResolve, a method that achieves safer procedural exposure by selecting appropriate skill representatives from capability families.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Safe Embodied AI for Long-horizon Tasks: A Cross-layer Analysis of Robotic Manipulation

A comprehensive survey examines safety mechanisms for embodied AI systems performing long-horizon robotic manipulation tasks, identifying critical gaps in current research across planning, policy design, and execution phases. The analysis reveals that while safety receives attention, evidence remains fragmented with limited formal guarantees, particularly for contact-rich manipulation scenarios in real-world deployment.