AI · Neutral · arXiv – CS AI · 14h ago · 6/10
STARS: Skill-Triggered Audit for Request-Conditioned Invocation Safety in Agent Systems
Researchers introduce STARS, a framework that continuously audits AI agent skill invocations in real time by combining static capability analysis with request-conditioned risk modeling. The approach detects prompt injection attacks better than static baselines, though it remains most valuable as a triage layer rather than as a complete replacement for pre-deployment screening.