y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#dynamic-testing News & Analysis

2 articles tagged with #dynamic-testing. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

2 articles
AIBullisharXiv – CS AI · Jun 117/10
🧠

Runtime Skill Audit: Targeted Runtime Probing for Agent Skill Security

Researchers introduced Runtime Skill Audit (RSA), a dynamic analysis method that detects malicious behavior in LLM agent skills by testing them under targeted runtime conditions rather than relying on static code review. RSA achieved 90% accuracy in identifying harmful skills and maintained effectiveness against evolving attacks where static methods failed, addressing a critical security gap in agent-based AI systems.

AINeutralarXiv – CS AI · Mar 176/10
🧠

NetArena: Dynamic Benchmarks for AI Agents in Network Automation

NetArena introduces a dynamic benchmarking framework for evaluating AI agents in network automation tasks, addressing limitations of static benchmarks through runtime query generation and network emulator integration. The framework reveals that AI agents achieve only 13-38% performance on realistic network queries, significantly improving statistical reliability by reducing confidence-interval overlap from 85% to 0%.