y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#process-monitoring News & Analysis

1 article tagged with #process-monitoring. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

1 articles
AINeutralarXiv – CS AI · 14h ago7/10
🧠

OpenClawBench: Benchmarking Process-side Anomalies in Real-world Agent Execution Trajectories

Researchers introduce OpenClawBench, a large-scale dataset of 31,264 annotated agent execution trajectories that reveals a significant gap between task success and process reliability. The study finds that 9.3% of oracle-passing executions contain process-side anomalies like unresolved ambiguities and unsafe operations, demonstrating that success metrics alone mask critical failure modes in AI agent systems.