y0news
#ai-workflows
2 articles
AI · Bullish · arXiv – CS AI · 5h ago

AgentAssay: Token-Efficient Regression Testing for Non-Deterministic AI Agent Workflows

Researchers introduce AgentAssay, which they describe as the first framework for regression testing non-deterministic AI agent workflows. They report a 78-100% reduction in testing cost while maintaining statistical guarantees. The system uses behavioral fingerprinting and stochastic testing methods to detect regressions in autonomous AI agents, and was evaluated across multiple models including GPT-5.2, Claude Sonnet 4.6, and others.

AI · Neutral · arXiv – CS AI · 5h ago

Delegation and Verification Under AI

Research shows AI creates phase transitions in workplace workflows where small differences in workers' verification abilities lead to dramatically different delegation behaviors. AI amplifies quality disparities between workers, with some rationally over-delegating while reducing oversight, potentially degrading institutional performance despite improved baseline task success.