AIBullisharXiv – CS AI · 18h ago7/10
🧠
SKILL.nb: Selective Formalization and Gated Execution for Durable Agent Workflows
SKILL.nb is a new framework that improves AI agent reliability by selectively formalizing workflow steps based on execution evidence, storing them as versioned notebooks with natural language guidance and executable code. The system achieved 53.7% success on web automation tasks and retained 91.7% performance across multiple re-executions, significantly outperforming existing baselines in handling environment drift and task specification changes.