#regression-testing News & Analysis

3 articles tagged with #regression-testing. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

3 articles

AIBullisharXiv – CS AI · May 297/10

🧠

GRASP: Gated Regression-Aware Skill Proposer for Self-Improving LLM Agents

Researchers introduce GRASP, a method for improving large language model agents through controlled skill library updates that prevent performance regression. Tested across five base models on clinical benchmarks, GRASP achieves dramatic improvements (40.6% to 88.8% on MedAgentBench) while maintaining stability, outperforming existing self-improvement approaches by significant margins.

🧠 GPT-4🧠 GPT-5🧠 Gemini

AIBullisharXiv – CS AI · Mar 46/104

🧠

AgentAssay: Token-Efficient Regression Testing for Non-Deterministic AI Agent Workflows

Researchers introduce AgentAssay, the first framework for regression testing AI agent workflows, achieving 78-100% cost reduction while maintaining statistical guarantees. The system uses behavioral fingerprinting and stochastic testing methods to detect regressions in autonomous AI agents across multiple models including GPT-5.2, Claude Sonnet 4.6, and others.

AINeutralarXiv – CS AI · Mar 33/104

🧠

Test Case Prioritization: A Snowballing Literature Review and TCPFramework with Approach Combinators

Researchers conducted a comprehensive literature review of test case prioritization (TCP) techniques and developed a new framework with ensemble methods called approach combinators. The study analyzed 324 TCP-related studies and proposed new evaluation metrics, with their methods achieving up to 2.7% reduction in regression testing time while performing comparably to state-of-the-art algorithms.