y0news

#regression-detection News & Analysis

2 articles tagged with #regression-detection. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

AI · Neutral · arXiv – CS AI · Jun 11 · 6/10

Layer-Isolated Evaluation: Gating the Deterministic Scaffold of a Production LLM Agent with a No-LLM, Regression-Locked Test Harness

Researchers present layer-isolated evaluation, a deterministic testing framework that decomposes LLM agents into eight functional layers, each validated independently without requiring LLM execution. Tests across 238 cases show that aggregate end-to-end metrics mask localized regressions: targeted layer failures cause drops of 25 to 91 percentage points in component-specific tests while barely affecting overall pass rates.
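
The masking effect is simple arithmetic, and a minimal sketch may make it concrete. The code below is not the paper's harness: the layer names, the suite sizes (seven layers of 30 cases and one of 10), and the failure count are all made-up assumptions chosen only to show how a large drop in one layer's pass rate can barely move the aggregate.

```python
from collections import defaultdict


def pass_rates(results):
    """Return (aggregate pass rate, per-layer pass rates) for (layer, passed) pairs."""
    totals, passes = defaultdict(int), defaultdict(int)
    for layer, passed in results:
        totals[layer] += 1
        passes[layer] += int(passed)
    per_layer = {name: passes[name] / totals[name] for name in totals}
    aggregate = sum(passes.values()) / sum(totals.values())
    return aggregate, per_layer


def make_suite(failing_in_small_layer):
    """Build a hypothetical 220-case suite; only layer_8 (10 cases) can fail."""
    results = []
    for i in range(1, 8):
        results += [(f"layer_{i}", True)] * 30
    results += [("layer_8", False)] * failing_in_small_layer
    results += [("layer_8", True)] * (10 - failing_in_small_layer)
    return results


baseline_agg, baseline_layers = pass_rates(make_suite(0))
regressed_agg, regressed_layers = pass_rates(make_suite(5))

# Drops expressed in percentage points.
aggregate_drop = (baseline_agg - regressed_agg) * 100
layer_drop = (baseline_layers["layer_8"] - regressed_layers["layer_8"]) * 100

print(f"aggregate drop: {aggregate_drop:.1f} points")  # aggregate drop: 2.3 points
print(f"layer_8 drop: {layer_drop:.1f} points")        # layer_8 drop: 50.0 points
```

In this toy suite, half of one layer's cases fail, yet the aggregate pass rate stays near 98%. Reporting pass rates per layer rather than only end to end is what makes such a regression visible.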

AI · Neutral · arXiv – CS AI · May 1 · 6/10

Test Before You Deploy: Governing Updates in the LLM Supply Chain

Researchers propose a deployment-side governance framework for managing Large Language Model updates, addressing the problem of silent behavioral changes in hosted LLM services that lack explicit versioning. The framework combines production contracts, risk-category-based testing, and compatibility gates to prevent regressions in functionality, safety, and performance.
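
As a rough illustration of how a contract and a compatibility gate could fit together, here is a minimal sketch. It is not the framework from the paper: the `Contract` class, the `compatibility_gate` function, the thresholds, and the pass rates are all hypothetical, and the three category names simply echo the functionality, safety, and performance risks mentioned in the summary.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Contract:
    """Hypothetical production contract: per-category floors plus a regression budget."""
    min_pass_rate: dict    # risk category -> minimum acceptable pass rate
    max_regression: float  # largest tolerated drop relative to the current model


def compatibility_gate(contract, baseline, candidate):
    """Return (allowed, violations) for a candidate model update."""
    violations = []
    for category, floor in contract.min_pass_rate.items():
        old = baseline.get(category, 0.0)
        new = candidate.get(category, 0.0)  # a missing category counts as failing
        if new < floor:
            violations.append(f"{category}: {new:.2f} is below the floor {floor:.2f}")
        elif old - new > contract.max_regression:
            violations.append(f"{category}: dropped from {old:.2f} to {new:.2f}")
    return not violations, violations


# All numbers below are invented for illustration.
contract = Contract(
    min_pass_rate={"functionality": 0.95, "safety": 0.99, "performance": 0.90},
    max_regression=0.02,
)
baseline = {"functionality": 0.98, "safety": 1.00, "performance": 0.95}
candidate = {"functionality": 0.97, "safety": 0.97, "performance": 0.94}

allowed, violations = compatibility_gate(contract, baseline, candidate)
print("deploy" if allowed else "block", violations)
```

Here the candidate is blocked because its safety pass rate falls below the contract floor, even though functionality and performance stay within the regression budget. Running such a check on the deployment side is one way to catch a silent provider-side update before it reaches users.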