AINeutralarXiv โ CS AI ยท 8h ago6/10
๐ง
Test Before You Deploy: Governing Updates in the LLM Supply Chain
Researchers propose a deployment-side governance framework for managing Large Language Model updates, addressing the problem of silent behavioral changes in hosted LLM services that lack explicit versioning. The framework combines production contracts, risk-category-based testing, and compatibility gates to prevent regressions in functionality, safety, and performance.