AIBullisharXiv – CS AI · 9h ago7/10
🧠
SAGE: Scalable AI Governance & Evaluation
Researchers and LinkedIn introduce SAGE, a framework that combines human judgment with AI surrogates to evaluate search relevance at scale. By using a bidirectional calibration loop between policy, precedent examples, and LLM judges, the system achieves near-human agreement while reducing inference costs by 92×, ultimately driving a 0.25% lift in LinkedIn's daily active users.