AIBullisharXiv โ CS AI ยท 4h ago7/10
๐ง
Robust LLM Performance Certification via Constrained Maximum Likelihood Estimation
Researchers propose a new constrained maximum likelihood estimation (MLE) method to accurately estimate failure rates of large language models by combining human-labeled data, automated judge annotations, and domain-specific constraints. The approach outperforms existing methods like Prediction-Powered Inference across various experimental conditions, providing a more reliable framework for LLM safety certification.