AINeutralarXiv – CS AI · 6h ago6/10
🧠
CITE: Anytime-Valid Statistical Inference in LLM Self-Consistency
Researchers propose CITE, an algorithm that enables reliable certification of Large Language Model outputs through multiple sampling while controlling error rates under data-dependent stopping conditions. The method addresses a critical challenge in LLM reliability by providing statistical guarantees without requiring advance knowledge of possible answer categories.