BEAVER: An Efficient Deterministic LLM Verifier
BEAVER is a new verification framework that computes mathematically sound probability bounds on whether large language models satisfy safety properties. It identifies 2-3x more risky outputs than existing methods while using 90% fewer computational resources. The framework addresses a critical gap in LLM deployment by providing deterministic guarantees rather than ad hoc sampling estimates.
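
To make the contrast between deterministic bounds and sampling estimates concrete, here is a minimal Python sketch. It is not BEAVER's algorithm. It assumes a toy next-token distribution (`toy_next_token_probs`), a prefix-closed safety property (`is_safe_prefix`), and a helper named `bound_safety_probability`, all of which are hypothetical and invented for illustration. The idea shown is only that expanding prefixes in order of probability yields a lower and an upper bound that always bracket the true probability of a safe output, with no randomness involved.

```python
import heapq

EOS = "<eos>"


def toy_next_token_probs(prefix):
    # Hypothetical stand-in for an LLM's next-token distribution.
    # A real model would condition on the prefix; this toy one does not.
    return {"a": 0.5, "bad": 0.2, EOS: 0.3}


def is_safe_prefix(prefix):
    # Assumed prefix-closed property: once "bad" appears,
    # every continuation of the prefix is unsafe.
    return "bad" not in prefix


def bound_safety_probability(next_token_probs, is_safe_prefix, max_expansions):
    """Return (lower, upper) bounds on P(output is safe)."""
    lower = 0.0     # mass of finished sequences known to be safe
    violated = 0.0  # mass of prefixes known to be unsafe
    counter = 0     # tie-breaker so the heap never compares prefixes
    frontier = [(-1.0, counter, ())]  # max-heap on prefix probability

    for _ in range(max_expansions):
        if not frontier:
            break
        neg_p, _, prefix = heapq.heappop(frontier)
        p = -neg_p
        for token, q in next_token_probs(prefix).items():
            mass = p * q
            new_prefix = prefix + (token,)
            if not is_safe_prefix(new_prefix):
                violated += mass  # all continuations are unsafe
            elif token == EOS:
                lower += mass     # finished and safe
            else:
                counter += 1
                heapq.heappush(frontier, (-mass, counter, new_prefix))

    # Unexplored mass could go either way, so it only widens the interval.
    return lower, 1.0 - violated


for budget in (1, 2, 30):
    lo, hi = bound_safety_probability(toy_next_token_probs, is_safe_prefix, budget)
    print(f"expansions={budget:2d}  lower={lo:.6f}  upper={hi:.6f}")
```

For this toy distribution the true probability of a safe output is 0.3 / (1 - 0.5) = 0.6, and the interval contains that value at every budget while narrowing as more prefixes are expanded. A sampling estimate, by contrast, would return a single number whose error depends on the random draws.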