AIBullisharXiv โ CS AI ยท 5h ago0
๐ง
Best-of-$\infty$ -- Asymptotic Performance of Test-Time Compute
Researchers propose 'best-of-โ' approach for large language models that uses majority voting with infinite samples, achieving superior performance but requiring infinite computation. They develop an adaptive generation scheme that dynamically selects the optimal number of samples based on answer agreement and extend the framework to weighted ensembles of multiple LLMs.