AIBullisharXiv โ CS AI ยท Mar 166/10
๐ง
AdaBoN: Adaptive Best-of-N Alignment
Researchers propose AdaBoN, an adaptive Best-of-N alignment method that improves computational efficiency in language model alignment by allocating inference-time compute based on prompt difficulty. The two-stage algorithm outperforms uniform allocation strategies while using 20% less computational budget.