AINeutralarXiv โ CS AI ยท 5h ago0
๐ง
Spectrum Tuning: Post-Training for Distributional Coverage and In-Context Steerability
Researchers introduce Spectrum Tuning, a new post-training method that improves AI language models' ability to generate diverse outputs and follow in-context steering instructions. The technique addresses limitations in current post-training approaches that reduce models' distributional coverage and flexibility when tasks require multiple valid answers rather than single correct responses.