AINeutralarXiv – CS AI · 15h ago6/10
🧠
Alignment Tuning for Large Language Models: A Data-Centric Lens on Alignment Data Pipelines
A new arXiv survey reframes large language model alignment tuning through a data-centric lens, decomposing alignment data construction into three stages: response synthesis, preference evaluation, and preference instantiation. By organizing existing alignment methods into a unified taxonomy, the research identifies design trade-offs and failure modes while establishing principles for improving alignment data pipeline design.