AI · Bearish · arXiv – CS AI · 14h ago · 7/10
When and How Human Curation Backfires: Preference Alignment under Multi-Model Self-Consuming Loop
A new study finds that human curation intended to align AI models can backfire in multi-model ecosystems, where models train on one another's outputs. Curation improves alignment in isolated systems, but cross-model interactions can dampen or even reverse that benefit, potentially degrading long-term alignment across interconnected AI systems.