AINeutralarXiv – CS AI · 15h ago6/10
🧠
PICACO: Pluralistic In-Context Value Alignment of LLMs via Total Correlation Optimization
Researchers introduce PICACO, a novel in-context alignment method that optimizes meta-instructions to help large language models better understand and balance multiple, often conflicting human values without fine-tuning. The approach uses total correlation optimization to improve alignment across up to 8 distinct values while reducing noise, addressing a key limitation where LLMs struggle to reconcile competing preferences in single prompts.