y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#preference-alignment News & Analysis

6 articles tagged with #preference-alignment. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

6 articles
AIBullisharXiv – CS AI · Apr 77/10
🧠

One Model for All: Multi-Objective Controllable Language Models

Researchers introduce Multi-Objective Control (MOC), a new approach that trains a single large language model to generate personalized responses based on individual user preferences across multiple objectives. The method uses multi-objective optimization principles in reinforcement learning from human feedback to create more controllable and adaptable AI systems.

AINeutralarXiv – CS AI · Jun 26/10
🧠

Efficient Exploration for Iterative Nash Preference Optimization

Researchers propose an improved Nash Learning from Human Feedback (NLHF) algorithm that addresses exploration challenges in preference alignment for large language models. The new method achieves better regret bounds without exponential dependence on regularization parameters and demonstrates empirical improvements when fine-tuning Llama-3-8B.

🧠 Llama
AINeutralarXiv – CS AI · May 296/10
🧠

In-Context Reward Adaptation for Robust Preference Modeling

Researchers propose In-Context Reward Adaptation, a transformer-based framework that dynamically models diverse human preferences without costly retraining. By incorporating human response time as an auxiliary signal, the approach enables language models to adapt to unseen preference domains on-the-fly, addressing a critical limitation of static reward models used in RLHF systems.

AINeutralarXiv – CS AI · May 286/10
🧠

StoryLens: Preference-Aligned Story Rewriting via Context-Aware Narrative Enrichment

Researchers introduce StoryLens, a framework for preference-aligned story rewriting that goes beyond style transfer to incorporate context-aware narrative enrichment. Human studies show context-enhanced rewriting improves reader satisfaction by 24.5% compared to style-only approaches, supported by a new benchmark, reward model, and two-stage rewriting system combining supervised learning with reinforcement learning.

AINeutralarXiv – CS AI · May 96/10
🧠

Flexible Agent Alignment with Goal Inference from Open-Ended Dialog

Researchers introduce Open-Universe Assistance Games (OU-AGs), a framework enabling LLM-based agents to infer and align with human preferences through open-ended dialogue. The GOOD method extracts evolving goals from natural language interactions using probabilistic inference, demonstrating improved user intent alignment across shopping, robotics, and coding domains without requiring large offline datasets.