#preference-modeling News & Analysis

5 articles tagged with #preference-modeling. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

5 articles

AINeutralarXiv – CS AI · Jun 236/10

🧠

TailorMind: Towards Preference-Aligned Multimodal Content Generation

TailorMind is a new AI system that generates personalized multimodal content by combining collaborative filtering with controllable generation, addressing the gap between user preferences and available content. The researchers introduce TailorBench, a comprehensive benchmark for evaluating personalized content generation across coherence, novelty, and aesthetic dimensions, with results showing 29% recall gains in reranking tasks.

AINeutralarXiv – CS AI · Jun 26/10

🧠

lmfaoooo at SemEval-2026 Task 1: Humor Is an Audience. Preference Modeling for Constrained Humor Generation

A research team won first place in the SemEval-2026 Task-1 humor generation competition by developing a system that generates diverse joke candidates and selects the best ones using a preference model trained on human comparisons. The approach addresses the core challenge that humor is subjective and audience-dependent, rather than objectively measurable, achieving top rankings across English, Chinese, and Spanish subtasks.

AINeutralarXiv – CS AI · Apr 146/10

🧠

Influencing Humans to Conform to Preference Models for RLHF

Researchers demonstrate that human preferences can be influenced to better align with the mathematical models used in RLHF algorithms, without changing underlying reward functions. Through three interventions—revealing model parameters, training humans on preference models, and modifying elicitation questions—the study shows significant improvements in preference data quality and AI alignment outcomes.

AIBullisharXiv – CS AI · Mar 96/10

🧠

PONTE: Personalized Orchestration for Natural Language Trustworthy Explanations

Researchers introduce PONTE, a human-in-the-loop framework that creates personalized, trustworthy AI explanations by combining user preference modeling with verification modules. The system addresses the challenge of one-size-fits-all AI explanations by adapting to individual user expertise and cognitive needs while maintaining faithfulness and reducing hallucinations.

AIBullisharXiv – CS AI · Mar 26/1015

🧠

DesignSense: A Human Preference Dataset and Reward Modeling Framework for Graphic Layout Generation

Researchers introduce DesignSense-10k, a dataset of 10,235 human-annotated preference pairs for evaluating graphic layout generation, along with DesignSense, a specialized AI model that outperforms existing models by 54.6% in layout quality assessment. The framework addresses the gap between AI-generated layouts and human aesthetic preferences, showing practical improvements in layout generation through reinforcement learning.