AIBullisharXiv – CS AI · 3d ago7/10
🧠LoopFM introduces a novel knowledge distillation framework that transfers rich intermediate representations from large foundation models to compact vertical models, achieving significant conversion improvements (0.5-1.22%) in industrial-scale systems by structuring FM embeddings as input features rather than relying on single scalar predictions.
AIBullisharXiv – CS AI · 4d ago7/10
🧠Researchers introduced FLUID, a production-scale recommendation system that eliminates reliance on item IDs for livestreaming platforms by using multimodal semantic codes instead. Deployed across platforms with over one billion users, the system achieves significant performance gains including 2.05% improvement in cold-start room views, addressing a fundamental challenge in recommending short-lived broadcast content.
AIBearisharXiv – CS AI · May 117/10
🧠Researchers demonstrate that a simple graph heuristic without machine learning matches or outperforms advanced generative recommendation systems on standard benchmarks, revealing that widely-used datasets contain structural shortcuts that don't require sophisticated modeling. The findings question whether current benchmark evaluations actually validate the advanced capabilities that modern recommendation systems claim to provide.
AINeutralarXiv – CS AI · Apr 67/10
🧠Research examines how Large Language Models can be used to initialize contextual bandits for recommendation systems, finding that LLM-generated preferences remain effective up to 30% data corruption but can harm performance beyond 50% corruption. The study provides theoretical analysis showing when LLM warm-starts outperform cold-start approaches, with implications for AI-driven recommendation systems.
AINeutralarXiv – CS AI · Mar 267/10
🧠Researchers developed ESCM² (Entire Space Counterfactual Multitask Model), a new framework that improves post-click conversion rate estimation in recommender systems by addressing intrinsic estimation bias and false independence assumptions. The model-agnostic approach incorporates counterfactual learning to enhance recommendation accuracy and has been validated on large-scale industrial datasets.
AIBullisharXiv – CS AI · Mar 57/10
🧠Researchers introduce AgentSelect, a comprehensive benchmark for recommending AI agent configurations based on narrative queries. The benchmark aggregates over 111,000 queries and 107,000 deployable agents from 40+ sources to address the critical gap in selecting optimal LLM agent setups for specific tasks.
AINeutralarXiv – CS AI · Mar 46/102
🧠Researchers propose PURE, a new framework for AI-powered recommendation systems that addresses preference-inconsistent explanations - where AI provides factually correct but unconvincing reasoning that conflicts with user preferences. The system uses a select-then-generate approach to improve both evidence selection and explanation generation, demonstrating reduced hallucinations while maintaining recommendation accuracy.
AINeutralarXiv – CS AI · 3d ago6/10
🧠Researchers introduce Quotient DAGs, a novel framework for off-policy evaluation that addresses variance issues in importance sampling by recognizing when generation process details are irrelevant to evaluation targets. The method computes exact unordered slate propensities efficiently through Forward-DP, a dynamic programming approach that avoids factorial enumeration, enabling practical evaluation for autoregressive slate recommendation systems.
AINeutralarXiv – CS AI · 3d ago6/10
🧠Researchers propose H2Rec, a novel framework that combines Semantic IDs (SID) and Hash IDs (HID) to improve sequential recommendation systems, particularly for long-tail items. The dual-branch architecture addresses the performance trade-off between head and tail recommendations, with validation across public benchmarks and a commercial platform.
AINeutralarXiv – CS AI · 4d ago6/10
🧠Ocean4Rec presents a novel approach to video-on-demand recommendation by using LLMs offline to generate OCEAN personality profiles for content items, then performing request-time reranking without real-time model calls. The system demonstrates significant NDCG improvements (7.6-61.5%) on Samsung Smart TV data while maintaining deployment simplicity and predictable latency for production services.
$OCEAN
AIBullisharXiv – CS AI · 4d ago6/10
🧠Researchers introduce ProRL, a reinforcement learning framework designed to improve proactive recommender systems that guide users toward target items through sequential recommendations. The approach addresses fundamental gradient estimation problems in policy learning by implementing stepwise reward centering and position-specific advantage estimation, demonstrating superior performance on real-world datasets.
AIBullisharXiv – CS AI · 4d ago6/10
🧠Researchers demonstrate a novel approach to advertising systems by using fine-tuned large language models as complementary predictors for advertiser forecasting rather than traditional ranking roles. Deployed in production-scale environments, this method improves candidate generation and downstream ranking by leveraging LLM knowledge to predict likely advertisers from user data, delivering measurable offline and online business improvements.
AINeutralarXiv – CS AI · 5d ago6/10
🧠L2Rec introduces a novel framework that adapts large language models for personalized recommendations by unifying behavioral and semantic signals at the parameter level using a Dual-view Personalized Mixture-of-Experts mechanism. The approach demonstrates superior performance across multiple datasets and validates real-world applicability through industrial A/B testing.
AIBullisharXiv – CS AI · 5d ago6/10
🧠Hi-SAM is a new hierarchical multi-modal recommendation framework that improves how AI systems process diverse data types (text, images) for personalized suggestions. The system addresses tokenization inefficiencies and architectural misalignments in existing approaches, achieving 6.55% improvement in core metrics when deployed at scale.
AINeutralarXiv – CS AI · 5d ago5/10
🧠Uniboost is a new traffic allocation framework for recommendation systems that uses posterior value alignment and linear boosting to improve interpretability and efficiency in allocating traffic across business objectives. The system reduces score inflation and decouples allocation plans, demonstrating improved performance in online A/B tests with practical applications for large-scale industrial recommendation systems.
🏢 Meta
AINeutralarXiv – CS AI · 5d ago6/10
🧠Researchers have identified and addressed popularity bias in Generative Recommenders (GRs), a emerging class of AI systems that use unified end-to-end frameworks for recommendations. The study reveals that this bias stems from token-level optimization flaws and undifferentiated item tokenization, proposing Ghost, a novel system using asymmetric unlikelihood optimization and skeleton-founded tokenization to mitigate the problem while maintaining recommendation quality.
AINeutralarXiv – CS AI · May 126/10
🧠UxSID is a new machine learning framework that models long user behavior sequences using semantic grouping and dual-level attention, achieving state-of-the-art performance with a 0.337% revenue lift in large-scale advertising tests. The approach balances computational efficiency with semantic awareness by using Semantic IDs rather than item-specific search methods.
AINeutralarXiv – CS AI · May 125/10
🧠Researchers propose a multi-level graph attention network framework that uses contrastive learning to improve knowledge-graph-based recommendation systems. The approach addresses limitations in existing methods by leveraging multi-view learning and self-supervised techniques to better model user preferences and item representations.
AINeutralarXiv – CS AI · May 116/10
🧠Researchers propose DCGL, a dual-channel graph learning framework that combines Knowledge Graphs with Large Language Models to improve recommendation systems. The method addresses limitations in current approaches by separately modeling semantic and behavioral patterns, using contrastive learning and adaptive fusion to achieve better performance across sparse and active user scenarios.
AINeutralarXiv – CS AI · May 116/10
🧠Researchers introduce TRACE, a benchmark dataset for evaluating tourism recommendation systems that combine multi-turn dialogue, verifiable review citations, and rejection recovery. The dataset reveals a significant gap in existing conversational recommender systems: LLMs excel at recall but cite weakly, while retrieval-based systems ground better but struggle with accuracy and adaptation.
AIBullisharXiv – CS AI · May 116/10
🧠Researchers propose RRCM, a novel framework that enhances Large Language Model-based recommendation systems by dynamically retrieving relevant collaborative and metadata information. The system learns optimal context construction through ranking-driven optimization, addressing key challenges in balancing context quality with efficiency limitations.
AINeutralarXiv – CS AI · May 96/10
🧠This survey examines the integration of Foundation Models into federated learning systems for privacy-preserving recommendation engines. It addresses the fundamental challenge of balancing global knowledge leverage with personalized user preferences while maintaining data privacy through decentralized architectures, representing an emerging intersection of federation, personalization, and foundation models.
AINeutralarXiv – CS AI · May 96/10
🧠Researchers introduce Owen-Shapley Policy Optimization (OSPO), a reinforcement learning algorithm that improves how language models learn from feedback by attributing credit to individual tokens rather than treating entire sequences as atomic units. The method addresses a fundamental training gap in generative AI systems used for recommendation tasks, showing measurable improvements on real e-commerce datasets.
AINeutralarXiv – CS AI · May 16/10
🧠Researchers introduce PAD-Rec, a lightweight module that optimizes speculative decoding for LLM-based recommendation systems by incorporating position-aware embeddings. The approach achieves up to 3.1x speedup in inference while preserving recommendation quality, addressing the latency bottleneck in generative list-wise recommendations.
AINeutralarXiv – CS AI · Apr 146/10
🧠SRBench introduces a comprehensive evaluation framework for Sequential Recommendation models that combines Large Language Models with traditional neural network approaches. The benchmark addresses critical gaps in existing evaluation methodologies by incorporating fairness, stability, and efficiency metrics alongside accuracy, while establishing fair comparison mechanisms between LLM-based and neural network-based recommendation systems.
🏢 Meta