#markov-decision-process News & Analysis

5 articles tagged with #markov-decision-process. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

5 articles

AINeutralarXiv – CS AI · 5d ago6/10

🧠

An End-to-End Learning Approach for Solving Capacitated Location-Routing Problems

Researchers propose DRLHQ, a deep reinforcement learning approach with heterogeneous query attention mechanisms to solve capacitated location-routing problems (CLRPs) and their open variants. This marks the first end-to-end learning framework for CLRPs, demonstrating superior performance over traditional and DRL-based baselines on benchmark datasets.

AINeutralarXiv – CS AI · May 126/10

🧠

Iterative Critique-and-Routing Controller for Multi-Agent Systems with Heterogeneous LLMs

Researchers propose a critique-and-routing controller for multi-agent LLM systems that iteratively refines outputs through sequential decision-making rather than one-shot routing. The method uses reinforcement learning with agent-utilization constraints to achieve performance approaching the strongest agent while reducing computational calls by over 75%, advancing coordination efficiency in heterogeneous AI systems.

AIBullisharXiv – CS AI · Feb 275/106

🧠

Improving Discrete Diffusion Unmasking Policies Beyond Explicit Reference Policies

Researchers developed a learned scheduler for masked diffusion models (MDMs) in language modeling that outperforms traditional rule-based approaches. The new method uses a KL-regularized Markov decision process framework and demonstrated significant improvements, including 20.1% gains over random scheduling and 11.2% over max-confidence approaches on benchmark tests.

AINeutralarXiv – CS AI · Mar 35/106

🧠

Multi-Sourced, Multi-Agent Evidence Retrieval for Fact-Checking

Researchers propose WKGFC, a new AI system that uses knowledge graphs and multi-agent retrieval to improve fact-checking accuracy. The system addresses limitations of current methods that rely on textual similarity by implementing an automated Markov Decision Process with LLM agents to retrieve and verify evidence from multiple sources.

AINeutralarXiv – CS AI · Mar 34/105

🧠

A Resource-Rational Principle for Modeling Visual Attention Control

Researchers have developed a new resource-rational framework for modeling visual attention as a sequential decision-making process using AI techniques like Partially Observable Markov Decision Processes. The framework successfully models human eye-movement behaviors in tasks like reading and multitasking, offering potential applications for Human-Computer Interaction design.