#monte-carlo-tree-search News & Analysis

18 articles tagged with #monte-carlo-tree-search. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

18 articles

AIBullisharXiv – CS AI · Jun 97/10

🧠

Segment-level Tree Search for Long Meeting Document Summarization

Researchers propose S3, a training-free framework using Monte Carlo Tree Search to summarize long meeting documents by composing segment-level summaries. The approach achieves performance comparable to larger language models while using a 7B parameter model, addressing cumulative error propagation issues in multi-stage summarization pipelines.

AIBullisharXiv – CS AI · Jun 47/10

🧠

ChatSOP: An SOP-Guided MCTS Planning Framework for Controllable LLM Dialogue Agents

ChatSOP introduces a novel framework combining Standard Operating Procedures with Monte Carlo Tree Search to improve controllability of LLM-based dialogue agents. The research demonstrates 27.95% improvement in action accuracy over GPT-3.5 baselines through SOP-guided planning and a curated multi-scenario dialogue dataset.

🧠 GPT-4

AIBullisharXiv – CS AI · Apr 207/10

🧠

Bilevel Optimization of Agent Skills via Monte Carlo Tree Search

Researchers propose a bilevel optimization framework using Monte Carlo Tree Search to systematically improve LLM agent skills—structured collections of instructions, tools, and resources. The framework optimizes both skill structure and component content simultaneously, demonstrating performance improvements on Operations Research tasks and addressing a previously unsolved challenge in agent design optimization.

AIBullisharXiv – CS AI · Apr 147/10

🧠

Variance-Aware Prior-Based Tree Policies for Monte Carlo Tree Search

Researchers introduce Inverse-RPO, a methodology for deriving prior-based tree policies in Monte Carlo Tree Search from first principles, and apply it to create variance-aware UCT algorithms that outperform PUCT without additional computational overhead. This advances the theoretical foundation of MCTS used in reinforcement learning systems like AlphaZero.

AINeutralarXiv – CS AI · Jun 256/10

🧠

UC-Search: Risk-Aware Test-Time Search for Delayed Constrained Time-Series Control

UC-Search is a model-agnostic test-time algorithm that combines time-series forecasting with constrained decision-making under uncertainty. The approach uses beam search and Monte Carlo tree search variants to optimize delayed control decisions while respecting feasibility constraints, demonstrating measurable improvements over existing methods like CEM and MPPI across inventory control and financial forecasting benchmarks.

AINeutralarXiv – CS AI · Jun 236/10

🧠

REBA: A Revealed Belief Automaton Framework for Online Planning in Continuous POMDPs

Researchers introduce REBA (Revealed Belief Automaton), a new framework for online planning in continuous partially observable environments that dynamically certifies belief states rather than relying on predefined discrete abstractions. The method achieves 17-47% performance improvements over existing approaches in patrolling and navigation tasks by combining information-theoretic analysis with formal symbolic planning.

AINeutralarXiv – CS AI · Jun 196/10

🧠

VERITAS: Verifier-Guided Proof Search for Zero-Shot Formal Theorem Proving

VERITAS introduces a zero-shot framework for formal theorem proving that leverages rich verifier feedback signals rather than binary pass/fail outcomes. Using a two-phase approach combining Best-of-N sampling with critic-guided Monte Carlo Tree Search, the system achieves 40.6% accuracy on miniF2F benchmarks and demonstrates particular strength in combinatorial problems where iterative lemma recovery is critical.

AIBullisharXiv – CS AI · Jun 26/10

🧠

S3TS: Stochastic Scenario-Structured Tree Search for Advanced Planning Under Uncertainty

Researchers introduce S3TS, a novel algorithm combining Monte Carlo Tree Search with stochastic optimization to handle both non-linear complexity and uncertainty in energy grid scheduling. The approach demonstrates near-optimal performance in linear settings and significantly outperforms existing methods in non-linear scenarios, achieving up to 51% cost reductions compared to baseline algorithms.

AINeutralarXiv – CS AI · May 296/10

🧠

Influence-Guided Symbolic Regression: Scientific Discovery via LLM-Driven Equation Search with Granular Feedback

Researchers introduce Influence-Guided Symbolic Regression (IGSR), a novel framework combining LLMs with Monte Carlo Tree Search to discover scientific equations more efficiently. The method uses granular influence scores to evaluate which components of equations contribute to accuracy, enabling systematic refinement. The approach demonstrated genuine discovery potential by identifying a novel relationship between DNA methylation and RNA Polymerase II pausing that was subsequently validated experimentally.

AINeutralarXiv – CS AI · May 286/10

🧠

AlphaTransit: Learning to Design City-scale Transit Routes

Researchers introduce AlphaTransit, an AI framework combining Monte Carlo Tree Search with neural networks to optimize city-scale bus network design. The system achieves 9.9-11.4% performance improvements over reinforcement learning alone by coupling learned guidance with tree search, demonstrating that hybrid approaches outperform single-method solutions for complex infrastructure planning problems.

AINeutralarXiv – CS AI · May 286/10

🧠

Can I Have Your Order? Monte-Carlo Tree Search for Slot Filling Ordering in Diffusion Language Models

Researchers introduce McDiffuSE, an MCTS-based framework that optimizes slot-filling order in Masked Diffusion Models to improve performance on mathematical and code reasoning tasks. The approach achieves 3.2% improvement over autoregressive baselines and up to 19.5% gains on specific benchmarks by strategically exploring generation orderings rather than following sequential patterns.

AINeutralarXiv – CS AI · May 275/10

🧠

Monte Carlo Permutation Search

Researchers propose Monte Carlo Permutation Search (MCPS), an improved Monte Carlo Tree Search algorithm that enhances the GRAVE algorithm for game-playing AI. MCPS leverages statistics from all playouts containing moves along the path from root to node, demonstrating superior performance across multiple games while eliminating GRAVE's bias hyperparameter.

AINeutralarXiv – CS AI · May 126/10

🧠

Arcane: An Assertion Reduction Framework through Semantic Clustering and MCTS-Guided Rule Exploring

Arcane is a new assertion reduction framework that uses semantic clustering and Monte Carlo Tree Search to eliminate redundant assertions in hardware verification, achieving up to 76.2% reduction in assertion count while maintaining full formal coverage and enabling 2.6x to 6.1x simulation speedups.

AIBullisharXiv – CS AI · Mar 166/10

🧠

ToolTree: Efficient LLM Agent Tool Planning via Dual-Feedback Monte Carlo Tree Search and Bidirectional Pruning

Researchers have developed ToolTree, a new Monte Carlo tree search-based planning system for LLM agents that improves tool selection and usage through dual-feedback evaluation and bidirectional pruning. The system achieves approximately 10% performance gains over existing methods while maintaining high efficiency across multiple benchmarks.

AIBullisharXiv – CS AI · Mar 126/10

🧠

Resource-constrained Amazons chess decision framework integrating large language models and graph attention

Researchers developed a lightweight AI framework for the Game of the Amazons that combines graph attention networks with large language models, achieving 15-56% improvement in decision accuracy while using minimal computational resources. The hybrid approach demonstrates weak-to-strong generalization by leveraging GPT-4o-mini for synthetic training data and graph-based learning for structural reasoning.

🧠 GPT-4

AIBullisharXiv – CS AI · Mar 26/1013

🧠

RF-Agent: Automated Reward Function Design via Language Agent Tree Search

Researchers introduce RF-Agent, a framework that uses Large Language Models as agents to automatically design reward functions for control tasks through Monte Carlo Tree Search. The method improves upon existing approaches by better utilizing historical feedback and enhancing search efficiency across 17 diverse low-level control tasks.

AINeutralarXiv – CS AI · Mar 34/104

🧠

Boltzmann-based Exploration for Robust Decentralized Multi-Agent Planning

Researchers introduce Coordinated Boltzmann MCTS (CB-MCTS), a new approach for multi-agent AI planning that uses stochastic exploration instead of deterministic methods. The technique addresses challenges in sparse reward environments where traditional decentralized Monte Carlo Tree Search struggles, showing superior performance in deceptive scenarios while remaining competitive on standard benchmarks.

AINeutralarXiv – CS AI · Feb 274/108

🧠

Generalized Rapid Action Value Estimation in Memory-Constrained Environments

Researchers introduce GRAVE2, GRAVER and GRAVER2 algorithms that extend Generalized Rapid Action Value Estimation (GRAVE) for game playing AI. These new variants dramatically reduce memory requirements while maintaining the same playing strength as the original GRAVE algorithm.