#sequential-monte-carlo News & Analysis

2 articles tagged with #sequential-monte-carlo. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

AI · Bullish · arXiv – CS AI · Jun 5 · 7/10

Agentic Monte Carlo: Simulating Reinforcement Learning for Black-Box Agents

Researchers propose Agentic Monte Carlo (AMC), a method for optimizing black-box LLM agents without API access that uses Sequential Monte Carlo sampling to steer agents toward optimal behavior. The technique bridges reinforcement learning and Bayesian inference, performing competitively against RL baselines while still treating the model as a black box.

AI · Bullish · arXiv – CS AI · Jun 2 · 7/10

Beyond the Frontier: Stochastic Backtracking for Efficient Test-Time Scaling

Researchers introduce stochastic backtracking, a novel test-time scaling method for language models that revisits previously generated solution paths rather than committing irreversibly to frontier candidates. The approach uses subpool selection and power backtrack sequential Monte Carlo to improve reasoning accuracy while reducing token generation, outperforming existing PRM-guided methods across mathematical benchmarks.