#bayesian-optimization News & Analysis

18 articles tagged with #bayesian-optimization. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

18 articles

AIBullisharXiv – CS AI · Jun 27/10

🧠

Principle-Evolvable Scientific Discovery via Uncertainty Minimization

Researchers introduce PiEvo, a framework that enables AI scientific agents to autonomously evolve their underlying scientific principles rather than search within fixed hypothesis spaces. The system achieves 29.7-31.1% improvement in solution quality and 83.3% faster convergence by treating scientific discovery as Bayesian optimization over an expanding principle space.

AINeutralarXiv – CS AI · May 287/10

🧠

Why LLMs Fail at Causal Discovery and How Interventional Agents Escape

Researchers prove that large language models fundamentally cannot perform causal discovery through standard training methods, establishing this limitation as intrinsic to supervised learning rather than a model-specific flaw. They propose Agentic Causal Bayesian Optimization (A-CBO), which bypasses this constraint by using frozen language models as query oracles within an external optimization loop, achieving superior performance on causal inference benchmarks.

AIBullisharXiv – CS AI · Apr 107/10

🧠

AgentOpt v0.1 Technical Report: Client-Side Optimization for LLM-Based Agent

AgentOpt v0.1, a new Python framework, addresses client-side optimization for AI agents by intelligently allocating models, tools, and API budgets across pipeline stages. Using search algorithms like Arm Elimination and Bayesian Optimization, the tool reduces evaluation costs by 24-67% while achieving near-optimal accuracy, with cost differences between model combinations reaching up to 32x at matched performance levels.

AINeutralarXiv – CS AI · Jun 196/10

🧠

Manifold Bandits: Bayesian Curriculum Learning over the Latent Geometry of Large Language Models

Researchers propose Bayesian Manifold Curriculum (BMC), a new framework for training large language models through reinforcement learning that treats problem sampling as a structured bandit problem rather than independent tasks. The approach organizes problems hierarchically and balances difficulty, diversity, and task relevance, showing that difficulty alone is insufficient for optimal model improvement.

AINeutralarXiv – CS AI · Jun 96/10

🧠

SVRG and Beyond via Posterior Correction

Researchers have established a fundamental connection between Stochastic Variance Reduced Gradient (SVRG), a decade-old optimization method, and Bayesian posterior correction techniques. This theoretical breakthrough enables the derivation of novel SVRG extensions using flexible exponential-family posteriors, including Newton-like and Adam-like variants that improve training efficiency.

GeneralNeutralarXiv – CS AI · Jun 25/10

📰

Optimal Transport-based Permutation-Invariant Bayesian Optimization of Offshore Wind Farm Layouts

Researchers propose PIBO, a Permutation-Invariant Bayesian Optimization approach that leverages Optimal Transport theory to optimize offshore wind farm layouts. The method exploits the symmetry inherent in wind turbine placement problems where order doesn't matter, achieving superior layouts while reducing computation time by approximately 50% compared to standard Bayesian Optimization.

AINeutralarXiv – CS AI · Jun 26/10

🧠

Transferring Information Across Interventions in Causal Bayesian Optimization

Researchers present graph-coupled causal Bayesian optimization, a method that improves expensive system optimization by sharing information across related interventions through a causal kernel. The approach demonstrates logarithmic information gains and cleanly separates optimization, causal estimation, and intervention selection errors, with strongest performance when direct interventions are unavailable.

AINeutralarXiv – CS AI · Jun 26/10

🧠

Evidence-Gated LLM Priors for Multi-Objective Bayesian Optimization

Researchers propose a framework for incorporating Large Language Model (LLM) priors into multi-objective Bayesian optimization while maintaining robustness against miscalibrated advice. Using an objective-wise reputation mechanism and counterfactual gating, the approach dynamically adjusts trust in LLM suggestions based on observed performance rather than accepting them blindly, with empirical validation across molecular optimization tasks.

AINeutralarXiv – CS AI · Jun 26/10

🧠

c-TPE: Tree-structured Parzen Estimator with Inequality Constraints for Expensive Hyperparameter Optimization

Researchers propose c-TPE, an enhanced Bayesian optimization method that extends the Tree-structured Parzen Estimator to handle inequality constraints in hyperparameter optimization. The method addresses practical real-world limitations like memory and latency constraints while maintaining strong performance, demonstrating superiority over existing approaches across 81 expensive optimization problems.

AINeutralarXiv – CS AI · Jun 26/10

🧠

Tree-Structured Parzen Estimator: Understanding Its Algorithm Components and Their Roles for Better Empirical Performance

Researchers have conducted a comprehensive ablation study of Tree-Structured Parzen Estimator (TPE), a widely-used Bayesian optimization method, to clarify the role of each control parameter and improve its empirical performance. The study provides actionable recommendations for parameter tuning in machine learning frameworks like Hyperopt and Optuna, with implementations now available through OptunaHub.

AIBullisharXiv – CS AI · May 296/10

🧠

A Language-Guided Bayesian Optimization for Efficient LoRA Hyperparameter Search

Researchers propose a Bayesian Optimization framework that uses pre-trained Large Language Models to efficiently search for optimal LoRA (Low-Rank Adaptation) hyperparameters by encoding domain knowledge as natural language prompts. The method discovers high-performing configurations in ~30 iterations versus 45,000 combinations, achieving 20% performance improvements while significantly reducing computational costs.

AINeutralarXiv – CS AI · May 285/10

🧠

Preference-Shaped Expected Hypervolume and R2 Improvement: Exact Computation and Monotonicity

This academic paper advances Bayesian multiobjective optimization by clarifying how preference transformations affect two key performance indicators—hypervolume and R2—used in algorithm design. The research provides exact computational methods and proves that R2 improvement, contrary to prior assumptions, cannot be directly computed as objective-space hypervolume but instead represents volume in scalarization space, enabling new algorithmic implementations.

AINeutralarXiv – CS AI · May 116/10

🧠

Open-Ended Task Discovery via Bayesian Optimization

Researchers introduce Generate-Select-Refine (GSR), a Bayesian optimization framework that dynamically discovers and refines tasks during scientific workflows rather than optimizing fixed objectives. The approach demonstrates superior performance across product development, chemical synthesis, algorithm analysis, and patent repurposing compared to existing LLM-based optimizers.

AINeutralarXiv – CS AI · May 116/10

🧠

Ensemble Distributionally Robust Bayesian Optimisation

Researchers propose a novel Ensemble Distributionally Robust Bayesian Optimisation algorithm that addresses context distributional uncertainty in zeroth-order optimization. The method achieves sublinear regret bounds while remaining computationally tractable, improving upon existing state-of-the-art approaches.

AINeutralarXiv – CS AI · May 116/10

🧠

DT-PBO: an Interpretable Tree-based Surrogate Model for Preferential Bayesian Optimization

Researchers introduce DT-PBO, a tree-based surrogate model for Preferential Bayesian Optimization that prioritizes interpretability over traditional Gaussian Process approaches. The method achieves competitive performance on benchmark functions while providing transparent insights into decision-maker preferences, addressing critical needs in high-stakes domains like healthcare.

$MKR

AIBullisharXiv – CS AI · Apr 156/10

🧠

Learning Chain Of Thoughts Prompts for Predicting Entities, Relations, and even Literals on Knowledge Graphs

Researchers introduce RALP, a novel method that uses chain-of-thought prompts with large language models to improve knowledge graph predictions, outperforming traditional embedding models by over 5% on standard benchmarks while better handling unseen entities, relations, and numerical data.

AIBullisharXiv – CS AI · Apr 106/10

🧠

EmoMAS: Emotion-Aware Multi-Agent System for High-Stakes Edge-Deployable Negotiation with Bayesian Orchestration

Researchers introduce EmoMAS, a Bayesian multi-agent framework that enables small language models to perform sophisticated negotiation by treating emotional intelligence as a strategic variable. The system coordinates game-theoretic, reinforcement learning, and psychological agents to optimize negotiation outcomes while maintaining privacy through edge deployment, demonstrating performance comparable to larger models across high-stakes domains.

AIBullisharXiv – CS AI · Mar 27/1019

🧠

Thompson Sampling via Fine-Tuning of LLMs

Researchers developed ToSFiT (Thompson Sampling via Fine-Tuning), a new Bayesian optimization method that uses fine-tuned large language models to improve search efficiency in complex discrete spaces. The approach eliminates computational bottlenecks by directly parameterizing reward probabilities and demonstrates superior performance across diverse applications including protein search and quantum circuit design.