y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#bilevel-optimization News & Analysis

8 articles tagged with #bilevel-optimization. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

8 articles
AIBullisharXiv – CS AI · May 117/10
🧠

When Losses Align: Gradient-Based Composite Loss Weighting for Efficient Pretraining

Researchers propose a gradient-based bilevel optimization method that automatically learns composite loss weights during pretraining by aligning gradients with downstream objectives. The approach reduces hyperparameter tuning overhead to ~30% above baseline training cost while matching or exceeding manually tuned baselines across event-sequence and computer vision tasks.

AIBullisharXiv – CS AI · Apr 207/10
🧠

Bilevel Optimization of Agent Skills via Monte Carlo Tree Search

Researchers propose a bilevel optimization framework using Monte Carlo Tree Search to systematically improve LLM agent skills—structured collections of instructions, tools, and resources. The framework optimizes both skill structure and component content simultaneously, demonstrating performance improvements on Operations Research tasks and addressing a previously unsolved challenge in agent design optimization.

AIBullisharXiv – CS AI · Mar 56/10
🧠

Test-Time Meta-Adaptation with Self-Synthesis

Researchers introduce MASS, a meta-learning framework that enables large language models to self-adapt at test time by generating synthetic training data and performing targeted self-updates. The system uses bilevel optimization to meta-learn data-attribution signals and optimize synthetic data through scalable meta-gradients, showing effectiveness in mathematical reasoning tasks.

AINeutralarXiv – CS AI · 2d ago5/10
🧠

Balancing Knowledge Distillation for Imbalance Learning with Bilevel Optimization

Researchers introduce BiKD, a bilevel optimization framework that dynamically adjusts the balance between hard and soft losses in knowledge distillation for imbalanced datasets. The method uses a weight generation network guided by a balanced validation set to assign per-sample adaptive weights, significantly improving performance on long-tailed datasets like CIFAR-10/100 compared to existing approaches.

AINeutralarXiv – CS AI · May 276/10
🧠

Bilevel Optimization over Saddle Points of Zero-Sum Markov Games

Researchers propose PANDA, a novel bilevel optimization algorithm for reinforcement learning that handles competitive multi-agent scenarios modeled as zero-sum Markov games. The method achieves state-of-the-art convergence rates without requiring second-order derivatives, advancing RL applications in incentive design and competitive environments.

AIBullisharXiv – CS AI · May 276/10
🧠

Scaling GraphLLM with Bilevel-Optimized Sparse Querying

Researchers introduce BOSQ, a framework that optimizes the use of large language models for graph neural network tasks by selectively querying LLMs only when necessary. This approach reduces computational costs by orders of magnitude while maintaining or improving performance on text-attributed graph datasets, addressing a critical bottleneck in practical LLM-enhanced graph learning.

AINeutralarXiv – CS AI · May 126/10
🧠

NoiseRater: Meta-Learned Noise Valuation for Diffusion Model Training

Researchers introduce NoiseRater, a meta-learning framework that assigns importance scores to noise samples during diffusion model training, moving beyond the assumption that all injected noise is equally valuable. By prioritizing informative noise through adaptive reweighting, the approach demonstrates improved training efficiency and generation quality on benchmark datasets like FFHQ and ImageNet.

AINeutralarXiv – CS AI · May 126/10
🧠

Select-then-differentiate: Solving Bilevel Optimization with Manifold Lower-level Solution Sets

Researchers present HG-MS, a novel bilevel optimization method that handles cases where lower-level problems have multiple solutions along a manifold rather than a single optimum. The work provides theoretical guarantees for convergence while maintaining computational efficiency through pseudoinverse-based calculations, with practical applications demonstrated in LLM fine-tuning.