y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#agent-optimization News & Analysis

12 articles tagged with #agent-optimization. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

12 articles
AIBullisharXiv – CS AI · Jun 87/10
🧠

Insights Generator: Systematic Corpus-Level Trace Diagnostics for LLM Agents

Researchers introduce Insights Generator (IG), a multi-agent system that automates the diagnosis of failures in large language model agents by analyzing execution trace corpora at scale. IG produces evidence-backed natural language insights about systematic behavioral patterns, demonstrating 30.4 percentage point performance improvements when human experts implement its recommendations.

AIBullisharXiv – CS AI · Jun 57/10
🧠

Agentic Monte Carlo: Simulating Reinforcement Learning for Black-Box Agents

Researchers propose Agentic Monte Carlo (AMC), a novel method for optimizing black-box LLM agents without API access by using Sequential Monte Carlo sampling to steer agents toward optimal behavior. The technique bridges the gap between reinforcement learning and Bayesian inference, demonstrating competitive performance against RL baselines while maintaining the black-box model architecture.

AIBullisharXiv – CS AI · Jun 17/10
🧠

Scaling Multi-Agent Environment Co-Design with Diffusion Models

Researchers introduce Diffusion Co-Design (DiCoDe), a scalable framework that jointly optimizes agent policies and environment configurations using diffusion models with novel constraint-handling and knowledge-sharing mechanisms. The method achieves 39% higher rewards with 66% fewer simulations in warehouse automation, demonstrating significant advances in multi-agent system deployment across logistics, pathfinding, and renewable energy domains.

AIBullisharXiv – CS AI · May 127/10
🧠

SkillEvolver: Skill Learning as a Meta-Skill

SkillEvolver introduces a meta-learning framework that automatically improves AI agent skills through iterative refinement based on real-world deployment failures, achieving 56.8% accuracy on benchmark tasks compared to 43.6% for manually curated skills. The system learns by modifying skill prose and code rather than model weights, enabling seamless integration with any compatible agent without retraining.

AIBullisharXiv – CS AI · May 127/10
🧠

Shepherd: A Runtime Substrate Empowering Meta-Agents with a Formalized Execution Trace

Shepherd is a new runtime substrate that enables meta-agents to supervise and optimize other agents through formalized execution traces, achieving 5x faster forking than Docker and demonstrating measurable improvements in coding assistance, optimization, and reinforcement learning tasks. The open-source system mechanizes core operations in Lean and enables replay, branching, and counterfactual exploration of agent behaviors.

AIBullisharXiv – CS AI · May 127/10
🧠

Debugging the Debuggers: Failure-Anchored Structured Recovery for Software Engineering Agents

Researchers present PROBE, a framework that improves how AI software engineering agents recover from failures by converting runtime telemetry into structured diagnoses and bounded recovery guidance. The system achieves 65% diagnosis accuracy and 21.8% recovery rates on previously unresolved cases, with a prototype deployed at Microsoft showing practical viability without disrupting existing workflows.

AIBullisharXiv – CS AI · May 97/10
🧠

Recursive Agent Optimization

Researchers introduce Recursive Agent Optimization (RAO), a reinforcement learning method enabling AI agents to spawn and delegate tasks to themselves recursively. This approach allows agents to handle longer contexts, solve harder problems through divide-and-conquer strategies, and achieve better training efficiency with reduced computational time.

AINeutralarXiv – CS AI · Feb 277/106
🧠

VeRO: An Evaluation Harness for Agents to Optimize Agents

Researchers introduced VeRO (Versioning, Rewards, and Observations), a new evaluation framework for testing AI coding agents that can optimize other AI agents through iterative improvement cycles. The system provides reproducible benchmarks and structured execution traces to systematically measure how well coding agents can improve target agents' performance.

AINeutralarXiv – CS AI · May 285/10
🧠

Learning to Assign Prediction Tasks to Agents with Capacity Constraints

Researchers propose a machine learning framework for optimally assigning prediction tasks to heterogeneous agents (humans or AI systems) subject to capacity constraints. The work develops explore-exploit algorithms that learn agent expertise and adapt assignments dynamically, demonstrating improvements over baseline approaches across tabular, image, and text tasks.

AINeutralarXiv – CS AI · May 286/10
🧠

Adapting the Interface, Not the Model: Runtime Harness Adaptation for Deterministic LLM Agents

Researchers introduce Life-Harness, a runtime interface adaptation method that improves frozen LLM agent performance without modifying model weights. The technique evolves from training trajectories to fix model-environment mismatches, achieving 88.5% average improvement across 126 settings and demonstrating cross-model transferability that suggests environment-side structure matters as much as model architecture.

AIBullishMicrosoft Research Blog · Dec 116/103
🧠

Agent Lightning: Adding reinforcement learning to AI agents without code rewrites

Microsoft Research introduced Agent Lightning, a system that enables developers to add reinforcement learning capabilities to AI agents without requiring code rewrites. The system decouples agent functionality from training processes, converting each agent action into reinforcement learning data to improve performance with minimal code changes.