y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#policy-learning News & Analysis

9 articles tagged with #policy-learning. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

9 articles
AIBullisharXiv – CS AI Β· Mar 57/10
🧠

VITA: Vision-to-Action Flow Matching Policy

Researchers developed VITA, a new AI framework that streamlines robot policy learning by directly flowing from visual inputs to actions without requiring conditioning modules. The system achieves 1.5-2x faster inference speeds while maintaining or improving performance compared to existing methods across 14 simulation and real-world robotic tasks.

AIBullisharXiv – CS AI Β· Mar 37/104
🧠

Dense-Jump Flow Matching with Non-Uniform Time Scheduling for Robotic Policies: Mitigating Multi-Step Inference Degradation

Researchers developed a new robotic policy framework using dense-jump flow matching with non-uniform time scheduling to address performance degradation in multi-step inference. The approach achieves up to 23.7% performance gains over existing baselines by optimizing integration scheduling during training and inference phases.

AINeutralarXiv – CS AI Β· 1d ago6/10
🧠

Beyond Static Sandboxing: Learned Capability Governance for Autonomous AI Agents

Researchers introduce Aethelgard, an adaptive governance framework that addresses the capability overprovisioning problem in autonomous AI agents by dynamically restricting tool access based on task requirements. The system uses reinforcement learning to enforce least-privilege principles, reducing security exposure while maintaining operational efficiency.

AINeutralarXiv – CS AI Β· 1d ago5/10
🧠

Hybrid-AIRL: Enhancing Inverse Reinforcement Learning with Supervised Expert Guidance

Researchers introduce Hybrid-AIRL, an enhanced inverse reinforcement learning framework that combines adversarial learning with supervised expert guidance to improve reward function inference in complex, imperfect-information environments like poker. The method demonstrates superior sample efficiency and learning stability compared to traditional AIRL, particularly in settings with sparse and delayed rewards.

AIBullisharXiv – CS AI Β· Mar 165/10
🧠

Accelerating Residual Reinforcement Learning with Uncertainty Estimation

Researchers developed an improved Residual Reinforcement Learning method that uses uncertainty estimation to enhance sample efficiency and work with stochastic base policies. The approach outperformed existing methods in simulation benchmarks and demonstrated successful zero-shot sim-to-real transfer in real-world deployments.

AINeutralarXiv – CS AI Β· Mar 34/104
🧠

Embedding Morphology into Transformers for Cross-Robot Policy Learning

Researchers developed an embodiment-aware transformer policy that improves cross-robot policy learning by injecting morphological information through kinematic tokens, topology-aware attention, and joint-attribute conditioning. This approach consistently outperforms baseline vision-language-action models across multiple robot embodiments.

AINeutralarXiv – CS AI Β· Mar 24/105
🧠

Bridging Dynamics Gaps via Diffusion Schr\"odinger Bridge for Cross-Domain Reinforcement Learning

Researchers propose BDGxRL, a novel framework using Diffusion SchrΓΆdinger Bridge to enable reinforcement learning agents to transfer policies across different domains without direct target environment access. The method aligns source domain transitions with target dynamics through offline demonstrations and introduces reward modulation for consistent learning.

AINeutralOpenAI News Β· Jun 171/107
🧠

Learning policy representations in multiagent systems

The article title references learning policy representations in multiagent systems, which relates to AI research in multi-agent reinforcement learning. However, no article body content was provided for analysis.