#dexterous-manipulation News & Analysis

13 articles tagged with #dexterous-manipulation. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

13 articles

AIBullisharXiv – CS AI · Jun 197/10

🧠

ENPIRE: Agentic Robot Policy Self-Improvement in the Real World

Researchers introduce ENPIRE, a framework that enables AI coding agents to autonomously improve robot manipulation policies through real-world feedback loops without human intervention. The system achieves 99% success rates on complex dexterous tasks like pin box organization and tool use, demonstrating that AI agents can now conduct independent robotics research in physical environments.

🏢 Meta

AIBullisharXiv – CS AI · Jun 117/10

🧠

LUCID: Learning Embodiment-Agnostic Intent Models from Unstructured Human Videos for Scalable Dexterous Robot Skill Acquisition

LUCID is a machine learning framework that learns robot manipulation skills from unstructured internet videos and human demonstrations, then transfers this knowledge to different robot embodiments through a shared intent model. The approach eliminates the need for expensive, embodiment-specific robot training data and demonstrates zero-shot transfer capabilities across multiple real-world tasks.

AIBullisharXiv – CS AI · Jun 107/10

🧠

YUBI: Yielding Universal Bidigital Interface for Bimanual Dexterous Manipulation at Scale

Researchers introduce YUBI, a finger-aligned gripper that improves upon existing data collection systems for robotic manipulation by enabling more ergonomic, intuitive bimanual control. The team released an unprecedented 8,434-hour dataset across 1.20M episodes and demonstrated that policies trained on YUBI data transfer successfully across multiple robot platforms, advancing the development of robotic foundation models.

AIBullisharXiv – CS AI · Jun 107/10

🧠

UniDexTok: A Unified Dexterous Hand Tokenizer from Real Data

UniDexTok introduces a unified tokenization system that standardizes how different dexterous robotic hands represent their states, enabling cross-embodiment learning from real-world data. By mapping diverse hand kinematics to a shared 22-degree-of-freedom interface, the system achieves sub-millimeter reconstruction accuracy—a 99% improvement over previous approaches—while eliminating the need for simulation or manual retargeting.

AIBullisharXiv – CS AI · Jun 97/10

🧠

EgoAERO: Learning Dexterous Manipulation from a Single Egocentric Video without Object Assets

EgoAERO introduces a framework enabling robots to learn dexterous manipulation skills from single egocentric human videos without requiring pre-scanned object assets or CAD models. The system reconstructs hand-object trajectories and converts them into robot policies, supported by a new large-scale dataset (EgoDex-R) containing 4.3M RGB-D frames, achieving performance comparable to traditional asset-dependent methods.

AIBullisharXiv – CS AI · May 287/10

🧠

Turning Video Models into Generalist Robot Policies

Researchers present VERA, a decoupled approach to robot control that separates video prediction from action execution using inverse dynamics models. Rather than fine-tuning video models with action labels, the method keeps the video planner unchanged and trains embodiment-specific models to translate predicted frames into robot actions, enabling zero-shot cross-embodiment generalization.

AIBullisharXiv – CS AI · Apr 77/10

🧠

Learning Dexterous Grasping from Sparse Taxonomy Guidance

Researchers developed GRIT, a two-stage AI framework that learns dexterous robotic grasping from sparse taxonomy guidance, achieving 87.9% success rate. The system first predicts grasp specifications from scene context, then generates finger motions while preserving intended grasp structure, improving generalization to novel objects.

AINeutralarXiv – CS AI · Jun 236/10

🧠

CoorDex: Coordinating Body and Hand Priors for Continuous Dexterous Humanoid Loco-Manipulation

Researchers introduce CoorDex, a learning pipeline that enables humanoid robots to perform complex dexterous manipulation tasks while continuously moving, rather than stopping to grasp objects. The system coordinates high-dimensional body and hand control through latent priors and residual reinforcement learning, demonstrated on a Unitree G1 humanoid with a 20-DOF hand performing tasks like in-motion bottle grasping and fridge operation.

AINeutralarXiv – CS AI · Jun 116/10

🧠

Blind Dexterous Grasping via Real2Sim2Real Tactile Policy Learning

Researchers developed a framework for teaching dexterous robotic hands to grasp objects using only touch sensation, without visual input or real-world demonstrations. The approach combines tactile sensor calibration, geometry-aware learning, and diffusion-based policy aggregation to achieve 27% grasp success on both seen and unseen objects.

AINeutralarXiv – CS AI · Jun 116/10

🧠

Bridging the Morphology Gap: Adapting VLA Models to Dexterous Manipulation via Intent-Conditioned Fine-Tuning

Researchers introduce InDex, a framework that adapts Vision-Language-Action (VLA) models from simple parallel grippers to complex dexterous robotic hands through intent-conditioned fine-tuning. The approach uses a two-stage architecture that preserves spatial reasoning capabilities while efficiently learning fine-grained multi-finger control with minimal training data.

AIBullisharXiv – CS AI · May 296/10

🧠

BORA: Bridging Offline Reinforcement Learning and Online Residual Adaptation for Real-World Dexterous VLA Models

Researchers introduce BORA, an offline-to-online reinforcement learning framework that enables Vision-Language-Action (VLA) models to perform complex dexterous robotic manipulation tasks more reliably in real-world settings. The method combines offline critic training with lightweight online adaptation, achieving 33% improvement in success rates over traditional imitation learning approaches.

AINeutralarXiv – CS AI · May 286/10

🧠

Beyond Binary: Sim-to-Real Dexterous Manipulation with Physics-Grounded Contact Representation

Researchers introduce Center-of-Pressure (CoP), a physics-grounded tactile representation that enables robots to perform complex contact-rich manipulation tasks through sim-to-real transfer learning. The method preserves dense touch sensor information while remaining robust across simulation-to-reality gaps, demonstrating zero-shot transfer on dexterous hand tasks like peg insertion and ball balancing.

AIBullisharXiv – CS AI · Mar 116/10

🧠

DexHiL: A Human-in-the-Loop Framework for Vision-Language-Action Model Post-Training in Dexterous Manipulation

Researchers introduce DexHiL, a human-in-the-loop framework for improving Vision-Language-Action models in robotic dexterous manipulation tasks. The system allows real-time human corrections during robot execution and demonstrates 25% better success rates compared to standard offline training methods.