#visuomotor-control News & Analysis

4 articles tagged with #visuomotor-control. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

AI · Bullish · arXiv – CS AI · Jun 9 · 7/10

CT-VAM: A Cerebello-Thalamic-Inspired Vision-Action Model for Efficient Visuomotor Control

Researchers introduce CT-VAM, a compact 68M-parameter neural network for robotic manipulation inspired by the cerebello-thalamic architecture of the brain. The model takes visual inputs and proprioception and predicts action sequences efficiently on edge devices. It matches larger vision-language-action models while reducing latency, which makes deployment on resource-constrained robots practical.

AI · Neutral · arXiv – CS AI · Jun 19 · 6/10

Co-policy: Responsive Human-Robot Co-Creation for Musical Performances

Researchers introduce Co-policy, a framework enabling robots to participate in real-time musical co-creation with humans by combining semantic understanding with physically executable performance. The system uses a fine-tuned vision-language model and a Gaussian-Mixture Visuomotor Policy to generate complementary musical responses rather than merely reproducing user input, demonstrating improved performance over existing diffusion-policy approaches.

AI · Neutral · arXiv – CS AI · Jun 8 · 6/10

AxisGuide: Grounding Robot Action Coordinate System in RGB Observations for Robust Visuomotor Manipulation

Researchers introduce AxisGuide, a lightweight method that improves robot manipulation by explicitly visualizing the action coordinate system in camera views. The technique augments visual observations with cues showing the robot's base-frame axes, which improves generalization to object placements not seen in training even when the scene layout is otherwise identical.
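
As a rough illustration of what "showing base-frame axes in a camera view" can involve, the sketch below projects the robot base-frame origin and the tips of its x, y, and z axes into pixel coordinates with a standard pinhole camera model. It is not AxisGuide's implementation: the function name `project_base_axes`, the intrinsics `K`, the camera-from-base transform `T_cam_base`, and the axis length are all hypothetical values chosen for the example.

```python
import numpy as np

def project_base_axes(K, T_cam_base, axis_len=0.1):
    """Project the base-frame origin and axis tips into pixel coordinates.

    K          : (3, 3) camera intrinsics (assumed known from calibration).
    T_cam_base : (4, 4) homogeneous transform from base frame to camera frame.
    axis_len   : length in meters of each drawn axis (illustrative default).
    Returns a (4, 2) array with rows: origin, x tip, y tip, z tip.
    """
    # Homogeneous points in the base frame, one point per column.
    pts_base = np.array([
        [0.0, 0.0, 0.0, 1.0],       # origin
        [axis_len, 0.0, 0.0, 1.0],  # tip of x axis
        [0.0, axis_len, 0.0, 1.0],  # tip of y axis
        [0.0, 0.0, axis_len, 1.0],  # tip of z axis
    ]).T
    # Move the points into the camera frame and drop the homogeneous row.
    pts_cam = (T_cam_base @ pts_base)[:3]
    if np.any(pts_cam[2] <= 0):
        raise ValueError("an axis point lies behind the camera")
    # Pinhole projection: apply intrinsics, then divide by depth.
    uvw = K @ pts_cam
    return (uvw[:2] / uvw[2]).T

# Hypothetical setup: 640x480 camera, base frame 1 m in front of the lens.
K = np.array([[500.0, 0.0, 320.0],
              [0.0, 500.0, 240.0],
              [0.0, 0.0, 1.0]])
T_cam_base = np.eye(4)
T_cam_base[2, 3] = 1.0
pixels = project_base_axes(K, T_cam_base)
```

The resulting pixel coordinates could then be drawn onto the RGB frame as colored line segments from the origin to each tip, using any image library, before the frame is passed to the policy.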

AI · Neutral · arXiv – CS AI · May 27 · 6/10

Efficient On-policy Visual-RL via Stochastic Decoupled Policy Gradient

Researchers introduce SDPG, an on-policy visual reinforcement learning method that trains robotic control policies faster and at lower computational cost on consumer GPUs. The approach reduces overhead through stochastic gradient estimation while maintaining strong performance, and the work includes new benchmarks for visual robotics research.

🏢 Nvidia