AIBullisharXiv – CS AI · 14h ago7/10
🧠
VLA-Pro: Cross-Task Procedural Memory Transfer for Vision-Language-Action Models
Researchers introduce VLA-Pro, a framework that enhances vision-language-action models for robotics by storing and retrieving task-specific procedural memories during inference. The approach achieves dramatic performance gains—up to 207% improvement in simulation and raising real-world success rates from 5.8% to 65%—demonstrating significant progress in cross-task generalization for robotic manipulation.