arXiv – CS AI
D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI
Researchers developed D2E (Desktop to Embodied AI), a framework that pretrains AI models on desktop gaming data and transfers them to robotics tasks. Their 1B-parameter model achieved a 96.6% success rate on manipulation tasks and 83.3% on navigation tasks, matching the performance of models up to seven times larger while relying on scalable desktop data rather than expensive training data collected on physical robots.