y0news
← Feed
Back to feed
🧠 AI🟢 Bullish

GarmentPile++: Affordance-Driven Cluttered Garments Retrieval with Vision-Language Reasoning

arXiv – CS AI|Mingleyang Li, Yuran Wang, Yue Chen, Tianxing Chen, Jiaqi Liang, Zishun Shen, Haoran Lu, Ruihai Wu, Hao Dong|
🤖AI Summary

Researchers developed GarmentPile++, an AI pipeline that uses vision-language models to retrieve individual garments from cluttered piles following natural language instructions. The system integrates visual affordance perception with dual-arm robotics to handle complex garment manipulation tasks in real-world home assistant applications.

Key Takeaways
  • GarmentPile++ addresses the real-world challenge of retrieving garments from cluttered piles rather than single-item scenarios.
  • The system combines vision-language models with visual affordance perception for high-level reasoning and low-level action execution.
  • A dual-arm cooperation framework handles large garments and incorrect grasping scenarios that single-arm systems cannot manage.
  • The pipeline uses SAM2 visual segmentation to enhance VLM awareness of individual garment states within piles.
  • Testing demonstrates effectiveness across diverse tasks in both simulation and real-world environments.
Read Original →via arXiv – CS AI
Act on this with AI
Stay ahead of the market.
Connect your wallet to an AI agent. It reads balances, proposes swaps and bridges across 15 chains — you keep full control of your keys.
Connect Wallet to AI →How it works
Related Articles