AI · Neutral · arXiv – CS AI · 9h ago · 6/10
UNCOM: Zero-shot Context-Aware Command Understanding for Tabletop Scenarios
UNCOM is a zero-shot framework that lets robots understand natural human commands in tabletop environments by integrating speech, gestures, and scene context, without requiring task-specific training data. The system achieves an 82.39% success rate in real-world interaction scenarios, demonstrating practical viability for general-purpose domestic robotics applications.