AIBullisharXiv – CS AI · 15h ago7/10
🧠
Bridging the Semantic-Action Gap in Visual Token Pruning for Efficient VLA Inference
Researchers propose VLA-Pruner, a novel token pruning method that accelerates Vision-Language-Action models for embodied AI by addressing the mismatch between semantic and action-critical visual processing. The method achieves up to 1.99x speedup while maintaining manipulation performance by considering both semantic context and temporal action relevance, unlike existing VLM pruning approaches.