AIBullisharXiv – CS AI · 10h ago6/10
🧠
Robust Zero-Shot Generalization for Open-Vocabulary Action Recognition via Task Arithmetic
Researchers propose a novel approach to Open Vocabulary Action Recognition (OVAR) using task arithmetic and model merging, enabling zero-shot generalization to novel actions without requiring costly domain-specific fine-tuning. By combining task vectors from models trained on diverse public datasets, the method achieves superior out-of-distribution performance while avoiding privacy and regulatory concerns associated with target-domain training.