AINeutralarXiv – CS AI · 7h ago6/10
🧠
GIRL-DETR: Gradient-Isolated Reinforcement Learning for Video Moment Retrieval
GIRL-DETR introduces a novel reinforcement learning approach for video moment retrieval that addresses the optimization gap between training losses and evaluation metrics. By freezing backbone networks and applying progressive RL only to detection heads, the method achieves significant accuracy improvements while protecting learned feature representations in lightweight models.