AINeutralarXiv – CS AI · 7h ago6/10
🧠
Steering Where to Listen: Instruction-Based Activation Steering Redirects Temporal Attention in Large Audio-Language Models
Researchers developed instruction-based vector steering to redirect temporal attention in Large Audio-Language Models (LALMs), enabling these systems to concentrate on acoustically relevant regions without retraining. The technique achieves 60-68% accuracy in locating queried sound events, substantially outperforming standard prompting methods, revealing how LALMs encode temporal structure in audio understanding.