AI · Bullish · arXiv – CS AI · 7h ago · 7/10
AuRA: Internalizing Audio Understanding into LLMs as LoRA
AuRA distills audio understanding directly into a large language model as a LoRA adapter, avoiding both cascaded ASR pipelines and costly multimodal training. The method is reported to beat existing speech-language approaches on performance and efficiency: it reuses pretrained models and supports parallel end-to-end inference.
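For readers unfamiliar with LoRA, the sketch below shows the generic low-rank update the summary refers to: a frozen weight W plus a trainable product B·A scaled by alpha/r. It illustrates standard LoRA only, not AuRA's distillation procedure, which the summary does not describe. The function names, dimensions, and random values are all assumptions made for illustration.

```python
import random

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def transpose(m):
    """Return the transpose of a matrix given as a list of rows."""
    return [list(col) for col in zip(*m)]

def lora_forward(x, W, A, B, alpha):
    """y = x W^T + (alpha / r) * x A^T B^T; W stays frozen, only A and B would train."""
    r = len(A)  # adapter rank
    base = matmul(x, transpose(W))
    update = matmul(matmul(x, transpose(A)), transpose(B))
    scale = alpha / r
    return [[b + scale * u for b, u in zip(brow, urow)]
            for brow, urow in zip(base, update)]

def merge_lora(W, A, B, alpha):
    """Fold the adapter into the weight: W' = W + (alpha / r) * B A."""
    r = len(A)
    delta = matmul(B, A)
    scale = alpha / r
    return [[w + scale * d for w, d in zip(wrow, drow)]
            for wrow, drow in zip(W, delta)]

def rand_matrix(rows, cols, rng):
    """Hypothetical helper: random matrix with entries in [-1, 1]."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]

# Illustrative sizes only; real LLM layers are far larger.
rng = random.Random(0)
d_in, d_out, r, alpha = 6, 4, 2, 4.0
W = rand_matrix(d_out, d_in, rng)   # frozen pretrained weight
A = rand_matrix(r, d_in, rng)       # low-rank down-projection
B = rand_matrix(d_out, r, rng)      # low-rank up-projection
x = rand_matrix(3, d_in, rng)       # batch of 3 input vectors

y_adapter = lora_forward(x, W, A, B, alpha)
y_merged = matmul(x, transpose(merge_lora(W, A, B, alpha)))
```

Here `y_adapter` and `y_merged` agree up to floating-point error. This is why a LoRA adapter can be folded into the base weights after training, so it adds no extra cost at inference time.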