AIBullisharXiv โ CS AI ยท 14h ago7/10
๐ง
Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music
Researchers introduce Audio Flamingo Next (AF-Next), an advanced open-source audio-language model that processes speech, sound, and music with support for inputs up to 30 minutes. The model incorporates a new temporal reasoning approach and demonstrates competitive or superior performance compared to larger proprietary alternatives across 20 benchmarks.