y0news
#mamba-architecture2 articles
2 articles
AIBullisharXiv โ€“ CS AI ยท 4h ago5
๐Ÿง 

DiffuMamba: High-Throughput Diffusion LMs with Mamba Backbone

Researchers introduce DiffuMamba, a new diffusion language model using Mamba backbone architecture that achieves up to 8.2x higher inference throughput than Transformer-based models while maintaining comparable performance. The model demonstrates linear scaling with sequence length and represents a significant advancement in efficient AI text generation systems.

AIBullisharXiv โ€“ CS AI ยท 4h ago0
๐Ÿง 

R2GenCSR: Mining Contextual and Residual Information for LLMs-based Radiology Report Generation

Researchers have developed R2GenCSR, a new AI framework for generating radiology reports that uses Mamba architecture instead of Transformers to reduce computational complexity while maintaining performance. The system leverages context retrieval and large language models to produce high-quality medical reports from X-ray images.