AIBullisharXiv โ CS AI ยท 14h ago7/10
๐ง
Adapting 2D Multi-Modal Large Language Model for 3D CT Image Analysis
Researchers propose a method to adapt 2D multimodal large language models for 3D medical imaging analysis, introducing a Text-Guided Hierarchical Mixture of Experts framework that enables task-specific feature extraction. The approach demonstrates improved performance on medical report generation and visual question answering tasks while reusing pre-trained parameters from 2D models.