#mllm-architecture News & Analysis

3 articles tagged with #mllm-architecture. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

AI · Bullish · arXiv – CS AI · Jun 10 · 7/10

SPACE: Source-free Proxy Anchor Concept Erasure for MLLMs

Researchers introduce SPACE, a source-free machine unlearning framework for multimodal large language models that removes sensitive data without access to original training data. The two-stage approach uses text-guided proxy anchors and dual-constraint semantic isolation to erase target concepts while maintaining model performance, addressing growing privacy and regulatory compliance needs.

AI · Bullish · arXiv – CS AI · Apr 14 · 7/10

LAST: Leveraging Tools as Hints to Enhance Spatial Reasoning for Multimodal Large Language Models

Researchers introduce LAST, a framework that enhances the spatial reasoning of multimodal large language models by integrating specialized vision tools through an interactive sandbox interface. By converting complex tool outputs into hints that language models can consume, the approach improves performance by roughly 20% over baseline models and outperforms proprietary LLMs on spatial reasoning tasks.

AI · Bullish · arXiv – CS AI · Jun 2 · 6/10

DenseMLLM: Standard Multimodal LLMs for Dense Prediction

Researchers introduce DenseMLLM, a multimodal large language model that performs fine-grained dense prediction tasks like semantic segmentation and depth estimation without requiring task-specific decoders. The minimalist approach achieves competitive performance while maintaining the generalist design philosophy of standard MLLMs, potentially simplifying model architecture and increasing practical applicability.