AINeutralarXiv – CS AI · 7h ago6/10
🧠
Fine-tuning Multi-modal LLMs with ART: Art-based Reinforcement Training
Researchers propose ART (Art-based Reinforcement Training), a parameter-efficient fine-tuning method for multimodal LLMs that optimizes only raw visual inputs rather than model weights or prompts. The technique achieves competitive accuracy with LoRA on benchmarks while maintaining compatibility with high-throughput inference engines like vLLM that don't support traditional fine-tuning modifications.