AINeutralarXiv – CS AI · 18h ago6/10
🧠
Collaborative Edge-to-Server Inference for Vision-Language Models
Researchers propose a collaborative edge-to-server inference framework for vision-language models that reduces communication costs by selectively transmitting only high-entropy regions of interest rather than full-resolution images. The two-stage approach maintains inference accuracy while substantially decreasing bandwidth requirements across visual question-answering tasks.