AINeutralarXiv โ CS AI ยท 4d ago7/103
๐ง
CityLens: Evaluating Large Vision-Language Models for Urban Socioeconomic Sensing
Researchers introduced CityLens, a comprehensive benchmark for evaluating Large Vision-Language Models' ability to predict socioeconomic indicators from urban imagery. The study tested 17 state-of-the-art LVLMs across 11 prediction tasks using data from 17 global cities, revealing promising capabilities but significant limitations in urban socioeconomic analysis.