#visual-llm News & Analysis

2 articles tagged with #visual-llm. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

AI · Neutral · arXiv – CS AI · Jun 5 · 6/10

Using street view images and visual LLMs to predict heritage values for governance support: Risks, ethics, and policy implications

Swedish authorities are using visual large language models (LLMs) to analyze 154,710 street view images across Sweden and identify buildings with heritage values, in support of implementing the EU's Energy Performance of Buildings Directive. The research addresses Sweden's lack of a comprehensive heritage building register while raising critical concerns about LLM transparency, error detection, and potential misuse in governance.

AI · Bullish · arXiv – CS AI · Mar 3 · 6/10 · 7

Dr. Seg: Revisiting GRPO Training for Visual Large Language Models through Perception-Oriented Design

Researchers introduce Dr. Seg, a new framework that improves Group Relative Policy Optimization (GRPO) training for Visual Large Language Models by addressing key differences between language reasoning and visual perception tasks. The framework includes a Look-to-Confirm mechanism and a Distribution-Ranked Reward module, which enhance performance in complex visual scenarios without requiring architectural changes.