AIBearisharXiv – CS AI · 6h ago7/10
🧠
How Far Are VLMs from Privacy Awareness in the Physical World? An Empirical Study
Researchers present ImmersedPrivacy, an evaluation framework that tests Vision-Language Models' ability to recognize and respect privacy in physical environments. Testing 12 state-of-the-art VLMs reveals significant deficiencies: all models struggle with cluttered scenes, none exceed 65% accuracy when social context changes, and even the best model only balances task completion with privacy preservation 51% of the time.