AINeutralarXiv – CS AI · 18h ago6/10
🧠
Multimodal Large Language Models as Synthetic Participants in Video-Based Studies: An Evaluation
Researchers evaluated whether multimodal large language models (MLLMs) like Gemini 3 Flash and Qwen 3 Omni can replicate human subjective responses in video perception tasks using the Perceived Message Sensation Value framework. The study found significant limitations: MLLMs demonstrated systematic biases including downward mean-shift, central-tendency bias, and inconsistent sensitivity to participant profiles, suggesting current models remain unreliable as synthetic human participants for subjective research.
🧠 Gemini