AI · Bearish · arXiv – CS AI · 6h ago · 7/10
Apparent Psychological Profiles of Large Language Models are Largely a Measurement Artifact
A peer-reviewed study finds that the psychological profiles assigned to large language models through human-designed tests are largely measurement artifacts rather than genuine model traits. Analyzing 56 instruction-tuned LLMs, the research reports that directional response bias, not actual personality, accounts for 81-90% of the differences between models. This undermines the validity of using standard psychological instruments to assess LLMs for safety, usability, and research applications.