AI · Neutral · arXiv – CS AI · 10h ago · 5/10
The Model as One Rater Among Several: Measuring Political Positions in Data-Sparse Regions with a Language-Model Panel
Researchers propose measuring political positions in data-sparse regions by treating large language models as fallible raters on a panel rather than as standalone measurement instruments. Across nine models, the panel reaches a Krippendorff's alpha of 0.86, and written axis definitions improve inter-rater agreement, though the method still requires human validation.
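For readers unfamiliar with the reliability statistic quoted above, the following is a minimal sketch of Krippendorff's alpha for a panel of raters. It assumes interval-level scores (squared-difference distance); the summary does not say which measurement level the paper uses, so that choice is an assumption. The function name `krippendorff_alpha_interval` and the example scores are hypothetical and are not taken from the paper.

```python
from itertools import permutations


def krippendorff_alpha_interval(ratings):
    """Krippendorff's alpha for interval data.

    ratings: list of units (e.g. parties), each a list of numeric scores,
    one per rater. Use None where a rater gave no score.
    Assumption: interval metric, i.e. distance = (a - b) ** 2.
    """
    # Drop missing scores, then keep only units with at least two ratings,
    # since a single rating cannot be paired with anything.
    units = [[v for v in unit if v is not None] for unit in ratings]
    units = [u for u in units if len(u) >= 2]
    values = [v for u in units for v in u]
    n = len(values)
    if n < 2:
        raise ValueError("need at least two pairable ratings")

    # Observed disagreement: squared differences between ratings of the
    # same unit, each unit weighted by 1 / (m_u - 1).
    d_o = sum(
        sum((a - b) ** 2 for a, b in permutations(u, 2)) / (len(u) - 1)
        for u in units
    ) / n

    # Expected disagreement: squared differences between all pairable
    # ratings, regardless of which unit they belong to.
    d_e = sum((a - b) ** 2 for a, b in permutations(values, 2)) / (n * (n - 1))
    if d_e == 0:
        raise ValueError("alpha is undefined when all ratings are identical")

    return 1 - d_o / d_e


# Hypothetical panel: three parties, each scored by three raters on a
# left-right axis. These numbers are made up for illustration only.
panel = [
    [2.0, 2.5, 2.0],
    [7.0, 6.5, None],
    [5.0, 5.0, 4.5],
]
alpha = krippendorff_alpha_interval(panel)
print(f"alpha = {alpha:.3f}")
```

An alpha of 1 means the raters agree perfectly, and values near 0 mean agreement is no better than chance. Treating each model as one column in `ratings` mirrors the panel framing: no single rater is assumed to be correct, and the statistic only describes how consistently the panel scores the same units.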