AINeutralarXiv – CS AI · 6h ago6/10
🧠
From Texts to Scores: Tracing the Emergence of Essay Quality Representations in Large Language Models
Researchers systematically analyzed how eight large language models encode essay quality information in their hidden representations across three datasets. Using linear probing and neuron-level analysis, they found that essay quality is encoded in linearly accessible form, emerges progressively across layers, and partially transfers across different essay prompts, with individual 'essay scoring neurons' showing strong correlation to scores.