AINeutralarXiv โ CS AI ยท 7h ago6/10
๐ง
Beyond Surface Statistics: Robust Conformal Prediction for LLMs via Internal Representations
Researchers propose a conformal prediction framework for large language models that uses internal neural representations rather than surface-level outputs to assess reliability and uncertainty. The Layer-Wise Information scoring method improves prediction validity under distribution shift while maintaining competitive performance, addressing a critical challenge in deploying LLMs where traditional uncertainty signals become unreliable.