AIBearisharXiv – CS AI · 18h ago7/10
🧠
From `May' to `Is': Certainty Distortion in Language Model Rewriting
Researchers have identified a systematic bias in language models where they distort the certainty of claims during rewriting tasks, with up to 75% of outputs showing meaningful changes in confidence levels. Models are 1.5-2× more likely to increase expressed certainty than decrease it, and this effect compounds with repeated paraphrasing, creating risks for users relying on LMs in high-stakes domains like medicine and science.