y0news

#llm-calibration News & Analysis

6 articles tagged with #llm-calibration. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

AI · Bullish · arXiv – CS AI · May 28 · 7/10

Aligning LLMs with Human Uncertainty: A Beta-Bernoulli Calibrator for LLM Forecasting

Researchers propose the Beta-Bernoulli Calibrator (BBC), a novel method that improves large language model forecasting by converting point estimates into probability distributions using both binary outcomes and aggregated human forecast signals. The approach demonstrates better calibration and accuracy than existing post-hoc methods while leveraging epistemic uncertainty as a more reliable error predictor than verbalized confidence.
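As a rough illustration of the Beta-Bernoulli idea in general (not the paper's BBC procedure, whose details the summary does not give), a point forecast can be turned into a Beta distribution and updated with binary outcomes. The function names and the `strength` pseudo-count are assumptions made for this sketch.

```python
def beta_from_point(p, strength=10.0):
    """Turn a point forecast p into Beta(alpha, beta) pseudo-counts.

    `strength` is an assumed hyperparameter: how many pseudo-observations
    the point forecast is worth.
    """
    return p * strength, (1.0 - p) * strength


def update(alpha, beta, outcomes):
    """Conjugate Beta-Bernoulli update from a list of 0/1 outcomes."""
    successes = sum(outcomes)
    return alpha + successes, beta + len(outcomes) - successes


def mean_and_variance(alpha, beta):
    """Posterior mean (the forecast) and variance (a spread measure)."""
    total = alpha + beta
    mean = alpha / total
    variance = alpha * beta / (total ** 2 * (total + 1.0))
    return mean, variance


# Hypothetical usage: an LLM says 0.8, then two hits and two misses arrive.
alpha, beta = update(*beta_from_point(0.8), [1, 0, 0, 1])
forecast, spread = mean_and_variance(alpha, beta)
```

The variance of the resulting distribution gives one possible handle on how uncertain the forecast is, separate from the forecast value itself.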

AI · Neutral · arXiv – CS AI · May 12 · 6/10

A Semantic-Sampling Framework for Evaluating Calibration in Open-Ended Question Answering

Researchers introduce Sem-ECE, a new framework for evaluating how well large language models calibrate their confidence in open-ended question answering tasks. The method samples multiple answers from LLMs, groups them semantically, and uses answer frequency distributions as confidence measures, outperforming existing evaluation approaches across major commercial models.
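A minimal sketch of the sample, group, and count idea follows, paired with a standard binned expected calibration error. It is not the Sem-ECE implementation: the `normalize` function is a crude stand-in (lowercasing and trimming) for real semantic grouping, and all names here are assumptions.

```python
from collections import Counter


def frequency_confidence(samples, normalize=lambda s: s.strip().lower()):
    """Group sampled answers and use the top group's share as confidence.

    `normalize` is an assumed placeholder for semantic clustering.
    """
    groups = Counter(normalize(s) for s in samples)
    answer, count = groups.most_common(1)[0]
    return answer, count / len(samples)


def expected_calibration_error(confidences, correct, n_bins=10):
    """Standard binned ECE: weighted mean of |confidence - accuracy| per bin."""
    n = len(confidences)
    bins = [[] for _ in range(n_bins)]
    for conf, ok in zip(confidences, correct):
        idx = min(int(conf * n_bins), n_bins - 1)  # conf == 1.0 goes in the last bin
        bins[idx].append((conf, ok))
    ece = 0.0
    for rows in bins:
        if rows:
            avg_conf = sum(c for c, _ in rows) / len(rows)
            accuracy = sum(ok for _, ok in rows) / len(rows)
            ece += len(rows) / n * abs(avg_conf - accuracy)
    return ece


# Hypothetical usage: four sampled answers to one question.
answer, confidence = frequency_confidence(["Paris", "paris ", "Lyon", "Paris"])
```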

AI · Bullish · arXiv – CS AI · May 12 · 6/10

The Silent Vote: Improving Zero-Shot LLM Reliability by Aggregating Semantic Neighborhoods

Researchers propose Semantic Softmax, a novel inference-time method that improves zero-shot LLM classification by recovering probability mass lost during constrained decoding. The approach aggregates scores from semantic synonyms, reducing calibration errors and boosting accuracy on emotion and toxicity detection tasks.
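The aggregation step can be sketched as summing the probability of every synonym mapped to a label and renormalizing across labels. This is an illustration under assumptions, not the paper's method as published: the token log-probabilities, the synonym lists, and the function name are all hypothetical.

```python
import math


def aggregate_label_probs(token_logprobs, label_synonyms):
    """Sum probability mass over each label's synonyms, then renormalize.

    token_logprobs: dict mapping a candidate word to its log-probability.
    label_synonyms: dict mapping a label to the words counted toward it.
    """
    mass = {}
    for label, words in label_synonyms.items():
        # Synonyms missing from token_logprobs contribute no mass.
        mass[label] = sum(
            math.exp(token_logprobs[w]) for w in words if w in token_logprobs
        )
    total = sum(mass.values())
    if total == 0.0:
        raise ValueError("no probability mass found for any label")
    return {label: m / total for label, m in mass.items()}


# Hypothetical usage: "joyful" would be ignored by decoding constrained to "happy".
logprobs = {"happy": math.log(0.3), "joyful": math.log(0.2), "sad": math.log(0.25)}
synonyms = {"joy": ["happy", "joyful", "glad"], "sadness": ["sad"]}
probs = aggregate_label_probs(logprobs, synonyms)
```

In this toy input, counting "joyful" raises the share of the "joy" label from 0.3 / 0.55 to 0.5 / 0.75.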

AI · Bearish · arXiv – CS AI · Apr 14 · 6/10

Calibration Collapse Under Sycophancy Fine-Tuning: How Reward Hacking Breaks Uncertainty Quantification in LLMs

A study reports that fine-tuning language models with sycophantic reward signals degrades their calibration (the ability to accurately quantify uncertainty) even as performance metrics improve, although the effect does not reach statistical significance in this experiment. The reward-optimized models retain structured miscalibration even after post-hoc corrections, and the study establishes a methodology for evaluating hidden degradation in fine-tuned systems.

AI · Bullish · arXiv – CS AI · Apr 10 · 6/10

Fine-grained Approaches for Confidence Calibration of LLMs in Automated Code Revision

Researchers propose fine-grained confidence calibration methods for large language models in automated code revision tasks, addressing the limitation of traditional global calibration approaches. By applying local Platt-scaling to task-specific confidence scores, the study demonstrates improved calibration accuracy across multiple code repair and refinement tasks, enabling developers to better trust LLM outputs.
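To make the contrast with global calibration concrete, here is a small sketch of Platt scaling fitted separately per task. It assumes "local" means one logistic fit per task label, which the summary does not spell out; the function names, learning rate, and step count are likewise assumptions.

```python
import math
from collections import defaultdict


def _sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def fit_platt(scores, labels, lr=0.1, steps=2000):
    """Fit p = sigmoid(a * score + b) by gradient descent on log loss."""
    a, b = 1.0, 0.0
    n = len(scores)
    for _ in range(steps):
        grad_a = grad_b = 0.0
        for s, y in zip(scores, labels):
            err = _sigmoid(a * s + b) - y  # derivative of log loss w.r.t. the logit
            grad_a += err * s
            grad_b += err
        a -= lr * grad_a / n
        b -= lr * grad_b / n
    return a, b


def fit_local_platt(records):
    """Fit one (a, b) pair per task from (task, score, label) records."""
    by_task = defaultdict(list)
    for task, score, label in records:
        by_task[task].append((score, label))
    return {
        task: fit_platt([s for s, _ in rows], [y for _, y in rows])
        for task, rows in by_task.items()
    }


def calibrate(params, task, score):
    """Map a raw confidence score to a calibrated probability for a task."""
    a, b = params[task]
    return _sigmoid(a * score + b)
```

With one fit per task, the same raw score of 0.9 can map to different calibrated probabilities depending on how reliable that score has been for each task.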

AI · Bullish · arXiv – CS AI · Mar 3 · 6/10 · 3

Calibrating Verbalized Confidence with Self-Generated Distractors

Researchers introduce DINCO (Distractor-Normalized Coherence), a method to improve confidence calibration in large language models by using self-generated alternative claims to reduce overconfidence bias. The approach addresses LLM suggestibility issues that cause models to express high confidence on low-accuracy outputs, potentially improving AI safety and trustworthiness.