#pre-registration News & Analysis

2 articles tagged with #pre-registration. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

AI · Neutral · arXiv – CS AI · Jun 5 · 7/10

A Pre-Registered Causal Partition of Self-Consistency Elicitation and Reward Design in RLVR

Researchers present a pre-registered causal decomposition framework that reveals how reinforcement learning from verifiable rewards (RLVR) conflates self-consistency elicitation with genuine reward-design effects. Through controlled experiments, they demonstrate that naive performance metrics systematically overestimate reward-design impact by 50-95%, with elicitation dominating in weak-prior regimes. The work provides diagnostic tools to audit published alignment research and expose methodological confounds.

AI · Neutral · arXiv – CS AI · May 28 · 6/10

Let the Results Speak: A Replication-First Paradigm for LLM Behavioral Benchmarking

Researchers propose a replication-first paradigm for evaluating subjective LLM behaviors such as empathy and restraint, validating results against four orthogonal properties rather than relying on a single human-rater consensus. Tests across 49 models show that aggregate performance scores mask significant regressions in specific behavioral dimensions, such as gpt-5's 1.87-point decline in advice-restraint relative to gpt-4.1.

GPT-4 · GPT-5