#model-training News & Analysis

114 articles tagged with #model-training. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

114 articles

AIBullisharXiv – CS AI · 2d ago7/10

🧠

Unlocking the Working Memory of Large Language Models for Latent Reasoning

Researchers introduce Reasoning in Memory (RiM), a novel method that enables large language models to perform internal reasoning using fixed memory blocks instead of generating intermediate tokens. The approach matches or exceeds existing reasoning methods while being more compute-efficient, as memory blocks process in a single forward pass rather than through autoregressive generation.

AIBullisharXiv – CS AI · 2d ago7/10

🧠

TRACE: Toulmin-based Reasoning Assessment through Constructive Elements for LLM CoT Evaluation

Researchers introduce TRACE, a novel metric for evaluating the reasoning quality of large language models' Chain-of-Thought outputs by analyzing argument structure rather than just final answers. The method combines Toulmin's argumentation theory with metacognitive frameworks and demonstrates strong correlation with benchmark accuracy while improving reinforcement learning performance.

AIBullisharXiv – CS AI · 2d ago7/10

🧠

GRPO is Secretly a Process Reward Model

Researchers demonstrate that Group Relative Policy Optimization (GRPO), a popular reinforcement learning algorithm using outcome rewards, mathematically functions as an implicit process reward model. The discovery enables algorithmic improvements (λ-GRPO) that enhance large language model performance on reasoning tasks without explicit process reward implementation or significant computational overhead.

AIBullisharXiv – CS AI · 3d ago7/10

🧠

The Shape of Reasoning: Topological Analysis of Reasoning Traces in Large Language Models

Researchers introduce a topological data analysis framework to evaluate reasoning quality in large language models, moving beyond traditional graph-based metrics. The study demonstrates that higher-dimensional geometric structures predict reasoning quality more effectively than standard connectivity measures, offering a practical signal for training optimization.

AINeutralarXiv – CS AI · 3d ago7/10

🧠

The Future of Facts: Tracing the Factual Generation-Verification Gap

Researchers reveal that language models verify factual information more reliably than they generate it, a phenomenon driven by distinct training dynamics rather than computational limitations. The study traces this generation-verification gap across model families and training phases, finding that models can simultaneously accept contradictory facts after updates, creating consistency issues for AI systems deployed as knowledge interfaces.

AIBullisharXiv – CS AI · 4d ago7/10

🧠

Athena: Enhancing Multimodal Reasoning with Data-efficient Process Reward Models

Researchers introduce Athena-PRM, a multimodal process reward model that evaluates reasoning steps in complex problem-solving with remarkable data efficiency, requiring only 5,000 samples. The model leverages prediction consistency between weak and strong AI completers to generate high-quality training labels, achieving state-of-the-art results across multiple benchmarks including WeMath, MathVista, and VisualProcessBench.

AIBullisharXiv – CS AI · 4d ago7/10

🧠

Curriculum Learning for Safety Alignment

Researchers propose Staged-Competence, a curriculum learning framework that enhances Direct Preference Optimisation (DPO) for AI safety alignment. The method reduces out-of-distribution harmful responses by 16% and jailbreak success rates by 20% while maintaining model capabilities, achieving baseline safety with 25% less training data.

AIBullisharXiv – CS AI · 4d ago7/10

🧠

HTMLCure: Turning Browser Experience into State Guided Repair for Interactive HTML

HTMLCure introduces a browser experience framework that improves how large language models generate functional HTML pages by testing them across multiple interactions and states rather than relying on static screenshots. The system automatically repairs broken pages through a closed-loop process, demonstrating significant performance improvements on HTML generation benchmarks.

🧠 GPT-5

AIBullisharXiv – CS AI · May 127/10

🧠

expo: Exploration-prioritized policy optimization via adaptive kl regulation and gaussian curriculum sampling

Researchers introduce EXPO, an improved reinforcement learning algorithm for LLM mathematical reasoning that dynamically adjusts KL penalty coefficients and prioritizes moderately difficult problems during training. The method demonstrates significant performance improvements over existing GRPO approaches, achieving a 13.34-point absolute gain on AIME 2025 benchmarks.

AIBullisharXiv – CS AI · May 127/10

🧠

DUET: Optimize Token-Budget Allocation for Reinforcement Learning with Verifiable Rewards

Researchers introduce DUET, a method for optimizing token allocation in reinforcement learning with verifiable rewards that jointly controls which prompts receive rollouts and how long each rollout runs. The technique achieves superior reasoning quality on math and coding benchmarks while using 50% fewer tokens than baseline methods, suggesting efficiency gains don't require sacrificing model performance.

🧠 Llama

AIBullisharXiv – CS AI · May 127/10

🧠

LEAD: Length-Efficient Adaptive and Dynamic Reasoning for Large Language Models

Researchers propose LEAD, a new method that makes large reasoning AI models more efficient by dynamically balancing accuracy and output length during training. Unlike existing approaches using static constraints, LEAD adapts per-problem length targets and reward calibration in real-time, achieving better accuracy and shorter outputs across mathematical reasoning benchmarks.

🏢 OpenAI🧠 o1

AIBullisharXiv – CS AI · May 127/10

🧠

Team-Based Self-Play With Dual Adaptive Weighting for Fine-Tuning LLMs

Researchers propose TPAW, a self-play algorithm that improves LLM alignment without human-labeled data by having models collaborate and compete against historical checkpoints while using adaptive weighting mechanisms. The approach addresses instability and diminishing optimization gains in existing self-training methods, demonstrating consistent improvements across multiple benchmarks.

AINeutralarXiv – CS AI · May 117/10

🧠

The Moltbook Files: A Harmless Slopocalypse or Humanity's Last Experiment

Researchers released the Moltbook Files, a dataset of 232k posts and 2.2M comments from a Reddit-like platform populated by AI agents, revealing that fine-tuning language models on this data reduces truthfulness by 50% but comparably to Reddit data. The study identifies significant security risks including exposed API keys and cryptocurrency seed phrases, while concluding the overall phenomenon poses manageable rather than catastrophic risks to AI safety.

AIBullisharXiv – CS AI · May 97/10

🧠

From History to State: Constant-Context Skill Learning for LLM Agents

Researchers propose constant-context skill learning, a framework enabling LLM agents to learn reusable task procedures as lightweight modules rather than storing long prompts in memory. The approach reduces token usage per inference by 2-7x while maintaining or improving performance across multiple benchmark environments, addressing the privacy-capability tradeoff in agent deployment.

🧠 Llama

AIBullisharXiv – CS AI · May 97/10

🧠

Post Reasoning: Improving the Performance of Non-Thinking Models at No Cost

Researchers introduce Post-Reasoning, a technique that improves LLM performance by having models justify answers after generating final responses, without increasing latency or token costs. The method demonstrates 17.37% mean performance improvements across 117 model-benchmark settings and establishes a new efficiency frontier for direct-answer AI capabilities.

AINeutralarXiv – CS AI · May 97/10

🧠

On Semantic Loss Fine-Tuning Approach for Preventing Model Collapse in Causal Reasoning

Researchers demonstrate that standard fine-tuning of transformer models on causal reasoning tasks causes catastrophic collapse where models learn trivial solutions while appearing accurate. They propose a semantic loss function with graph-based constraints that prevents collapse and achieves stable, context-dependent causal reasoning with 42.7% improvement over baseline models.

AIBullisharXiv – CS AI · May 47/10

🧠

AdaMeZO: Adam-style Zeroth-Order Optimizer for LLM Fine-tuning Without Maintaining the Moments

Researchers introduce AdaMeZO, a new zeroth-order optimizer that combines the memory efficiency of MeZO with Adam-style moment estimation for fine-tuning large language models. The method achieves faster convergence than MeZO while reducing GPU memory requirements and requiring up to 70% fewer forward passes.

AINeutralarXiv – CS AI · Apr 77/10

🧠

Large Language Models Align with the Human Brain during Creative Thinking

Researchers found that large language models align with human brain activity during creative thinking tasks, with alignment increasing based on model size and idea originality. Different post-training approaches selectively reshape how LLMs align with creative versus analytical neural patterns in humans.

🧠 Llama

AIBullisharXiv – CS AI · Apr 77/10

🧠

Stabilizing Unsupervised Self-Evolution of MLLMs via Continuous Softened Retracing reSampling

Researchers propose Continuous Softened Retracing reSampling (CSRS) to improve the self-evolution of Multimodal Large Language Models by addressing biases in feedback mechanisms. The method uses continuous reward signals instead of binary rewards and achieves state-of-the-art results on mathematical reasoning benchmarks like MathVision using Qwen2.5-VL-7B.

AINeutralarXiv – CS AI · Apr 67/10

🧠

Mitigating LLM biases toward spurious social contexts using direct preference optimization

Researchers developed Debiasing-DPO, a new training method that reduces harmful biases in large language models by 84% while improving accuracy by 52%. The study found that LLMs can shift predictions by up to 1.48 points when exposed to irrelevant contextual information like demographics, highlighting critical risks for high-stakes AI applications.

🧠 Llama

AIBullisharXiv – CS AI · Mar 267/10

🧠

PLDR-LLMs Reason At Self-Organized Criticality

Researchers demonstrate that PLDR-LLMs trained at self-organized criticality exhibit enhanced reasoning capabilities at inference time. The study shows that reasoning ability can be quantified using an order parameter derived from global model statistics, with models performing better when this parameter approaches zero at criticality.

AINeutralarXiv – CS AI · Mar 267/10

🧠

Mitigating Many-Shot Jailbreaking

Researchers have developed techniques to mitigate many-shot jailbreaking (MSJ) attacks on large language models, where attackers use numerous examples to override safety training. Combined fine-tuning and input sanitization approaches significantly reduce MSJ effectiveness while maintaining normal model performance.

AIBullisharXiv – CS AI · Mar 177/10

🧠

Residual Stream Analysis of Overfitting And Structural Disruptions

Researchers identified that repetitive safety training data causes large language models to develop false refusals, where benign queries are incorrectly declined. They developed FlowLens, a PCA-based analysis tool, and proposed Variance Concentration Loss (VCL) as a regularization technique that reduces false refusals by over 35 percentage points while maintaining performance.

AINeutralarXiv – CS AI · Mar 177/10

🧠

Right for the Wrong Reasons: Epistemic Regret Minimization for Causal Rung Collapse in LLMs

Researchers identify a fundamental flaw in large language models called 'Rung Collapse' where AI systems achieve correct answers through flawed causal reasoning that fails under distribution shifts. They propose Epistemic Regret Minimization (ERM) as a solution that penalizes incorrect reasoning processes independently of task success, showing 53-59% recovery of reasoning errors in experiments across six frontier LLMs.

🧠 GPT-5

AINeutralarXiv – CS AI · Mar 177/10

🧠

The Phenomenology of Hallucinations

Researchers discovered that AI language models hallucinate not from failing to detect uncertainty, but from inability to integrate uncertainty signals into output generation. The study shows models can identify uncertain inputs internally, but these signals become geometrically amplified yet functionally silent due to weak coupling with output layers.

Page 1 of 5Next →