#model-fine-tuning News & Analysis

8 articles tagged with #model-fine-tuning. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

8 articles

AIBullisharXiv – CS AI · May 127/10

🧠

BaLoRA: Bayesian Low-Rank Adaptation of Large Scale Models

Researchers introduce BaLoRA, a Bayesian extension of Low-Rank Adaptation that improves fine-tuning of large AI models by adding uncertainty quantification while narrowing the accuracy gap with full fine-tuning. The method uses input-adaptive parameterization with minimal computational overhead and demonstrates stronger performance across language, vision, and materials science tasks.

AIBullisharXiv – CS AI · May 97/10

🧠

LLM-AutoDP: Automatic Data Processing via LLM Agents for Model Fine-tuning

Researchers introduce LLM-AutoDP, a framework that uses large language models as autonomous agents to automatically optimize data processing strategies for fine-tuning without human intervention or direct data exposure. The system achieves over 80% win rates against baseline models and reduces search time by up to 10x through novel acceleration techniques, addressing critical challenges in domain-specific model training and data privacy.

AIBullisharXiv – CS AI · May 17/10

🧠

Efficient Training on Multiple Consumer GPUs with RoundPipe

Researchers introduce RoundPipe, a novel pipeline scheduling algorithm that enables efficient fine-tuning of large language models on consumer-grade GPUs by eliminating the weight binding constraint that causes computational bottlenecks. The system achieves 1.48-2.16x speedups over existing approaches and enables fine-tuning of models with up to 235 billion parameters on standard hardware.

AIBullisharXiv – CS AI · Mar 47/105

🧠

NeuroProlog: Multi-Task Fine-Tuning for Neurosymbolic Mathematical Reasoning via the Cocktail Effect

Researchers introduce NeuroProlog, a neurosymbolic framework that improves mathematical reasoning in Large Language Models by converting math problems into executable Prolog programs. The multi-task 'Cocktail' training approach shows significant accuracy improvements of 3-5% across different model sizes, with larger models demonstrating better error correction capabilities.

AINeutralarXiv – CS AI · 6d ago6/10

🧠

AMARIS: A Memory-Augmented Rubric Improvement System for Rubric-Based Reinforcement Learning

AMARIS is a new system that improves how large language models are trained using reinforcement learning by maintaining a persistent memory of past training data and failures. Unlike existing methods that only look at immediate, local information, AMARIS tracks recurring problems and previous rubric adjustments over time, achieving measurable performance improvements across multiple domains.

AINeutralarXiv – CS AI · Apr 156/10

🧠

Operationalising the Right to be Forgotten in LLMs: A Lightweight Sequential Unlearning Framework for Privacy-Aligned Deployment in Politically Sensitive Environments

Researchers introduce a sequential unlearning framework that enables Large Language Models to forget sensitive data while maintaining performance, addressing GDPR compliance and the Right to be Forgotten in politically sensitive deployments. The method stabilizes general capabilities through positive fine-tuning before selectively suppressing designated patterns, demonstrating effectiveness on the SemEval-2025 benchmark with minimal accuracy degradation.

AIBullisharXiv – CS AI · Apr 106/10

🧠

PyFi: Toward Pyramid-like Financial Image Understanding for VLMs via Adversarial Agents

Researchers introduce PyFi, a framework enabling vision language models to understand financial images through progressive reasoning chains, backed by a 600K synthetic dataset organized as a reasoning pyramid. The approach uses adversarial agents to automatically generate training data without human annotation, achieving up to 19.52% accuracy improvements on fine-tuned models.

AINeutralHugging Face Blog · Aug 102/107

🧠

Train and Fine-Tune Sentence Transformers Models

The article appears to be about training and fine-tuning sentence transformer models, which are AI models used for natural language processing tasks. However, the article body is empty, making it impossible to provide specific details about the content or methodology discussed.