16 articles tagged with #parameter-efficient. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.
AI · Bullish · arXiv – CS AI · Mar 11 · 7/10
🧠 Researchers introduce Efficient Draft Adaptation (EDA), a framework that significantly reduces the cost of adapting draft models for speculative decoding when target LLMs are fine-tuned. EDA achieves superior performance through a decoupled architecture, data regeneration, and smart sample selection while requiring substantially fewer training resources than full retraining.
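For context, a minimal sketch of the greedy speculative-decoding loop that such a draft model serves is shown below; the model interfaces and the fixed draft length `k` are illustrative assumptions, and EDA's actual adaptation procedure is not reproduced here.

```python
# Greedy speculative decoding sketch (batch size 1 assumed): the draft model
# proposes k tokens, the target model verifies them in a single forward pass,
# and only the longest agreeing prefix is accepted.
import torch

@torch.no_grad()
def speculative_step(draft_model, target_model, input_ids, k=4):
    draft_ids = input_ids
    for _ in range(k):                                   # draft proposes k tokens autoregressively
        next_id = draft_model(draft_ids).logits[:, -1].argmax(-1, keepdim=True)
        draft_ids = torch.cat([draft_ids, next_id], dim=-1)
    # Target verifies all k proposals at once; position i predicts proposal i.
    target_pred = target_model(draft_ids).logits[:, -k - 1:-1].argmax(-1)
    proposed = draft_ids[:, -k:]
    agree = (target_pred == proposed).long().cumprod(dim=-1)  # 1s up to the first mismatch
    accepted = int(agree.sum().item())
    return torch.cat([input_ids, proposed[:, :accepted]], dim=-1)
```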
AI · Bullish · arXiv – CS AI · Mar 5 · 6/10
🧠 Researchers propose Sequential Adaptive Steering (SAS), a new framework for controlling Large Language Model personalities at inference time without retraining. The method uses orthogonalized steering vectors to enable precise, multi-dimensional personality control by adjusting coefficients, validated on Big Five personality traits.
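A minimal sketch of the general idea, assuming steering vectors have already been extracted per trait: orthogonalize them, then add a weighted sum to a chosen layer's hidden states at inference time. Shapes, the injection point, and the coefficient values are all placeholder assumptions, not the paper's configuration.

```python
import torch

def orthogonalize(vectors):
    """Gram-Schmidt: make each trait's steering vector orthogonal to the others."""
    basis = []
    for v in vectors:
        for b in basis:
            v = v - (v @ b) * b
        basis.append(v / v.norm())
    return torch.stack(basis)

# Hypothetical trait vectors extracted from hidden activations (one per Big Five trait).
trait_vectors = orthogonalize(torch.randn(5, 4096))
coefficients = torch.tensor([1.5, 0.0, -0.8, 0.0, 0.3])  # dial individual traits up or down

def steer_hidden_state(hidden, trait_vectors, coefficients):
    # Add the weighted steering directions to the residual stream at a chosen layer.
    return hidden + (coefficients[:, None] * trait_vectors).sum(dim=0)
```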
AI · Bullish · arXiv – CS AI · Mar 4 · 7/10
🧠 Researchers introduce DMTrack, a novel dual-adapter architecture for spatio-temporal multimodal tracking that achieves state-of-the-art performance with only 0.93M trainable parameters. The system uses two key modules, a spatio-temporal modality adapter and a progressive modality complementary adapter, to bridge gaps between different modalities and enable better cross-modality fusion.
AI · Bullish · arXiv – CS AI · Mar 3 · 7/10
🧠 Researchers introduce SVDecode, a new method for adapting large language models to specific tasks without extensive fine-tuning. The technique uses steering vectors during decoding to align output distributions with task requirements, improving accuracy by up to 5 percentage points while adding minimal computational overhead.
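As a rough illustration of decoding-time steering (not the paper's exact procedure), one can shift the final hidden state by a task steering vector before the language-model head at each step; the model interface, hook point, and strength `alpha` below are assumptions.

```python
import torch

@torch.no_grad()
def steered_greedy_decode(model, lm_head, input_ids, steering_vec, alpha=1.0, max_new_tokens=32):
    for _ in range(max_new_tokens):
        hidden = model(input_ids).last_hidden_state[:, -1, :]  # final-position hidden state
        logits = lm_head(hidden + alpha * steering_vec)         # shift the output distribution
        next_id = logits.argmax(dim=-1, keepdim=True)
        input_ids = torch.cat([input_ids, next_id], dim=-1)
    return input_ids
```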
AI · Bullish · arXiv – CS AI · Feb 27 · 7/10
🧠 Researchers introduce NoRA (Non-linear Rank Adaptation), a new parameter-efficient fine-tuning method that overcomes the 'linear ceiling' limitation of traditional LoRA by using SiLU gating and structural dropout. NoRA achieves superior performance at rank 64 compared to LoRA at rank 512, demonstrating significant efficiency gains in complex reasoning tasks.
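A hedged sketch of what a SiLU-gated low-rank update could look like, by analogy with standard LoRA; the module names, placement of dropout, and initialization are assumptions rather than the paper's implementation.

```python
import torch
import torch.nn as nn

class NonlinearLowRankAdapter(nn.Module):
    def __init__(self, d_model, rank=64, dropout=0.1, scale=1.0):
        super().__init__()
        self.down = nn.Linear(d_model, rank, bias=False)   # project to low rank
        self.gate = nn.Linear(d_model, rank, bias=False)   # gating branch
        self.up = nn.Linear(rank, d_model, bias=False)     # project back
        self.drop = nn.Dropout(dropout)                    # stand-in for "structural" dropout
        self.scale = scale
        nn.init.zeros_(self.up.weight)                     # start as a zero (identity) update

    def forward(self, x, frozen_out):
        # SiLU gating breaks the purely linear update of vanilla LoRA.
        update = self.up(self.drop(self.down(x) * torch.nn.functional.silu(self.gate(x))))
        return frozen_out + self.scale * update
```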
AI · Neutral · arXiv – CS AI · 3d ago · 6/10
🧠 Researchers introduce VisPrompt, a framework that improves prompt learning for vision-language models by injecting visual semantic information to enhance robustness against label noise. The approach keeps pre-trained models frozen while adding minimal trainable parameters, demonstrating superior performance across seven benchmark datasets under both synthetic and real-world noisy conditions.
AI · Neutral · arXiv – CS AI · Mar 26 · 6/10
🧠 Researchers introduce SPARE, a new machine unlearning method for text-to-image diffusion models that efficiently removes unwanted concepts while preserving model performance. The two-stage approach uses parameter localization and self-distillation to achieve selective concept erasure with minimal computational overhead.
AI · Neutral · arXiv – CS AI · Mar 17 · 6/10
🧠 Researchers propose a new framework for improving safety in multimodal AI models by targeting unsafe relationships between objects rather than removing entire concepts. The approach uses parameter-efficient edits to suppress dangerous combinations while preserving benign uses of the same objects and relations.
AI · Bullish · arXiv – CS AI · Mar 17 · 6/10
🧠 Researchers introduce IGU-LoRA, a new parameter-efficient fine-tuning method for large language models that adaptively allocates ranks across layers using integrated gradients and uncertainty-aware scoring. The approach addresses limitations of existing methods like AdaLoRA by providing more stable and accurate layer importance estimates, consistently outperforming baselines across diverse tasks.
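The rank-allocation step can be pictured as a toy budget split over per-layer importance scores; the scores below are placeholders standing in for the paper's integrated-gradient and uncertainty estimates, and the proportional rule is an assumption.

```python
def allocate_ranks(importance, total_rank_budget, min_rank=2):
    """Split a total LoRA rank budget across layers in proportion to importance."""
    total = sum(importance.values())
    ranks = {}
    for layer, score in importance.items():
        share = score / total if total > 0 else 1 / len(importance)
        ranks[layer] = max(min_rank, round(share * total_rank_budget))
    return ranks

# Hypothetical importance scores per module (placeholders, not measured values).
importance = {"layer.0.attn": 0.9, "layer.0.mlp": 0.4, "layer.1.attn": 1.6, "layer.1.mlp": 0.1}
print(allocate_ranks(importance, total_rank_budget=32))
```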
AI · Bullish · arXiv – CS AI · Mar 12 · 6/10
🧠 Researchers developed a new continual learning framework for human activity recognition (HAR) in IoT wearable devices that prevents AI models from forgetting previous tasks when learning new ones. The method uses gated adaptation to achieve 77.7% accuracy while reducing forgetting from 39.7% to 16.2%, training only 2% of parameters.
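A minimal sketch of gated adaptation over a frozen backbone, in the spirit described above; the bottleneck size, scalar sigmoid gate, and stand-in backbone are illustrative assumptions.

```python
import torch
import torch.nn as nn

class GatedTaskAdapter(nn.Module):
    def __init__(self, d_model, bottleneck=16):
        super().__init__()
        self.adapter = nn.Sequential(nn.Linear(d_model, bottleneck), nn.ReLU(),
                                     nn.Linear(bottleneck, d_model))
        self.gate = nn.Parameter(torch.zeros(1))  # learned per-task scalar gate

    def forward(self, features):
        # The gate controls how strongly the new task's adapter perturbs the frozen features.
        return features + torch.sigmoid(self.gate) * self.adapter(features)

backbone = nn.Linear(64, 64)      # stands in for the pretrained HAR feature extractor
for p in backbone.parameters():
    p.requires_grad = False       # freeze: only the adapter and gate are trained per task
```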
AI · Bullish · arXiv – CS AI · Feb 27 · 6/10
🧠 Researchers developed pMoE, a novel parameter-efficient fine-tuning method that combines multiple expert domains through specialized prompt tokens and dynamic dispatching. Testing across 47 visual adaptation tasks in classification and segmentation shows superior performance with improved computational efficiency compared to existing methods.
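Conceptually, prompt-based expert dispatching might look like the sketch below, where a lightweight router picks an expert prompt pool to prepend to the token sequence; hard top-1 routing and all dimensions are simplifying assumptions, not the paper's design.

```python
import torch
import torch.nn as nn

class PromptExpertDispatcher(nn.Module):
    def __init__(self, d_model, num_experts=4, prompt_len=8):
        super().__init__()
        self.expert_prompts = nn.Parameter(torch.randn(num_experts, prompt_len, d_model) * 0.02)
        self.router = nn.Linear(d_model, num_experts)

    def forward(self, tokens):                      # tokens: (batch, seq, d_model)
        scores = self.router(tokens.mean(dim=1))    # route on a pooled representation
        expert = scores.argmax(dim=-1)              # hard top-1 choice per example (simplification)
        prompts = self.expert_prompts[expert]       # (batch, prompt_len, d_model)
        return torch.cat([prompts, tokens], dim=1)  # prepend the chosen expert's prompt tokens
```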
AI · Bullish · arXiv – CS AI · Feb 27 · 6/10
🧠 Researchers introduce NTK-CL, a new framework for parameter-efficient fine-tuning in continual learning that uses Neural Tangent Kernel theory to address catastrophic forgetting. The approach achieves state-of-the-art performance by tripling the feature representation and implementing adaptive mechanisms to maintain task-specific knowledge while learning new tasks.
AI · Neutral · arXiv – CS AI · Mar 16 · 4/10
🧠 Researchers propose SERA, a new architecture for referring image segmentation that uses mixture-of-experts and expression-aware routing to improve pixel-level mask generation from natural language descriptions. The system introduces lightweight expert refinement stages and parameter-efficient tuning that updates less than 1% of backbone parameters while achieving superior performance on spatial localization and boundary delineation tasks.
AI · Neutral · arXiv – CS AI · Mar 5 · 4/10
🧠 Researchers trained a compact 1.5B parameter language model to solve beam physics problems using reinforcement learning with verifiable rewards, achieving a 66.7% improvement in accuracy. However, the model learned pattern-matching templates rather than true physics reasoning, failing to generalize to topological changes despite mastering the same underlying equations.
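The "verifiable reward" idea can be illustrated with a simple checker that grants reward only when the model's final numeric answer matches the known solution; the answer-parsing heuristic and tolerance below are assumptions, and the paper's reward design may differ.

```python
import re

def verifiable_reward(model_output: str, reference_answer: float, tol: float = 1e-3) -> float:
    """Return 1.0 if the last number in the model's output matches the reference, else 0.0."""
    numbers = re.findall(r"-?\d+\.?\d*(?:[eE]-?\d+)?", model_output)
    if not numbers:
        return 0.0
    try:
        predicted = float(numbers[-1])   # treat the last number as the final answer
    except ValueError:
        return 0.0
    return 1.0 if abs(predicted - reference_answer) <= tol else 0.0
```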
AI · Neutral · arXiv – CS AI · Mar 3 · 4/10
🧠 Researchers propose TAP-SLF, a parameter-efficient framework for adapting Vision Foundation Models to multiple ultrasound medical imaging tasks simultaneously. The method uses task-aware prompting and selective layer fine-tuning to achieve effective performance while avoiding overfitting on limited medical data.
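In its simplest reading, selective layer fine-tuning amounts to freezing the foundation model and re-enabling gradients on a chosen subset of blocks while learning per-task prompts; the ViT-style `blocks` attribute, layer indices, and embedding sizes below are assumptions.

```python
import torch.nn as nn

def selective_finetune(vision_backbone, trainable_block_ids):
    for p in vision_backbone.parameters():
        p.requires_grad = False                      # freeze everything by default
    for i, block in enumerate(vision_backbone.blocks):
        if i in trainable_block_ids:                 # re-enable only the selected layers
            for p in block.parameters():
                p.requires_grad = True

# Hypothetical per-task prompt embeddings, one learned prompt per ultrasound task.
task_prompts = nn.Embedding(num_embeddings=4, embedding_dim=768)
```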
AI · Bullish · arXiv – CS AI · Mar 2 · 4/10
🧠 Researchers developed a cost-effective method to adapt large language models to minority dialects using continual pre-training and LoRA techniques, successfully improving Quebec French dialect performance with minimal computational resources. The study demonstrates that parameter-efficient fine-tuning can expand quality LLM access to underserved linguistic communities while updating only 1% of model parameters.
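The general recipe (continual pre-training with LoRA adapters) can be sketched with Hugging Face PEFT; the base model, rank, and target modules below are illustrative choices rather than the study's exact configuration.

```python
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

# Example base model; the study's choice of checkpoint is not assumed here.
model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")
config = LoraConfig(r=16, lora_alpha=32, lora_dropout=0.05,
                    target_modules=["q_proj", "v_proj"], task_type="CAUSAL_LM")
model = get_peft_model(model, config)
model.print_trainable_parameters()  # typically on the order of 1% of the base model
# ...then run standard causal-LM training on dialect text (e.g. Quebec French) with this model.
```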