AINeutralarXiv – CS AI · 6d ago6/10
🧠Researchers demonstrate that jointly training language models for both reasoning and tool-use in agentic RL creates measurable performance interference. They introduce DART, a framework that decouples these capabilities through separate low-rank adaptation modules, achieving superior results across thirteen benchmarks and approaching theoretical performance limits.
AINeutralarXiv – CS AI · May 286/10
🧠Researchers propose E²-LoRA, a novel continual learning method that addresses task interference by concentrating knowledge into low-rank representations rather than spreading it across multiple basis vectors. The approach theoretically proves that preserving parameters along principal drift directions minimizes reconstruction error while freeing model capacity for future tasks.
AINeutralarXiv – CS AI · May 96/10
🧠Researchers introduce CRAFT, a continual learning framework for large language models that prevents catastrophic forgetting by learning low-rank interventions on hidden representations rather than updating model weights. The three-stage approach uses KL divergence-based routing and merging to enable models to acquire new capabilities while maintaining performance on previously learned tasks.
AIBullisharXiv – CS AI · Apr 146/10
🧠Researchers propose NExt, a nonlinear extrapolation framework that accelerates reinforcement learning with verifiable rewards (RLVR) for large language models by modeling low-rank parameter trajectories. The method reduces computational overhead by approximately 37.5% while remaining compatible with various RLVR algorithms, addressing a key bottleneck in scaling LLM training.
AIBullisharXiv – CS AI · Mar 176/10
🧠AdapterTune introduces a new method for efficiently fine-tuning Vision Transformers by using zero-initialized low-rank adapters that start at the pretrained function to prevent optimization instability. The technique achieves +14.9 point accuracy improvement over head-only transfer while using only 0.92% of parameters needed for full fine-tuning.
AIBullisharXiv – CS AI · Mar 55/10
🧠Researchers have developed a new approach using multiplicative LoRA (Low-Rank Adaptation) weights for neural field representation learning, achieving improved quality in reconstruction, generation, and analysis tasks. The method constrains optimization space through pre-trained base models, creating structured weight representations that outperform existing weight-space methods when used with latent diffusion models.
AIBullisharXiv – CS AI · Mar 45/103
🧠Researchers developed GLoRIA, a parameter-efficient framework for automatic speech recognition that adapts to regional dialects using location metadata. The system achieves state-of-the-art performance while updating less than 10% of model parameters and demonstrates strong generalization to unseen dialects.
AIBullishHugging Face Blog · Jun 196/106
🧠The article discusses fine-tuning FLUX.1-dev using LoRA (Low-Rank Adaptation) techniques on consumer-grade hardware. This approach makes advanced AI model customization more accessible to individual developers and smaller organizations without requiring enterprise-level computing resources.