#continual-learning News & Analysis

111 articles tagged with #continual-learning. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

111 articles

AIBullisharXiv – CS AI · Jun 237/10

🧠

Social World Model for Lifelong Social Intelligence

Researchers propose the Social World Model, a framework for continuous learning in language agents through structured social interaction decomposition across five dimensions. The approach demonstrates that smaller open-source models like Qwen2.5-7B can achieve competitive social intelligence capabilities comparable to closed-source alternatives while maintaining performance across difficulty levels.

🧠 Gemini

AIBullisharXiv – CS AI · Jun 197/10

🧠

MEAL: A Benchmark for Continual Multi-Agent Reinforcement Learning

Researchers introduce MEAL, the first benchmark for continual multi-agent reinforcement learning, which uses JAX and GPU acceleration to enable training on sequences of 100 tasks in hours rather than days. The work reveals that longer task sequences expose failure modes invisible in traditional small-scale benchmarks, addressing a critical gap in RL research where computational constraints have limited study to only 3-10 sequential tasks.

AIBullisharXiv – CS AI · Jun 197/10

🧠

cAPM: Continual AI-Assisted Pace-Mapping with Active Learning

Researchers introduce cAPM, an AI-assisted system that uses continual learning and active learning to improve cardiac pace-mapping procedures for treating ventricular tachycardia. The system demonstrates 81% localization accuracy using only 4.5 pacing sites compared to 38% accuracy with 13.7 sites for existing methods, potentially reducing procedure time and patient risk.

AINeutralarXiv – CS AI · Jun 117/10

🧠

Federated continual learning: A comprehensive survey on lifelong and privacy-preserving learning over distributed and non-stationary data

A comprehensive survey examines Federated Continual Learning (FCL), which combines federated learning's privacy-preserving distributed training with continual learning's ability to adapt to evolving data. The research addresses a critical gap in current FL systems that assume static data, proposing frameworks for real-world applications like healthcare and IoT where data streams continuously shift, causing performance degradation and catastrophic forgetting.

AIBullisharXiv – CS AI · Jun 97/10

🧠

Not Just After One: Sleep-Inspired Replay Prevents Catastrophic Forgetting After Sequential Tasks

Researchers demonstrate that artificial neural networks can mitigate catastrophic forgetting—the tendency to lose previously learned information when training on new tasks—by applying unsupervised replay mechanisms after sequential learning periods, mimicking biological sleep-based memory consolidation. This approach defers interference correction until after multiple new tasks are learned, suggesting a more efficient pathway for developing continual learning AI systems.

AIBullisharXiv – CS AI · Jun 87/10

🧠

Just-In-Time Reinforcement Learning: Continual Learning in LLM Agents Without Gradient Updates

Researchers introduce Just-In-Time Reinforcement Learning (JitRL), a training-free framework that enables LLM agents to continuously adapt after deployment without gradient updates or fine-tuning. The method uses dynamic memory retrieval to estimate action advantages and modulate output logits, achieving state-of-the-art performance on complex tasks while reducing computational costs by over 30 times compared to traditional fine-tuning approaches.

AINeutralarXiv – CS AI · Jun 57/10

🧠

Continual Learning Bench: Evaluating Frontier AI Systems in Real-World Stateful Environments

Researchers introduce Continual Learning Bench (CL-Bench), the first comprehensive benchmark for evaluating whether LLM-based AI systems genuinely improve through sequential experience across real-world domains. Testing frontier models reveals significant gaps in current continual learning capabilities, with systems frequently overfitting to immediate observations and failing to reuse knowledge effectively.

AIBullisharXiv – CS AI · Jun 47/10

🧠

Scaling Self-Evolving Agents via Parametric Memory

Researchers introduce TMEM, a parametric memory framework that enables AI agents to learn and evolve within a single episode by updating LoRA weights online, rather than merely retrieving frozen memories. This approach combines explicit memory storage with fast adaptive weights, allowing agents to genuinely improve their policy during rollouts and demonstrates consistent performance gains across multiple benchmarks.

AIBullisharXiv – CS AI · Jun 17/10

🧠

SHIELD: Secure Hypernetworks for Incremental Expansion Learning Defense

Researchers introduce SHIELD, a novel machine learning framework that combines Interval Bound Propagation with hypernetwork architecture to achieve certifiably robust continual learning without replay buffers. The method uses task-specific embeddings and a new Interval MixUp training strategy to maintain security across sequential tasks while outperforming existing approaches on adversarial benchmarks.

AIBullisharXiv – CS AI · May 297/10

🧠

Overcoming Forgetting in LLM Fine-Tuning with Evolution Strategies

Researchers demonstrate that Evolution Strategies (ES) can effectively fine-tune large language models without catastrophic forgetting of prior tasks, contrary to recent concerns. By introducing Anchored Weight Decay (AWD), a regularization technique that constrains optimization toward initial parameters, the work shows ES-based continual learning is viable and computationally efficient compared to reinforcement learning approaches.

AINeutralarXiv – CS AI · May 287/10

🧠

The Future of Facts: Tracing the Factual Generation-Verification Gap

Researchers reveal that language models verify factual information more reliably than they generate it, a phenomenon driven by distinct training dynamics rather than computational limitations. The study traces this generation-verification gap across model families and training phases, finding that models can simultaneously accept contradictory facts after updates, creating consistency issues for AI systems deployed as knowledge interfaces.

AIBullisharXiv – CS AI · May 287/10

🧠

CLANE: Continual Learning of Actions on Neuromorphic Hardware from Event Cameras

Researchers have developed CLANE, a neuromorphic hardware system deployed on Intel Loihi 2 that enables continuous learning of human actions from event cameras without forgetting previously learned classes. The system achieves 70.4% accuracy on a 50-class action recognition dataset while consuming 100x less energy and delivering 16x lower latency than conventional GPU-based approaches, advancing on-device AI for AR/VR and robotics applications.

AINeutralarXiv – CS AI · May 277/10

🧠

ICCU: In-Context Continual Unlearning via Pattern-Induced Refusal Rules

Researchers introduce ICCU, an in-context continual unlearning framework that removes specific data influence from language models without modifying parameters. The method uses pattern-induced refusal rules applied at inference time, addressing the inefficiency of sequential unlearning requests in production deployments.

AIBullisharXiv – CS AI · May 117/10

🧠

CASCADE: Case-Based Continual Adaptation for Large Language Models During Deployment

Researchers introduce CASCADE, a framework enabling large language models to continuously learn and improve during deployment without modifying parameters, using an episodic memory system formulated as a contextual bandit problem. The approach demonstrates 20.9% improvement over zero-shot prompting across 16 diverse tasks, addressing a fundamental limitation in current LLM lifecycles where learning stops after training ends.

AIBullisharXiv – CS AI · May 117/10

🧠

From Storage to Experience: A Survey on the Evolution of LLM Agent Memory Mechanisms

Researchers propose a unified evolutionary framework for LLM agent memory systems, categorizing development into three stages: Storage, Reflection, and Experience. The framework addresses fragmented research by synthesizing engineering and cognitive science perspectives, offering design principles for building more capable autonomous AI agents.

AIBullisharXiv – CS AI · May 97/10

🧠

Continually Evolving Skill Knowledge in Vision Language Action Model

Researchers introduce Stellar VLA, a continual learning framework for vision-language-action models that improves knowledge accumulation without adding network parameters. The approach uses knowledge-guided expert routing and hierarchical task structures, achieving strong performance on robotics benchmarks with minimal data replay and validated real-world transfer capabilities.

AIBullisharXiv – CS AI · May 77/10

🧠

Skill Neologisms: Towards Skill-based Continual Learning

Researchers propose skill neologisms—soft tokens added to LLM vocabularies—as a scalable approach to continual learning that enables models to acquire new capabilities without catastrophic forgetting or weight updates. The method demonstrates that independently trained skill tokens can compose zero-shot and work with out-of-distribution tasks, offering a practical alternative to fine-tuning.

AIBullisharXiv – CS AI · May 77/10

🧠

Memory as a Markov Matrix: Sample Efficient Knowledge Expansion via Token-to-Dictionary Mapping

Researchers propose a novel framework that models language model memory as a Markov transition matrix, enabling efficient incorporation of new knowledge without catastrophic forgetting. The approach requires only linear sample complexity in the number of existing tokens and achieves zero forgetting through minimal parameter updates via an embedding-tuning algorithm.

AINeutralarXiv – CS AI · Apr 207/10

🧠

Why Fine-Tuning Encourages Hallucinations and How to Fix It

Researchers identify that supervised fine-tuning of large language models increases hallucinations by degrading pre-existing knowledge through semantic interference. The study proposes self-distillation and parameter freezing techniques to mitigate this problem while preserving task performance.

AINeutralarXiv – CS AI · Apr 107/10

🧠

Information as Structural Alignment: A Dynamical Theory of Continual Learning

Researchers introduce the Informational Buildup Framework (IBF), a new approach to continual learning that eliminates catastrophic forgetting by treating information as structural alignment rather than stored parameters. The framework demonstrates superior performance across multiple domains including chess and image classification, achieving near-zero forgetting without requiring raw data replay.

AINeutralarXiv – CS AI · Mar 267/10

🧠

Evidence of an Emergent "Self" in Continual Robot Learning

Researchers propose a method to identify 'self-awareness' in AI systems by analyzing invariant cognitive structures that remain stable during continual learning. Their study found that robots subjected to continual learning developed significantly more stable subnetworks compared to control groups, suggesting this could be evidence of an emergent 'self' concept.

AIBearisharXiv – CS AI · Mar 177/10

🧠

Narrow Fine-Tuning Erodes Safety Alignment in Vision-Language Agents

Research reveals that fine-tuning aligned vision-language AI models on narrow harmful datasets causes severe safety degradation that generalizes across unrelated tasks. The study shows multimodal models exhibit 70% higher misalignment than text-only evaluation suggests, with even 10% harmful training data causing substantial alignment loss.

AIBullisharXiv – CS AI · Mar 57/10

🧠

Self-adapting Robotic Agents through Online Continual Reinforcement Learning with World Model Feedback

Researchers have developed a new framework for robotic agents that can adapt and learn continuously during operation, rather than being limited to fixed parameters from offline training. The system uses world model prediction residuals to detect unexpected events and automatically trigger self-improvement without external supervision.

AIBullisharXiv – CS AI · Mar 56/10

🧠

Pretrained Vision-Language-Action Models are Surprisingly Resistant to Forgetting in Continual Learning

Researchers discovered that pretrained Vision-Language-Action (VLA) models demonstrate remarkable resistance to catastrophic forgetting in continual learning scenarios, unlike smaller models trained from scratch. Simple Experience Replay techniques achieve near-zero forgetting with minimal replay data, suggesting large-scale pretraining fundamentally changes continual learning dynamics for robotics applications.

AIBullisharXiv – CS AI · Mar 46/103

🧠

Beyond Prompt Degradation: Prototype-guided Dual-pool Prompting for Incremental Object Detection

Researchers propose PDP, a new framework for Incremental Object Detection that addresses prompt degradation issues in AI models. The method achieves significant improvements of 9.2% AP on MS-COCO and 3.3% AP on PASCAL VOC benchmarks through dual-pool prompt decoupling and prototype-guided pseudo-label generation.

Page 1 of 5Next →