y0news

#catastrophic-forgetting News & Analysis

41 articles tagged with #catastrophic-forgetting. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

AI · Neutral · arXiv – CS AI · Apr 14 · 6/10

If an LLM Were a Character, Would It Know Its Own Story? Evaluating Lifelong Learning in LLMs

Researchers introduce LIFESTATE-BENCH, a benchmark for evaluating lifelong learning capabilities in large language models through multi-turn interactions using narrative datasets like Hamlet. Testing shows nonparametric approaches significantly outperform parametric methods, but all models struggle with catastrophic forgetting over extended interactions, revealing fundamental limitations in LLM memory and consistency.

🧠 GPT-4 · 🧠 Llama
AI · Bullish · arXiv – CS AI · Apr 14 · 6/10

Data Mixing Agent: Learning to Re-weight Domains for Continual Pre-training

Researchers present Data Mixing Agent, an AI framework that uses reinforcement learning to automatically optimize how large language models balance training data from source and target domains during continual pre-training. The approach outperforms manual reweighting strategies while generalizing across different models, domains, and fields without requiring retraining.

AI · Bullish · arXiv – CS AI · Mar 17 · 6/10

Deconfounded Lifelong Learning for Autonomous Driving via Dynamic Knowledge Spaces

Researchers propose DeLL, a new framework for autonomous driving systems that addresses lifelong learning challenges through dynamic knowledge spaces and causal inference mechanisms. The system uses Dirichlet process mixture models to prevent catastrophic forgetting and improve adaptability to new driving scenarios while maintaining previously learned knowledge.

AI · Bullish · arXiv – CS AI · Mar 17 · 6/10

CATFormer: When Continual Learning Meets Spiking Transformers With Dynamic Thresholds

Researchers introduce CATFormer, a spiking neural network architecture that targets catastrophic forgetting in continual learning through dynamic threshold neurons. The framework uses context-adaptive thresholds and task-agnostic inference, with the aim of retaining knowledge across multiple learning tasks without performance degradation.

AI · Neutral · arXiv – CS AI · Mar 16 · 6/10

Continual Learning in Large Language Models: Methods, Challenges, and Opportunities

This comprehensive survey examines continual learning methodologies for large language models, focusing on three core training stages and methods to mitigate catastrophic forgetting. The research reveals that while current approaches show promise in specific domains, fundamental challenges remain in achieving seamless knowledge integration across diverse tasks and temporal scales.

AI · Bullish · arXiv – CS AI · Mar 16 · 6/10

Multimodal Continual Learning with MLLMs from Multi-scenario Perspectives

Researchers developed UNIFIER, a continual learning framework for multimodal large language models (MLLMs) to adapt to changing visual scenarios without catastrophic forgetting. The framework addresses visual discrepancies across different environments like high-altitude, underwater, low-altitude, and indoor scenarios, showing significant improvements over existing methods.

🏢 Hugging Face
AI · Bullish · arXiv – CS AI · Mar 12 · 6/10

Gated Adaptation for Continual Learning in Human Activity Recognition

Researchers developed a new continual learning framework for human activity recognition (HAR) on IoT wearable devices that reduces how much AI models forget previous tasks when learning new ones. The method uses gated adaptation to achieve 77.7% accuracy while reducing forgetting from 39.7% to 16.2% and training only 2% of parameters.
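
For readers unfamiliar with how a "forgetting" percentage like the one above is typically computed, here is a minimal sketch. It assumes the common definition from the continual learning literature (for each earlier task, the best accuracy ever reached minus the accuracy after the final task, averaged over tasks); the summary does not say whether the paper uses exactly this definition. The function name `average_forgetting` and the accuracy values are hypothetical.

```python
def average_forgetting(acc):
    """Average forgetting over all tasks except the last one.

    acc[i][j] is the accuracy on task j measured right after
    training on task i (so row i has i + 1 entries).
    """
    n_tasks = len(acc)
    final = acc[-1]  # accuracies after the last training stage
    drops = []
    for j in range(n_tasks - 1):  # the last task has had no chance to be forgotten
        # Best accuracy on task j at any stage before the final one.
        best_past = max(acc[i][j] for i in range(j, n_tasks - 1))
        drops.append(best_past - final[j])
    return sum(drops) / len(drops)


# Hypothetical accuracies for three sequential tasks (not from the paper).
acc = [
    [0.90],
    [0.70, 0.88],
    [0.60, 0.80, 0.85],
]
print(average_forgetting(acc))  # roughly 0.19, i.e. 19 points of forgetting
```

Under this definition, a drop from 39.7% to 16.2% would mean earlier tasks lose far less accuracy by the end of training.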

AI · Bullish · arXiv – CS AI · Mar 11 · 6/10

MSSR: Memory-Aware Adaptive Replay for Continual LLM Fine-Tuning

Researchers propose MSSR (Memory-Inspired Sampler and Scheduler Replay), a new framework for continual fine-tuning of large language models that mitigates catastrophic forgetting while maintaining adaptability. The method estimates sample-level memory strength and schedules rehearsal at adaptive intervals, showing superior performance across three backbone models and 11 sequential tasks compared to existing replay-based strategies.

AI · Bullish · arXiv – CS AI · Mar 3 · 6/10 · 8

IDER: IDempotent Experience Replay for Reliable Continual Learning

Researchers propose IDER (Idempotent Experience Replay), a new continual learning method that addresses catastrophic forgetting in neural networks while improving prediction reliability. The approach uses idempotent properties to help AI models retain previously learned knowledge when acquiring new tasks, with demonstrated improvements in accuracy and reduced computational overhead.

AI · Bullish · arXiv – CS AI · Mar 3 · 6/10 · 9

Surgical Post-Training: Cutting Errors, Keeping Knowledge

Researchers introduce Surgical Post-Training (SPoT), a new method to improve Large Language Model reasoning while preventing catastrophic forgetting. SPoT achieved a 6.2% accuracy improvement on Qwen3-8B using only 4k data pairs and 28 minutes of training, offering a more efficient alternative to traditional post-training approaches.

AI · Bullish · arXiv – CS AI · Mar 3 · 7/10 · 4

Modular Memory is the Key to Continual Learning Agents

Researchers propose combining In-Weight Learning (IWL) and In-Context Learning (ICL) through modular memory architectures to solve continual learning challenges in AI. The framework aims to enable AI agents to continuously adapt and accumulate knowledge without catastrophic forgetting, addressing key limitations of current foundation models.

AI · Bullish · arXiv – CS AI · Feb 27 · 6/10 · 7

Parameter-Efficient Fine-Tuning for Continual Learning: A Neural Tangent Kernel Perspective

Researchers introduce NTK-CL, a new framework for parameter-efficient fine-tuning in continual learning that uses Neural Tangent Kernel theory to address catastrophic forgetting. The approach achieves state-of-the-art performance by tripling feature representation and implementing adaptive mechanisms to maintain task-specific knowledge while learning new tasks.

AI · Neutral · arXiv – CS AI · Mar 4 · 4/10 · 2

Temporal Imbalance of Positive and Negative Supervision in Class-Incremental Learning

Researchers have identified temporal imbalance as a key factor causing catastrophic forgetting in Class-Incremental Learning (CIL) systems. They propose Temporal-Adjusted Loss (TAL), a new method that uses temporal decay kernels to reweight negative supervision, demonstrating significant improvements in reducing forgetting across multiple CIL benchmarks.

AI · Neutral · Google Research Blog · Nov 7 · 4/10 · 5

Introducing Nested Learning: A new ML paradigm for continual learning

A new machine learning paradigm called Nested Learning has been introduced for continual learning applications. This represents a theoretical advancement in AI algorithms that could improve how AI systems learn and adapt over time without forgetting previous knowledge.

AI · Neutral · arXiv – CS AI · Mar 3 · 4/10 · 6

Quantifying Catastrophic Forgetting in IoT Intrusion Detection Systems

Researchers developed a framework to address catastrophic forgetting in IoT intrusion detection systems using continual learning approaches. The study benchmarked five methods across 48 attack domains, finding that replay-based approaches performed best overall while Synaptic Intelligence achieved near-zero forgetting with high efficiency.
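
As background on what "replay-based" means here, the following is a minimal sketch of a generic experience replay buffer that uses reservoir sampling to keep a bounded, uniformly sampled memory of past examples. It is not the study's implementation; the class name `ReservoirReplayBuffer`, its parameters, and the integer "examples" are hypothetical and chosen only for illustration.

```python
import random


class ReservoirReplayBuffer:
    """Fixed-size memory of past examples, filled by reservoir sampling."""

    def __init__(self, capacity, seed=0):
        self.capacity = capacity
        self.items = []
        self.seen = 0  # total number of examples offered so far
        self.rng = random.Random(seed)

    def add(self, example):
        self.seen += 1
        if len(self.items) < self.capacity:
            self.items.append(example)
        else:
            # Each example seen so far stays with probability capacity / seen.
            slot = self.rng.randrange(self.seen)
            if slot < self.capacity:
                self.items[slot] = example

    def sample(self, k):
        # Draw up to k stored examples to mix into the current training batch.
        return self.rng.sample(self.items, min(k, len(self.items)))


buffer = ReservoirReplayBuffer(capacity=5)
for example in range(100):  # stand-in for a stream of training examples
    buffer.add(example)
replayed = buffer.sample(3)
```

During training on a new attack domain, a few replayed examples from earlier domains would be mixed into each batch so the model keeps seeing old data.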
