AIBullisharXiv – CS AI · 2d ago7/10
🧠Researchers demonstrate that Evolution Strategies (ES) can effectively fine-tune large language models without catastrophic forgetting of prior tasks, contrary to recent concerns. By introducing Anchored Weight Decay (AWD), a regularization technique that constrains optimization toward initial parameters, the work shows ES-based continual learning is viable and computationally efficient compared to reinforcement learning approaches.
AIBullisharXiv – CS AI · 3d ago7/10
🧠Researchers have developed CLANE, a neuromorphic hardware system deployed on Intel Loihi 2 that enables continuous learning of human actions from event cameras without forgetting previously learned classes. The system achieves 70.4% accuracy on a 50-class action recognition dataset while consuming 100x less energy and delivering 16x lower latency than conventional GPU-based approaches, advancing on-device AI for AR/VR and robotics applications.
AINeutralarXiv – CS AI · 3d ago7/10
🧠Researchers reveal that language models verify factual information more reliably than they generate it, a phenomenon driven by distinct training dynamics rather than computational limitations. The study traces this generation-verification gap across model families and training phases, finding that models can simultaneously accept contradictory facts after updates, creating consistency issues for AI systems deployed as knowledge interfaces.
AINeutralarXiv – CS AI · 4d ago7/10
🧠Researchers introduce ICCU, an in-context continual unlearning framework that removes specific data influence from language models without modifying parameters. The method uses pattern-induced refusal rules applied at inference time, addressing the inefficiency of sequential unlearning requests in production deployments.
AIBullisharXiv – CS AI · May 117/10
🧠Researchers introduce CASCADE, a framework enabling large language models to continuously learn and improve during deployment without modifying parameters, using an episodic memory system formulated as a contextual bandit problem. The approach demonstrates 20.9% improvement over zero-shot prompting across 16 diverse tasks, addressing a fundamental limitation in current LLM lifecycles where learning stops after training ends.
AIBullisharXiv – CS AI · May 117/10
🧠Researchers propose a unified evolutionary framework for LLM agent memory systems, categorizing development into three stages: Storage, Reflection, and Experience. The framework addresses fragmented research by synthesizing engineering and cognitive science perspectives, offering design principles for building more capable autonomous AI agents.
AIBullisharXiv – CS AI · May 97/10
🧠Researchers introduce Stellar VLA, a continual learning framework for vision-language-action models that improves knowledge accumulation without adding network parameters. The approach uses knowledge-guided expert routing and hierarchical task structures, achieving strong performance on robotics benchmarks with minimal data replay and validated real-world transfer capabilities.
AIBullisharXiv – CS AI · May 77/10
🧠Researchers propose a novel framework that models language model memory as a Markov transition matrix, enabling efficient incorporation of new knowledge without catastrophic forgetting. The approach requires only linear sample complexity in the number of existing tokens and achieves zero forgetting through minimal parameter updates via an embedding-tuning algorithm.
AIBullisharXiv – CS AI · May 77/10
🧠Researchers propose skill neologisms—soft tokens added to LLM vocabularies—as a scalable approach to continual learning that enables models to acquire new capabilities without catastrophic forgetting or weight updates. The method demonstrates that independently trained skill tokens can compose zero-shot and work with out-of-distribution tasks, offering a practical alternative to fine-tuning.
AINeutralarXiv – CS AI · Apr 207/10
🧠Researchers identify that supervised fine-tuning of large language models increases hallucinations by degrading pre-existing knowledge through semantic interference. The study proposes self-distillation and parameter freezing techniques to mitigate this problem while preserving task performance.
AINeutralarXiv – CS AI · Apr 107/10
🧠Researchers introduce the Informational Buildup Framework (IBF), a new approach to continual learning that eliminates catastrophic forgetting by treating information as structural alignment rather than stored parameters. The framework demonstrates superior performance across multiple domains including chess and image classification, achieving near-zero forgetting without requiring raw data replay.
AINeutralarXiv – CS AI · Mar 267/10
🧠Researchers propose a method to identify 'self-awareness' in AI systems by analyzing invariant cognitive structures that remain stable during continual learning. Their study found that robots subjected to continual learning developed significantly more stable subnetworks compared to control groups, suggesting this could be evidence of an emergent 'self' concept.
AIBearisharXiv – CS AI · Mar 177/10
🧠Research reveals that fine-tuning aligned vision-language AI models on narrow harmful datasets causes severe safety degradation that generalizes across unrelated tasks. The study shows multimodal models exhibit 70% higher misalignment than text-only evaluation suggests, with even 10% harmful training data causing substantial alignment loss.
AIBullisharXiv – CS AI · Mar 57/10
🧠Researchers have developed a new framework for robotic agents that can adapt and learn continuously during operation, rather than being limited to fixed parameters from offline training. The system uses world model prediction residuals to detect unexpected events and automatically trigger self-improvement without external supervision.
AIBullisharXiv – CS AI · Mar 56/10
🧠Researchers discovered that pretrained Vision-Language-Action (VLA) models demonstrate remarkable resistance to catastrophic forgetting in continual learning scenarios, unlike smaller models trained from scratch. Simple Experience Replay techniques achieve near-zero forgetting with minimal replay data, suggesting large-scale pretraining fundamentally changes continual learning dynamics for robotics applications.
AIBullisharXiv – CS AI · Mar 46/103
🧠Researchers propose PDP, a new framework for Incremental Object Detection that addresses prompt degradation issues in AI models. The method achieves significant improvements of 9.2% AP on MS-COCO and 3.3% AP on PASCAL VOC benchmarks through dual-pool prompt decoupling and prototype-guided pseudo-label generation.
AIBullisharXiv – CS AI · Mar 37/103
🧠Researchers introduce PolySkill, a framework that enables AI agents to learn generalizable skills by separating abstract goals from concrete implementations, inspired by software engineering polymorphism. The method improves skill reuse by 1.7x and boosts success rates by up to 13.9% on web navigation tasks while reducing execution steps by over 20%.
AIBullisharXiv – CS AI · Mar 37/103
🧠Researchers introduce Dream2Learn (D2L), a continual learning framework that enables AI models to generate synthetic training data from their own internal representations, mimicking human dreaming for knowledge consolidation. The system creates novel 'dreamed classes' using diffusion models to improve forward knowledge transfer and prevent catastrophic forgetting in neural networks.
AINeutralarXiv – CS AI · Mar 37/104
🧠Researchers have identified the mathematical mechanisms behind 'loss of plasticity' (LoP), explaining why deep learning models struggle to continue learning in changing environments. The study reveals that properties promoting generalization in static settings actually hinder continual learning by creating parameter space traps.
AIBullisharXiv – CS AI · Feb 277/106
🧠Researchers introduce GraftLLM, a new method for transferring knowledge between large language models using 'SkillPack' format that preserves capabilities while avoiding catastrophic forgetting. The approach enables efficient model fusion and continual learning for heterogeneous models through modular knowledge storage.
AIBullishIEEE Spectrum – AI · Feb 97/105
🧠Researchers at UC San Diego developed a new type of bulk resistive RAM (RRAM) that overcomes traditional limitations by switching entire layers rather than forming filaments. The technology achieved 90% accuracy in AI learning tasks and could enable more efficient edge computing by allowing computation within memory itself.
AINeutralarXiv – CS AI · 2d ago6/10
🧠TIMEGATE is a new policy framework that optimizes machine learning system adaptation by intelligently managing computational budgets across training, labeling, and evaluation cycles. The research demonstrates 2.3x efficiency gains in labeling versus training and achieves 66% evaluation-compute savings without compromising model accuracy, with validated results across tabular data and large language models like LLaMA-3.1-8B.
AINeutralarXiv – CS AI · 2d ago6/10
🧠Researchers introduce XXLTraffic and EvoXXLTraffic, new datasets spanning 27 years of California and Australian traffic sensor data that account for real-world network growth. Unlike existing benchmarks assuming fixed sensor networks, these datasets expose the challenge of forecasting across dynamically evolving road infrastructure with sensor growth rates exceeding 10,000%, and reveal that current state-of-the-art models fail to generalize under such conditions.
AINeutralarXiv – CS AI · 3d ago6/10
🧠Researchers introduce PEAM, a parametric memory framework for AI agents in Minecraft that consolidates learned skills directly into model parameters rather than relying on retrieval-based memory. The system uses a mixture-of-experts architecture with contrastive learning to internalize both successful and failed experiences, achieving better long-horizon task performance while avoiding catastrophic forgetting.
AINeutralarXiv – CS AI · 3d ago6/10
🧠Researchers demonstrate that reinforcement learning can synthesize novel compositional reasoning skills, but only when models first master independent atomic skills through supervised fine-tuning. Using a controlled synthetic dataset, they show SFT alone produces memorization without generalization, while RL bridges the gap to genuine skill integration when prerequisites are met.