AIBullisharXiv – CS AI · May 127/10
🧠SkillEvolver introduces a meta-learning framework that automatically improves AI agent skills through iterative refinement based on real-world deployment failures, achieving 56.8% accuracy on benchmark tasks compared to 43.6% for manually curated skills. The system learns by modifying skill prose and code rather than model weights, enabling seamless integration with any compatible agent without retraining.
AIBullisharXiv – CS AI · May 117/10
🧠Researchers propose a gradient-based bilevel optimization method that automatically learns composite loss weights during pretraining by aligning gradients with downstream objectives. The approach reduces hyperparameter tuning overhead to ~30% above baseline training cost while matching or exceeding manually tuned baselines across event-sequence and computer vision tasks.
AIBullisharXiv – CS AI · Mar 177/10
🧠Researchers introduce POLCA (Prioritized Optimization with Local Contextual Aggregation), a new framework that uses large language models as optimizers for complex systems like AI agents and code generation. The method addresses stochastic optimization challenges through priority queuing and meta-learning, demonstrating superior performance across multiple benchmarks including agent optimization and CUDA kernel generation.
AIBullisharXiv – CS AI · Mar 167/10
🧠Researchers introduce the Darwin Gödel Machine (DGM), a self-improving AI system that can iteratively modify its own code and validate changes through benchmarks. The system demonstrated significant performance improvements, increasing coding capabilities from 20.0% to 50.0% on SWE-bench and from 14.2% to 30.7% on Polyglot benchmarks.
AIBullisharXiv – CS AI · Mar 56/10
🧠Researchers introduce MASS, a meta-learning framework that enables large language models to self-adapt at test time by generating synthetic training data and performing targeted self-updates. The system uses bilevel optimization to meta-learn data-attribution signals and optimize synthetic data through scalable meta-gradients, showing effectiveness in mathematical reasoning tasks.
AIBullisharXiv – CS AI · Mar 37/103
🧠Researchers introduce VITA, a zero-shot value function learning method that enhances Vision-Language Models through test-time adaptation for robotic manipulation tasks. The system updates parameters sequentially over trajectories to improve temporal reasoning and generalizes across diverse environments, outperforming existing autoregressive VLM methods.
AINeutralarXiv – CS AI · Feb 277/107
🧠Researchers developed Compositional-ARC, a dataset to test AI models' ability to systematically generalize abstract spatial reasoning tasks. A small 5.7M parameter transformer model trained with meta-learning outperformed large language models like GPT-4o and Gemini 2.0 Flash on novel geometric transformation combinations.
AIBullisharXiv – CS AI · Feb 277/106
🧠Researchers have developed DAIL (Discovered Adversarial Imitation Learning), the first meta-learned AI algorithm that uses LLM-guided evolutionary methods to automatically discover reward assignment functions for training AI agents. This breakthrough addresses stability issues in adversarial imitation learning and demonstrates superior performance compared to human-designed approaches across different environments.
AINeutralarXiv – CS AI · 4d ago6/10
🧠Researchers introduce MetaSICL, a post-training method that enhances auditory large language models' ability to learn from in-context demonstrations without fine-tuning. The approach uses high-resource speech data to improve performance on low-resource tasks, outperforming traditional fine-tuning methods when labeled data is scarce or domain-mismatched.
AINeutralarXiv – CS AI · 4d ago6/10
🧠Researchers propose a geospatial discovery framework combining active learning, online meta-learning, and concept-guided reasoning to efficiently identify contamination hotspots like PFAS under limited sampling budgets. The approach uses concept relevance to guide uncertainty sampling and improve generalization in dynamic environmental monitoring scenarios.
AINeutralarXiv – CS AI · May 126/10
🧠Researchers introduce NoiseRater, a meta-learning framework that assigns importance scores to noise samples during diffusion model training, moving beyond the assumption that all injected noise is equally valuable. By prioritizing informative noise through adaptive reweighting, the approach demonstrates improved training efficiency and generation quality on benchmark datasets like FFHQ and ImageNet.
AIBullisharXiv – CS AI · May 116/10
🧠Researchers introduce HARMONY, a hybrid split federated learning framework that enables heterogeneous mobile devices to perform personalized on-device inference while maintaining a generalized server backend for fallback support. By using meta-learning and server-side contrastive learning, HARMONY addresses the representation skew problem that occurs when diverse device architectures extract features incompatibly, achieving up to 43% accuracy improvements without compromising privacy or increasing latency.
AINeutralarXiv – CS AI · May 96/10
🧠Researchers introduce GLiBRL, a novel deep Bayesian reinforcement learning method that combines generalized linear models with learnable basis functions to improve task generalization. The approach achieves fully tractable Bayesian inference over task parameters and demonstrates up to 1.8x performance improvements over existing meta-RL methods on benchmark tasks.
AIBullisharXiv – CS AI · Apr 206/10
🧠Researchers propose FSPO (Few-Shot Preference Optimization), a meta-learning algorithm that personalizes large language models using minimal user preference data. The approach uses synthetically generated preferences to train models that can quickly adapt to individual user preferences, achieving 87% performance on synthetic users and 70% on real human users in evaluation tasks.
AIBullisharXiv – CS AI · Mar 166/10
🧠Researchers propose MetaKE, a new framework for knowledge editing in Large Language Models that addresses the 'Semantic-Execution Disconnect' through bi-level optimization. The method treats edit targets as learnable parameters and uses a Structural Gradient Proxy to align edits with the model's feasible manifold, showing significant improvements over existing approaches.
AIBullisharXiv – CS AI · Mar 36/103
🧠Researchers developed a meta-learning approach for Large Multimodal Models (LMMs) that uses distilled soft prompts to improve few-shot visual question answering performance. The method outperformed traditional in-context learning by 21.2% and parameter-efficient finetuning by 7.7% on VQA tasks.
AIBullisharXiv – CS AI · Mar 27/1014
🧠Researchers propose MetaAPO, a new framework for aligning large language models with human preferences that dynamically balances online and offline training data. The method uses a meta-learner to evaluate when on-policy sampling is beneficial, resulting in better performance while reducing online annotation costs by 42%.
AINeutralarXiv – CS AI · Mar 27/1017
🧠Researchers reveal that Test-Time Training (TTT) with KV binding, previously understood as online meta-learning for memorization, can actually be reformulated as a learned linear attention operator. This new perspective explains previously puzzling behaviors and enables architectural simplifications and efficiency improvements.
AIBullishLil'Log (Lilian Weng) · Jun 236/10
🧠Meta reinforcement learning enables AI agents to rapidly adapt to new tasks by learning from a distribution of training tasks. The approach allows agents to develop new RL algorithms through internal activity dynamics, focusing on fast and efficient problem-solving for unseen scenarios.
AIBullishOpenAI News · Nov 96/107
🧠The article presents RL², a meta-learning approach that uses slow reinforcement learning to enable fast adaptation to new tasks. This method allows AI agents to quickly learn new behaviors by leveraging prior training experience across multiple related tasks.
AINeutralarXiv – CS AI · Mar 44/102
🧠Researchers propose a Label-guided Distance Scaling (LDS) strategy to improve few-shot text classification by leveraging label semantics during both training and testing phases. The method addresses misclassification issues when randomly selected labeled samples don't provide effective supervision signals, demonstrating significant performance improvements over state-of-the-art models.
AINeutralOpenAI News · Mar 74/105
🧠Researchers have developed Reptile, a new meta-learning algorithm that improves machine learning efficiency by repeatedly sampling tasks and updating parameters through stochastic gradient descent. The algorithm is mathematically similar to first-order MAML but requires only black-box access to optimizers like SGD or Adam while maintaining similar performance and computational efficiency.
AINeutralOpenAI News · Oct 114/105
🧠Researchers demonstrate that meta-learning agents in simulated robot wrestling can quickly learn to defeat stronger non-meta-learning opponents. The study also shows these agents can adapt to physical malfunctions, highlighting the potential for AI systems to rapidly adjust strategies and overcome challenges.
AIBullisharXiv – CS AI · Mar 34/103
🧠Researchers introduce MAML-KT, a meta-learning approach that addresses the cold start problem in knowledge tracing systems when predicting performance of new students with limited interaction data. The model uses few-shot learning to rapidly adapt to unseen students, achieving higher early accuracy than existing knowledge tracing models across multiple datasets.
AINeutralarXiv – CS AI · Mar 34/105
🧠Researchers developed a new Meta-Reinforcement Learning approach that uses geometric symmetries in task spaces to enable broader generalization beyond local smoothness assumptions. The method converts Meta-RL into symmetry discovery rather than smooth extrapolation, allowing agents to generalize across wider regions of task space with improved sample efficiency.
$NEAR