y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#multi-task-learning News & Analysis

30 articles tagged with #multi-task-learning. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

30 articles
AIBullisharXiv – CS AI · 2d ago7/10
🧠

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

Alibaba's Qwen team released Qwen-VLA, a unified foundation model that combines vision, language, and action capabilities for robotics across multiple tasks and robot types. The model demonstrates strong performance on manipulation, navigation, and trajectory prediction benchmarks while generalizing well to out-of-distribution scenarios and real-world robot deployments.

AIBullisharXiv – CS AI · May 117/10
🧠

Flow-OPD: On-Policy Distillation for Flow Matching Models

Researchers introduce Flow-OPD, a post-training framework that applies on-policy distillation to Flow Matching text-to-image models, addressing reward sparsity and gradient interference problems. Built on Stable Diffusion 3.5 Medium, the method achieves significant performance gains—GenEval scores improve from 63 to 92 and OCR accuracy from 59 to 94—while maintaining image quality and surpassing individual teacher models.

🧠 Stable Diffusion
AIBullisharXiv – CS AI · May 117/10
🧠

Goal-Conditioned Decision Transformer for Multi-Goal Offline Reinforcement Learning

Researchers introduce a Goal-Conditioned Decision Transformer designed for offline reinforcement learning in robotics, enabling multi-goal task learning from pre-collected datasets. The method demonstrates superior performance compared to online baselines on complex robotic tasks while maintaining effectiveness in sparse-reward environments with limited expert data.

AIBullisharXiv – CS AI · May 97/10
🧠

LANTERN: LLM-Augmented Neurosymbolic Transfer with Experience-Gated Reasoning Networks

Researchers introduce LANTERN, a framework that uses large language models to automatically generate task descriptions and intelligently aggregate knowledge from multiple source tasks for reinforcement learning. The system achieves 40-60% improvements in sample efficiency by adaptively weighting source policies based on task similarity and managing teacher-student knowledge transfer through uncertainty-aware gating.

AIBullisharXiv – CS AI · May 97/10
🧠

Continually Evolving Skill Knowledge in Vision Language Action Model

Researchers introduce Stellar VLA, a continual learning framework for vision-language-action models that improves knowledge accumulation without adding network parameters. The approach uses knowledge-guided expert routing and hierarchical task structures, achieving strong performance on robotics benchmarks with minimal data replay and validated real-world transfer capabilities.

AIBullisharXiv – CS AI · Mar 67/10
🧠

KARL: Knowledge Agents via Reinforcement Learning

Researchers present KARL, a reinforcement learning system for training enterprise search agents that outperforms GPT 5.2 and Claude 4.6 on diverse search tasks. The system introduces KARLBench evaluation suite and demonstrates superior cost-quality trade-offs through multi-task training and synthetic data generation.

🧠 GPT-5🧠 Claude
AINeutralarXiv – CS AI · Mar 67/10
🧠

On Emergences of Non-Classical Statistical Characteristics in Classical Neural Networks

Researchers introduce Non-Classical Network (NCnet), a classical neural architecture that exhibits quantum-like statistical behaviors through gradient competitions between neurons. The study reveals that multi-task neural networks can develop non-local correlations without explicit communication, providing new insights into deep learning training dynamics.

AIBullisharXiv – CS AI · Mar 47/105
🧠

NeuroProlog: Multi-Task Fine-Tuning for Neurosymbolic Mathematical Reasoning via the Cocktail Effect

Researchers introduce NeuroProlog, a neurosymbolic framework that improves mathematical reasoning in Large Language Models by converting math problems into executable Prolog programs. The multi-task 'Cocktail' training approach shows significant accuracy improvements of 3-5% across different model sizes, with larger models demonstrating better error correction capabilities.

AIBullisharXiv – CS AI · Mar 47/103
🧠

Can Computational Reducibility Lead to Transferable Models for Graph Combinatorial Optimization?

Researchers developed a new neural solver model using GCON modules and energy-based loss functions that achieves state-of-the-art performance across multiple graph combinatorial optimization tasks. The study demonstrates effective transfer learning between related optimization problems through computational reducibility-informed pretraining strategies, representing progress toward foundational AI models for combinatorial optimization.

AIBullisharXiv – CS AI · Mar 37/103
🧠

MagicAgent: Towards Generalized Agent Planning

Researchers have developed MagicAgent, a series of foundation models designed for generalized AI agent planning that outperforms existing sub-100B models and even surpasses leading ultra-scale models like GPT-5.2. The models achieve superior performance through a novel synthetic data framework and two-stage training paradigm that addresses gradient interference in multi-task learning.

AIBullisharXiv – CS AI · Mar 37/103
🧠

AdaRank: Adaptive Rank Pruning for Enhanced Model Merging

Researchers introduce AdaRank, a new AI model merging framework that adaptively selects optimal singular directions from task vectors to combine multiple fine-tuned models. The technique addresses cross-task interference issues in existing SVD-based approaches by dynamically pruning problematic components during test-time, achieving state-of-the-art performance with nearly 1% gap from individual fine-tuned models.

AINeutralarXiv – CS AI · 4d ago6/10
🧠

MuNet: A Mutualistic Network for Joint 3D Human Mesh Recovery and 3D Clothed Human Reconstruction from Single Images

Researchers introduce MuNet, a unified deep learning framework that jointly optimizes 3D human mesh recovery and clothed human reconstruction from single images using graph convolutional networks. The approach leverages mutualistic feedback between the two tasks to achieve state-of-the-art results across six benchmark datasets, with code released for research purposes.

AINeutralarXiv – CS AI · May 126/10
🧠

Probing the Impact of Scale on Data-Efficient, Generalist Transformer World Models for Atari

Researchers demonstrate that transformer-based world models exhibit distinct scaling behaviors across Atari environments, with joint multi-task training stabilizing performance gains. The study reveals that individual environments respond differently to model scaling, but unified training across 26 Atari games ensures consistent improvements regardless of inherent task complexity.

AIBullisharXiv – CS AI · May 96/10
🧠

Decomposing the Basic Abilities of Large Language Models: Mitigating Cross-Task Interference in Multi-Task Instruct-Tuning

Researchers propose BADIT, a novel approach to improve large language model training by decomposing shared parameters into orthogonal basic abilities, mitigating the cross-task interference problem that degrades performance in multi-task instruction-tuning. The method outperforms existing solutions on the SuperNI benchmark across 6 LLMs by maintaining parameter orthogonality through spherical clustering during training.

AIBullisharXiv – CS AI · May 96/10
🧠

Schedule-and-Calibrate: Utility-Guided Multi-Task Reinforcement Learning for Code LLMs

Researchers introduce ASTOR, a multi-task reinforcement learning framework that trains a single code LLM across multiple coding tasks more efficiently than task-specific models. By dynamically prioritizing training data and adjusting optimization constraints based on task utility, ASTOR achieves 9.0-9.5% performance gains over specialized models and 7.5-12.8% improvements over existing multi-task approaches.

AIBullisharXiv – CS AI · Mar 176/10
🧠

Reason2Decide: Rationale-Driven Multi-Task Learning

Researchers introduce Reason2Decide, a two-stage training framework that improves clinical decision support systems by aligning AI explanations with predictions. The system achieves better performance than larger foundation models while using 40x smaller models, making clinical AI more accessible for resource-constrained deployments.

AIBullisharXiv – CS AI · Mar 126/10
🧠

One Model, Many Skills: Parameter-Efficient Fine-Tuning for Multitask Code Analysis

Researchers conducted the first comprehensive evaluation of parameter-efficient fine-tuning (PEFT) for multi-task code analysis, showing that a single PEFT module can match full fine-tuning performance while reducing computational costs by up to 85%. The study found that even 1B-parameter models with multi-task PEFT outperform large general-purpose LLMs like DeepSeek and CodeLlama on code analysis tasks.

AIBearisharXiv – CS AI · Mar 36/106
🧠

LangGap: Diagnosing and Closing the Language Gap in Vision-Language-Action Models

Researchers reveal that state-of-the-art Vision-Language-Action (VLA) models largely ignore language instructions despite achieving 95% success on standard benchmarks. The new LangGap benchmark exposes significant language understanding deficits, with targeted data augmentation only partially addressing the fundamental challenge of diverse instruction comprehension.

AIBullisharXiv – CS AI · Mar 36/104
🧠

Scalable Multi-Task Learning through Spiking Neural Networks with Adaptive Task-Switching Policy for Intelligent Autonomous Agents

Researchers have developed SwitchMT, a novel methodology using Spiking Neural Networks with adaptive task-switching for multi-task learning in autonomous agents. The approach addresses task interference issues and demonstrates competitive performance in multiple Atari games while maintaining low power consumption and network complexity.

AINeutralarXiv – CS AI · Mar 54/10
🧠

A benchmark for joint dialogue satisfaction, emotion recognition, and emotion state transition prediction

Researchers have created a new multi-task Chinese dialogue dataset that enables prediction of user satisfaction, emotion recognition, and emotional state transitions across multiple conversation turns. The dataset addresses limitations in existing Chinese resources and aims to improve understanding of how user emotions evolve during interactions to better predict satisfaction.

Page 1 of 2Next →