AIBullisharXiv – CS AI · Mar 117/10
🧠Researchers introduce ACTIVEULTRAFEEDBACK, an active learning pipeline that reduces the cost of training Large Language Models by using uncertainty estimates to identify the most informative responses for annotation. The system achieves comparable performance using only one-sixth of the annotated data compared to static baselines, potentially making LLM training more accessible for low-resource domains.
🏢 Hugging Face
AINeutralarXiv – CS AI · 4d ago5/10
🧠Researchers propose Bayesian Selective Latent Inference (BSLI), a machine learning method that uses wastewater surveillance data to monitor influenza spread in communities before clinical cases are reported. The system intelligently decides whether additional data sources are needed or if abstention is appropriate, improving disease monitoring accuracy while managing data acquisition costs.
AINeutralarXiv – CS AI · 4d ago6/10
🧠Researchers propose an active learning framework that combines foundation model priors with smaller models to address class imbalance and label noise in real-world datasets. The method achieves over 50% annotation savings compared to existing active learning baselines while maintaining model performance across image and text domains.
AIBullisharXiv – CS AI · 4d ago6/10
🧠CLASP is a modular robotic system that combines task-parameterized learning with vision-language models to enable robots to understand natural language commands while maintaining data efficiency. The approach achieves 73-100% success rates on manipulation tasks by learning skills from minimal demonstrations and composing them dynamically without fine-tuning the underlying models.
AINeutralarXiv – CS AI · 4d ago6/10
🧠Researchers propose Deep Active Re-Labeling (DARL), a framework addressing human annotation errors in deep active learning by allocating budget to re-annotate potentially mislabeled data. The method uses noise detection strategies to identify suspect instances, improving data quality and model performance under annotation noise.
AINeutralarXiv – CS AI · Jun 26/10
🧠Researchers demonstrate a physics-informed machine learning framework called PALTO for optimizing GaN tri-gate FinFET designs in power delivery systems, achieving 2× better performance than industrial benchmarks through intelligent exploration of device parameters. The approach addresses computational limitations of traditional TCAD simulations while enabling discovery of optimal gate-to-drain configurations and channel thickness ratios.
AINeutralarXiv – CS AI · Jun 26/10
🧠Researchers introduce Learning-to-Measure (L2M), a meta-learning framework that enables AI systems to learn optimal feature acquisition strategies across multiple tasks without task-specific retraining. The approach combines uncertainty quantification with a greedy acquisition agent, demonstrating superior performance on tabular datasets with missing features and limited labels.
AINeutralarXiv – CS AI · Jun 16/10
🧠Researchers introduce an active learning framework for inferring continuous probability distributions from sparse data snapshots, addressing a key challenge in fields like single-cell biology where data collection is destructive and expensive. The method uses Linearized Optimal Transport to map probability distributions into a space suitable for Gaussian Process modeling, enabling uncertainty-guided selection of optimal measurement times.
AINeutralarXiv – CS AI · Jun 16/10
🧠Researchers introduce PInVerify, an offline benchmark for training embodied AI agents to verify whether objects match fine-grained descriptions through active viewpoint selection. The benchmark includes 3,000 episodes across 18 object categories and evaluates multimodal language models at on-device scale, with best results reaching 85.6% accuracy using fine-tuned approaches.
AINeutralarXiv – CS AI · Jun 16/10
🧠Researchers introduce PROWL, an adversarial training framework that improves world model robustness by actively discovering failure modes rather than passively learning from demonstration data. The approach uses a KL-constrained policy to expose high-error trajectories in diffusion-based video models while maintaining behavioral constraints, with a prioritized buffer that focuses training on unresolved weaknesses. Results demonstrate significant improvements in handling rare, interaction-critical transitions critical for downstream planning and policy performance.
AINeutralarXiv – CS AI · May 276/10
🧠Researchers introduce SILO, a self-improvement imitation framework for protein design that optimizes protein sequences under limited evaluation budgets. The method combines hierarchical editing, stochastic beam search, and active learning to outperform existing reinforcement learning and generative approaches across multiple protein fitness landscapes.
AINeutralarXiv – CS AI · May 276/10
🧠Researchers propose a geospatial discovery framework combining active learning, online meta-learning, and concept-guided reasoning to efficiently identify contamination hotspots like PFAS under limited sampling budgets. The approach uses concept relevance to guide uncertainty sampling and improve generalization in dynamic environmental monitoring scenarios.
AINeutralarXiv – CS AI · May 116/10
🧠Researchers introduce the Hidden Utility Bandit (HUB) framework to address a critical limitation in reward learning systems: their reliance on feedback from a single idealized teacher. The framework models teacher heterogeneity in rationality, expertise, and cost, enabling Active Teacher Selection (ATS) algorithms that strategically choose which teachers to query, demonstrating superior performance in paper recommendation and vaccine testing applications.
AIBullisharXiv – CS AI · May 96/10
🧠Researchers introduced BALAR, a Bayesian algorithm that enables large language models to engage in structured multi-turn dialogue by actively reasoning about missing information and strategically asking clarifying questions. The system demonstrated significant performance improvements across three diverse benchmarks—14.6% to 38.5% higher accuracy—without requiring fine-tuning, suggesting a more principled approach to interactive AI reasoning.
AINeutralarXiv – CS AI · May 96/10
🧠Researchers propose an active learning framework for optimizing communication structures in multi-agent systems powered by large language models, using ensemble-based task selection to identify the most informative training tasks while reducing token consumption and computational costs.
AINeutralarXiv – CS AI · Apr 66/10
🧠Research from arXiv shows that Active Preference Learning (APL) provides minimal improvements over random sampling in training modern LLMs through Direct Preference Optimization. The study found that random sampling performs nearly as well as sophisticated active selection methods while being computationally cheaper and avoiding capability degradation.
AINeutralarXiv – CS AI · Mar 174/10
🧠Researchers introduce EAGLE, a new framework for explaining black-box machine learning models using information-theoretic active learning to select optimal data perturbations. The method produces feature importance scores with uncertainty estimates and demonstrates improved explanation reproducibility and stability compared to existing approaches like LIME.
AIBullisharXiv – CS AI · Mar 175/10
🧠Researchers introduce IDALC, a semi-supervised framework for voice-controlled dialog systems that improves intent detection and reduces manual annotation costs. The system achieves 5-10% higher accuracy and 4-8% better macro-F1 scores while requiring annotation of only 6-10% of unlabeled data.
AINeutralarXiv – CS AI · Mar 165/10
🧠Researchers introduce BoSS (Best-of-Strategies Selector), a new oracle strategy for active learning that outperforms existing methods by using an ensemble approach to select optimal data annotation batches. The study reveals that current state-of-the-art active learning strategies still significantly underperform compared to oracle performance, particularly on large-scale datasets.
AINeutralarXiv – CS AI · Mar 24/107
🧠Researchers propose LEMP4HG, a new language model-enhanced approach for improving graph neural networks on heterophilic graphs where connected nodes have different characteristics. The method leverages language models to better understand semantic relationships between text-attributed nodes, outperforming existing methods while maintaining efficiency through selective message enhancement.