25 articles tagged with #few-shot-learning. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.
AI · Bullish · arXiv – CS AI · Mar 11 · 7/10
🧠Researchers introduce BiCLIP, a new framework that improves vision-language models' ability to adapt to specialized domains through geometric transformations. The approach achieves state-of-the-art results across 11 benchmarks while maintaining simplicity and low computational requirements.
AI · Bullish · arXiv – CS AI · Mar 5 · 6/10
🧠Researchers introduce TATRA, a training-free prompting method for Large Language Models that creates instance-specific few-shot prompts without requiring labeled training data. The method achieves state-of-the-art performance on mathematical reasoning benchmarks like GSM8K and DeepMath, matching or outperforming existing prompt optimization methods that rely on expensive training processes.
AI · Bullish · arXiv – CS AI · Mar 5 · 6/10
🧠Researchers introduce RDB-PFN, the first relational foundation model for databases trained entirely on synthetic data to overcome privacy and scarcity issues with real relational databases. The model uses a Relational Prior Generator to create over 2 million synthetic tasks and demonstrates strong few-shot performance on 19 real-world relational prediction tasks through in-context learning.
AI · Bullish · arXiv – CS AI · Mar 5 · 7/10
🧠Researchers introduce SPRINT, the first Few-Shot Class-Incremental Learning (FSCIL) framework designed specifically for tabular data domains like cybersecurity and healthcare. The system achieves 77.37% accuracy in 5-shot learning scenarios, outperforming existing methods by 4.45% through novel semi-supervised techniques that leverage unlabeled data and confidence-based pseudo-labeling.
AI · Bullish · arXiv – CS AI · Mar 5 · 7/10
🧠Researchers propose Supervised Calibration (SC), a new framework to improve In-Context Learning performance in Large Language Models by addressing systematic biases through optimal affine transformations in logit space. The method achieves state-of-the-art results across multiple LLMs including Mistral-7B, Llama-2-7B, and Qwen2-7B in few-shot learning scenarios.
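The calibration idea is easy to sketch: the model's few-shot logits are passed through a learned affine map before the final prediction. Below is a minimal numpy illustration on synthetic two-class logits with an artificial class-0 bias; the data, learning rate, and step count are illustrative assumptions, not the paper's setup:

```python
import numpy as np

def softmax(z):
    e = np.exp(z - z.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

# synthetic "LLM logits" for a 2-class task with a systematic bias toward class 0
rng = np.random.default_rng(0)
n = 40
labels = rng.integers(0, 2, size=n)
logits = rng.normal(size=(n, 2))
logits[np.arange(n), labels] += 1.5   # class signal
logits[:, 0] += 2.0                   # systematic bias the calibrator should remove

# fit an affine map z -> z @ W.T + b by gradient descent on cross-entropy
W, b, lr = np.eye(2), np.zeros(2), 0.2
for _ in range(500):
    p = softmax(logits @ W.T + b)
    g = p.copy()
    g[np.arange(n), labels] -= 1.0    # dCE/d(calibrated logits)
    W -= lr * g.T @ logits / n
    b -= lr * g.mean(axis=0)

acc_raw = (logits.argmax(1) == labels).mean()
acc_cal = ((logits @ W.T + b).argmax(1) == labels).mean()
print(f"raw accuracy {acc_raw:.2f}, calibrated accuracy {acc_cal:.2f}")
```

Because the bias term can absorb the systematic offset, the calibrated logits recover accuracy that the raw argmax loses.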
AI · Bullish · OpenAI News · Nov 7 · 7/10
🧠Researchers developed an energy-based AI model that can learn spatial concepts like 'near' and 'above' from just five demonstrations using 2D point sets. The model demonstrates cross-domain transfer capabilities, applying concepts learned in 2D particle environments to solve 3D physics-based robotics tasks.
AI · Neutral · arXiv – CS AI · 3d ago · 6/10
🧠Researchers introduce Legal2LogicICL, an LLM-based framework that improves the conversion of natural-language legal cases into logical formulas through retrieval-augmented few-shot learning. The method addresses data scarcity in legal AI systems and introduces a new annotated dataset (Legal2Proleg) to advance interpretable legal reasoning without requiring model fine-tuning.
AI · Neutral · arXiv – CS AI · 3d ago · 6/10
🧠Researchers present the first comprehensive survey of inductive reasoning in large language models, categorizing improvement methods into post-training, test-time scaling, and data augmentation approaches. The survey establishes unified benchmarks and evaluation metrics for assessing how LLMs perform particular-to-general reasoning tasks that better align with human cognition.
AI · Neutral · arXiv – CS AI · 4d ago · 6/10
🧠Researchers propose GNN-as-Judge, a framework combining Large Language Models with Graph Neural Networks to improve learning on text-attributed graphs in low-resource settings. The approach uses collaborative pseudo-labeling and weakly-supervised fine-tuning to generate reliable labels while reducing noise, demonstrating significant performance gains when labeled data is scarce.
AI · Bullish · arXiv – CS AI · Mar 26 · 6/10
🧠Researchers introduce Uni-DAD, a unified approach that combines diffusion model distillation and adaptation into a single pipeline for efficient few-shot image generation. The method matches the quality of state-of-the-art methods while requiring fewer than four sampling steps, addressing the computational cost of traditional diffusion models.
AI · Bullish · arXiv – CS AI · Mar 4 · 5/10
🧠Researchers propose Q-LoRA, a quantum-enhanced fine-tuning method that integrates quantum neural networks into LoRA adapters for improved AI-generated content detection. The study also introduces H-LoRA, a classical variant using Hilbert transforms that achieves similar 5%+ accuracy improvements over standard LoRA at lower computational cost.
AI · Bullish · arXiv – CS AI · Mar 3 · 6/10
🧠Researchers developed a meta-learning approach for Large Multimodal Models (LMMs) that uses distilled soft prompts to improve few-shot visual question answering performance. The method outperformed traditional in-context learning by 21.2% and parameter-efficient finetuning by 7.7% on VQA tasks.
AI · Bullish · Google Research Blog · Sep 23 · 6/10
🧠The article discusses advances in time-series foundation models and their few-shot learning capabilities in generative AI applications. These models can learn patterns from limited data samples, potentially improving forecasting and prediction tasks across many domains.
AI · Bullish · Hugging Face Blog · Sep 26 · 6/10
🧠SetFit is a new machine learning framework that enables efficient few-shot learning without requiring prompts. This approach could significantly reduce the computational resources and data requirements for training AI models in various applications.
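The core trick behind SetFit is expanding a handful of labeled sentences into many contrastive sentence pairs before fine-tuning a sentence embedder. A sketch of just that pair-generation step in plain Python (the example sentences are made up, and the real library additionally fine-tunes the embedding model and trains a classification head):

```python
from itertools import combinations

def setfit_pairs(examples):
    """Expand few labeled sentences into (sent_a, sent_b, same_label)
    contrastive pairs -- the pair-expansion step that lets SetFit
    extract many training signals from very few labels."""
    pairs = []
    for (sent_a, label_a), (sent_b, label_b) in combinations(examples, 2):
        pairs.append((sent_a, sent_b, 1.0 if label_a == label_b else 0.0))
    return pairs

few_shot = [
    ("great movie", "pos"), ("loved it", "pos"),
    ("terrible plot", "neg"), ("waste of time", "neg"),
]
pairs = setfit_pairs(few_shot)
print(len(pairs))  # C(4, 2) = 6 pairs from only 4 labeled sentences
```

Quadratic pair growth is why a few dozen labels can be enough: n examples yield n·(n−1)/2 contrastive training pairs.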
AI · Bullish · Lil'Log (Lilian Weng) · Jun 23 · 6/10
🧠Meta reinforcement learning enables AI agents to rapidly adapt to new tasks by learning from a distribution of training tasks. The approach allows agents to develop new RL algorithms through internal activity dynamics, focusing on fast and efficient problem-solving for unseen scenarios.
AI · Neutral · arXiv – CS AI · Mar 5 · 4/10
🧠Researchers propose directional CDNV (decision-axis variance) as a key geometric quantity explaining why self-supervised learning representations transfer well with few labels. The study shows that small variability along class-separating directions enables strong few-shot transfer and low interference across multiple tasks.
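One plausible reading of the quantity in code: project each class's within-class deviations onto the direction between the two class means, then normalize by the squared distance between the means. This is a hedged numpy sketch; the paper's exact definition of directional CDNV may differ:

```python
import numpy as np

def directional_cdnv(x1, x2):
    """Within-class variance measured along the direction separating the
    two class means, normalized by the squared mean distance (sketch)."""
    mu1, mu2 = x1.mean(0), x2.mean(0)
    d = mu2 - mu1
    u = d / np.linalg.norm(d)            # class-separating direction
    v1 = ((x1 - mu1) @ u).var()          # class-1 spread along that axis
    v2 = ((x2 - mu2) @ u).var()          # class-2 spread along that axis
    return (v1 + v2) / (2.0 * d @ d)

rng = np.random.default_rng(0)
a = rng.normal(0.0, 0.2, size=(100, 8))                 # tight class 1
b = rng.normal(0.0, 0.2, size=(100, 8)); b[:, 0] += 3.0 # tight, well separated
sep = directional_cdnv(a, b)

b2 = rng.normal(0.0, 1.5, size=(100, 8)); b2[:, 0] += 3.0  # diffuse class 2
overlap = directional_cdnv(a, b2)
print(sep, overlap)
```

Tight, well-separated clusters give a small value; spread along the separating axis drives it up, matching the paper's claim that small directional variability predicts strong few-shot transfer.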
AI · Neutral · arXiv – CS AI · Mar 4 · 4/10
🧠Researchers propose a Label-guided Distance Scaling (LDS) strategy to improve few-shot text classification by leveraging label semantics during both training and testing phases. The method addresses misclassification issues when randomly selected labeled samples don't provide effective supervision signals, demonstrating significant performance improvements over state-of-the-art models.
AI · Neutral · arXiv – CS AI · Mar 3 · 4/10
🧠Researchers have created CrimeNER, a specialized dataset of over 1,500 annotated crime-related documents for training named-entity recognition AI models. The study addresses the lack of quality training data in the crime domain by developing a database from terrorist attack reports and DOJ press notes, defining 22 types of crime-related entities.
AI · Neutral · arXiv – CS AI · Mar 3 · 4/10
🧠Researchers propose a Manifold Residual (MR) block to address overfitting in few-shot Whole Slide Image classification by preserving the low-dimensional manifold geometry of pathology foundation model features. The geometry-aware approach achieves state-of-the-art results with fewer parameters by using a fixed random matrix as geometric anchor and a trainable low-rank residual pathway.
AI · Neutral · arXiv – CS AI · Mar 3 · 4/10
🧠MAGIC is a new AI framework for few-shot anomaly detection in industrial quality control that uses mask-guided inpainting to generate high-fidelity synthetic anomalies. The system introduces three key innovations: Gaussian prompt perturbation, spatially adaptive guidance, and context-aware mask alignment to improve anomaly generation while preserving normal regions.
AI · Neutral · Google Research Blog · Oct 20 · 4/10
🧠Google's Gemini AI is being trained to identify exploding stars (supernovas) using few-shot learning techniques. This demonstrates AI's capability to recognize rare astronomical phenomena with minimal training examples.
AI · Neutral · Hugging Face Blog · Jun 3 · 4/10
🧠The article appears to discuss GPT-Neo and Hugging Face's Accelerated Inference API in the context of few-shot learning applications. However, the article body content is empty, preventing detailed analysis of the technical implementation or market implications.
AI · Bullish · arXiv – CS AI · Mar 3 · 4/10
🧠Researchers introduce MAML-KT, a meta-learning approach that addresses the cold start problem in knowledge tracing systems when predicting performance of new students with limited interaction data. The model uses few-shot learning to rapidly adapt to unseen students, achieving higher early accuracy than existing knowledge tracing models across multiple datasets.
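The adaptation loop can be sketched as first-order MAML on toy linear-regression "students": meta-learn an initialization across tasks, then take one gradient step on a few support examples from an unseen task. Everything below (the task distribution, learning rates, and `few_shot_loss`) is an illustrative assumption, not the paper's model:

```python
import numpy as np

rng = np.random.default_rng(0)
dim = 5
center = np.array([1.0, -2.0, 0.5, 3.0, -1.0])  # shared structure across students

def sample_task():
    # each task (one "student") is a linear map y = x @ w, drawn near the center
    return center + rng.normal(scale=0.1, size=dim)

def grad(theta, X, y):
    # gradient of mean-squared error for a linear model
    return 2.0 * X.T @ (X @ theta - y) / len(y)

inner_lr, outer_lr = 0.05, 0.05
theta = np.zeros(dim)                                 # meta-learned initialization
for _ in range(300):
    w = sample_task()
    Xs = rng.normal(size=(5, dim)); ys = Xs @ w       # support set (few-shot)
    Xq = rng.normal(size=(20, dim)); yq = Xq @ w      # query set
    adapted = theta - inner_lr * grad(theta, Xs, ys)  # inner adaptation step
    theta = theta - outer_lr * grad(adapted, Xq, yq)  # first-order outer step

# unseen task: one inner step from the meta-init vs. from a zero init
w_new = sample_task()
Xs = rng.normal(size=(5, dim)); ys = Xs @ w_new
Xq = rng.normal(size=(50, dim)); yq = Xq @ w_new

def few_shot_loss(init):
    adapted = init - inner_lr * grad(init, Xs, ys)
    return float(((Xq @ adapted - yq) ** 2).mean())

meta_loss = few_shot_loss(theta)
scratch_loss = few_shot_loss(np.zeros(dim))
print(meta_loss, scratch_loss)
```

Because the meta-initialization sits near the shared task structure, a single gradient step on five support points already fits a new task far better than the same step from scratch, which is the "higher early accuracy" property the summary describes.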
AI · Neutral · Hugging Face Blog · Dec 6 · 3/10
🧠The article appears to discuss SetFitABSA, a methodology for performing aspect-based sentiment analysis using SetFit with minimal training examples. However, the article body is empty, making it impossible to provide meaningful analysis of the content or implications.
AI · Neutral · OpenAI News · May 28 · 1/10
🧠The article title references few-shot learning capabilities in language models, but no article body content was provided, so no meaningful analysis can be performed.