y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#few-shot-learning News & Analysis

44 articles tagged with #few-shot-learning. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

44 articles
AIBullisharXiv – CS AI · 16h ago7/10
🧠

Graph Machine Learning in the Era of Large Language Models (LLMs)

A comprehensive survey examines the convergence of Graph Machine Learning and Large Language Models, exploring how LLMs can enhance graph neural networks while graphs provide factual knowledge to improve LLM reasoning and reduce hallucinations. This bidirectional relationship addresses key challenges in both domains, including data labeling, heterophily, and out-of-distribution generalization.

AIBullisharXiv – CS AI · May 127/10
🧠

MC-RFM: Geometry-Aware Few-Shot Adaptation via Mixed-Curvature Riemannian Flow Matching

Researchers introduce MC-RFM, a novel framework for efficiently adapting frozen vision models to new tasks using mixed-curvature Riemannian geometry. The method represents adapted features on a product manifold combining hyperbolic and Euclidean spaces, outperforming existing parameter-efficient adaptation techniques across multiple benchmarks and backbone architectures.

AIBullisharXiv – CS AI · May 77/10
🧠

Geometry over Density: Few-Shot Cross-Domain OOD Detection

Researchers introduce UFCOD, a novel framework that enables out-of-distribution detection across arbitrary domains using a single pre-trained diffusion model and minimal inference-time samples. The approach achieves 93.7% average AUROC on cross-domain benchmarks with approximately 500× better sample efficiency than existing methods, requiring only ~100 unlabeled samples rather than 50k-163k training samples.

AIBullisharXiv – CS AI · Mar 117/10
🧠

BiCLIP: Domain Canonicalization via Structured Geometric Transformation

Researchers introduce BiCLIP, a new framework that improves vision-language models' ability to adapt to specialized domains through geometric transformations. The approach achieves state-of-the-art results across 11 benchmarks while maintaining simplicity and low computational requirements.

AIBullisharXiv – CS AI · Mar 56/10
🧠

Relational In-Context Learning via Synthetic Pre-training with Structural Prior

Researchers introduce RDB-PFN, the first relational foundation model for databases trained entirely on synthetic data to overcome privacy and scarcity issues with real relational databases. The model uses a Relational Prior Generator to create over 2 million synthetic tasks and demonstrates strong few-shot performance on 19 real-world relational prediction tasks through in-context learning.

AIBullisharXiv – CS AI · Mar 57/10
🧠

Boosting In-Context Learning in LLMs Through the Lens of Classical Supervised Learning

Researchers propose Supervised Calibration (SC), a new framework to improve In-Context Learning performance in Large Language Models by addressing systematic biases through optimal affine transformations in logit space. The method achieves state-of-the-art results across multiple LLMs including Mistral-7B, Llama-2-7B, and Qwen2-7B in few-shot learning scenarios.

🧠 Llama
AIBullisharXiv – CS AI · Mar 57/10
🧠

SPRINT: Semi-supervised Prototypical Representation for Few-Shot Class-Incremental Tabular Learning

Researchers introduce SPRINT, the first Few-Shot Class-Incremental Learning (FSCIL) framework designed specifically for tabular data domains like cybersecurity and healthcare. The system achieves 77.37% accuracy in 5-shot learning scenarios, outperforming existing methods by 4.45% through novel semi-supervised techniques that leverage unlabeled data and confidence-based pseudo-labeling.

AIBullisharXiv – CS AI · Mar 56/10
🧠

TATRA: Training-Free Instance-Adaptive Prompting Through Rephrasing and Aggregation

Researchers introduce TATRA, a training-free prompting method for Large Language Models that creates instance-specific few-shot prompts without requiring labeled training data. The method achieves state-of-the-art performance on mathematical reasoning benchmarks like GSM8K and DeepMath, matching or outperforming existing prompt optimization methods that rely on expensive training processes.

AIBullishOpenAI News · Nov 77/107
🧠

Learning concepts with energy functions

Researchers developed an energy-based AI model that can learn spatial concepts like 'near' and 'above' from just five demonstrations using 2D point sets. The model demonstrates cross-domain transfer capabilities, applying concepts learned in 2D particle environments to solve 3D physics-based robotics tasks.

$NEAR
AINeutralarXiv – CS AI · 16h ago5/10
🧠

Envisioning Beyond the Few: Disentangled Semantics and Primitives for Few-Shot Atypical Layout-to-Image Generation

Researchers propose a novel framework for layout-to-image generation that improves visual quality in few-shot learning scenarios by disentangling semantic identity from visual details. The method uses semantic anchoring and primitive imbuing to address representation fragmentation, enabling more coherent image synthesis from sparse training data.

AINeutralarXiv – CS AI · 16h ago6/10
🧠

GaMi: Geometry-Agnostic Material Identification via Cross-Modal Subtractive Disentanglement

GaMi is a multimodal material identification system that combines mmWave and acoustic sensing to accurately identify materials regardless of geometric variations like shape, orientation, and distance. Using cross-modal subtractive disentanglement and contrastive learning, the system achieves 95.2% accuracy on 20 materials and demonstrates few-shot generalization across different devices.

AIBullisharXiv – CS AI · 16h ago6/10
🧠

PictSure: Pretraining Embeddings Matters for In-Context Learning Image Classifiers

PictSure introduces a vision-only in-context learning framework for few-shot image classification that demonstrates representation quality from pretraining is the critical bottleneck, not fusion-layer training diversity. The researchers release open-source models and an MCP server enabling few-shot image classification integration directly into LLM-based systems.

🏢 Hugging Face
AINeutralarXiv – CS AI · 3d ago6/10
🧠

Temporal Stability and Few-Shot Prompting in Math Task Assessment

A longitudinal study examined how AI models (Gemini and Coteach) perform on mathematics task classification using the Task Analysis Guide, testing stability across model versions and responsiveness to few-shot prompting. Results showed newer model versions produced mixed effects, but few-shot prompting consistently improved both models' accuracy, suggesting prompt engineering is more reliable than passive model updates for specialized educational tasks.

🧠 Gemini
AIBullisharXiv – CS AI · 3d ago6/10
🧠

GiPL: Generative augmented iterative Pseudo-Labeling for Cross-Domain Few-Shot Object Detection

Researchers propose GiPL, a two-branch machine learning framework that combines iterative pseudo-labeling with generative data augmentation to improve cross-domain few-shot object detection using vision-language models. The method demonstrates significant performance improvements on three benchmark datasets, addressing critical challenges in fine-tuning with limited target-domain samples.

AINeutralarXiv – CS AI · 4d ago6/10
🧠

Explaining is Harder Than Predicting Alone: Evaluating Concept-based Explanations of MLLMs as ICL Visual Classifiers

Researchers evaluated how multimodal large language models (MLLMs) explain their image classification decisions in few-shot learning scenarios. The study found that forcing models to generate formal, concept-based explanations actually reduces their predictive accuracy from 93.8% to 90.1%, suggesting that explicit reasoning doesn't universally improve performance despite being widely assumed to do so.

AINeutralarXiv – CS AI · 4d ago6/10
🧠

Detect by Yourself: Self-Designing Agentic Workflows for Few-Shot Graph Anomaly Detection

SignGAD introduces a novel framework for graph anomaly detection that dynamically designs task-specific workflows rather than relying on fixed detection pipelines. The approach combines self-designing agentic workflows with a guarded refit strategy to improve detection accuracy in few-shot learning scenarios, addressing longstanding limitations in identifying anomalous nodes within attributed graphs.

AINeutralarXiv – CS AI · 4d ago6/10
🧠

EigeNet: Geometry-Informed Multi-Modal Learning for Few-shot Novel View RIR Prediction

Researchers introduce EigeNet, a geometry-informed deep learning framework for predicting Room Impulse Response (RIR) in spatial audio from limited observations. The model combines transformer architecture with acoustic ray tracing principles to achieve state-of-the-art performance in few-shot novel view RIR prediction and demonstrates strong sim-to-real generalization capabilities.

AINeutralarXiv – CS AI · 4d ago6/10
🧠

Adapting, Fast and Slow: On Few-Shot Transportability of Compositions

Researchers present a framework for cross-domain generalization in machine learning that extends causal transportability theory to handle sequential prediction tasks. The work introduces module and circuit transportability, enabling models to compose learned mechanisms from source domains to make zero-shot predictions on target domains, with practical few-shot learning methods requiring minimal target domain data.

AIBullisharXiv – CS AI · 5d ago6/10
🧠

Few-shot Cross-country Generalization of Tabular Machine Learning and Foundation Models for Childhood Anemia Prediction under Distribution Shift

Researchers evaluated transformer-based foundation models against classical machine learning methods for predicting childhood anemia across 16 countries using DHS data. TabPFN, a tabular foundation model, demonstrated superior performance in low-data environments with better calibration metrics, suggesting foundation models offer practical advantages for global health prediction in resource-constrained settings.

AINeutralarXiv – CS AI · 5d ago6/10
🧠

Geometry-Aware Contrastive Learning for Few-Shot Automatic Modulation Recognition

Researchers propose Dynamic-Consistency Contrastive Learning (DyCo-CL), a machine learning framework that improves automatic modulation recognition in wireless signal processing by combining virtual adversarial augmentation with semantic consistency loss. The method achieves a 6.27% accuracy improvement in few-shot learning scenarios on standard benchmarks, addressing key challenges in self-supervised learning for signal classification.

AINeutralarXiv – CS AI · 5d ago6/10
🧠

Adaptive Multi-prompt Contrastive Network for Few-shot Out-of-distribution Detection

Researchers propose Adaptive Multi-prompt Contrastive Network (AMCN), a novel approach for few-shot out-of-distribution detection that requires only minimal labeled samples. The method leverages CLIP's vision-language capabilities with learnable textual prompts to distinguish between in-distribution and outlier samples, advancing practical AI safety applications.

AINeutralarXiv – CS AI · 5d ago6/10
🧠

MetaSICL: Adapting Audiroty LLM via Meta Speech In-Context Learning

Researchers introduce MetaSICL, a post-training method that enhances auditory large language models' ability to learn from in-context demonstrations without fine-tuning. The approach uses high-resource speech data to improve performance on low-resource tasks, outperforming traditional fine-tuning methods when labeled data is scarce or domain-mismatched.

AINeutralarXiv – CS AI · May 126/10
🧠

Empowering VLMs for Few-Shot Multimodal Time Series Classification via Tailored Agentic Reasoning

Researchers introduce MarsTSC, a novel framework combining Vision Language Models with agentic reasoning for few-shot multimodal time series classification. The system uses collaborative AI roles—Generator, Reflector, and Modifier—to iteratively refine knowledge and improve classification accuracy across 12 benchmarks while providing interpretable explanations.

AIBullisharXiv – CS AI · Apr 206/10
🧠

FSPO: Few-Shot Optimization of Synthetic Preferences Personalizes to Real Users

Researchers propose FSPO (Few-Shot Preference Optimization), a meta-learning algorithm that personalizes large language models using minimal user preference data. The approach uses synthetically generated preferences to train models that can quickly adapt to individual user preferences, achieving 87% performance on synthetic users and 70% on real human users in evaluation tasks.

Page 1 of 2Next →