#domain-adaptation News & Analysis

80 articles tagged with #domain-adaptation. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

80 articles

AIBullisharXiv – CS AI · Jun 257/10

🧠

Anatomically-conditioned Latent Diffusion Model for Data-Efficient Few-Shot Cross-Domain 3D Glioma MRI Synthesis

Researchers propose ALDM, an anatomically-conditioned latent diffusion model that synthesizes 3D brain MRI scans from limited data to improve glioma classification across medical imaging centers. The framework achieves superior synthetic image quality and clinical classification performance with only 16 target images, addressing a critical challenge in medical AI where domain shifts and data scarcity limit model generalization.

AIBullisharXiv – CS AI · Jun 237/10

🧠

Explore-Execute Chain: Towards an Efficient Structured Reasoning Paradigm

Researchers introduce Explore-Execute Chain (E²C), a structured reasoning framework that separates LLM planning from execution into distinct computational phases. The approach achieves 53.3% accuracy on AIME 2024 benchmarks with significantly fewer tokens than existing methods, while enabling efficient domain adaptation through exploration-focused fine-tuning.

AIBullisharXiv – CS AI · Jun 237/10

🧠

Reinforcement learning to improve large language model-based automated code compliance systems

Researchers introduce P4IR, a two-stage framework combining supervised fine-tuning and Group Relative Policy Optimization to improve LLM accuracy in automated building code compliance systems. The approach reduces errors by up to 38.6% compared to baseline models and outperforms leading LLMs like Claude and GPT in zero-shot settings.

🧠 GPT-5🧠 Claude🧠 Sonnet

AIBullisharXiv – CS AI · Jun 107/10

🧠

MMClima: A Framework for Multimodal Climate Science Data and Evaluation

Researchers introduce MMClima, a large-scale multimodal framework containing 104k+ expert-validated QA pairs for climate science across text, video, and figures. The project benchmarks state-of-the-art multimodal AI models and releases a fine-tuned baseline model, evaluation tools, and dataset to standardize climate science AI evaluation.

AIBullisharXiv – CS AI · Jun 97/10

🧠

Zero-Shot Learning in Industrial Scenarios: New Large-Scale Benchmark, Challenges and Baseline

Researchers introduce MMIO, a large-scale industrial dataset with 80K+ samples, and RTVP, a refined prompt method for zero-shot defect detection in manufacturing. The work addresses the gap between general-purpose Large Visual Language Models and industrial applications, achieving state-of-the-art performance through improved text-visual prompt interactions and domain adaptation.

AIBullisharXiv – CS AI · Jun 87/10

🧠

PandaAI: A Practical Agent CQ2 for Neuro-symbolic Data Analysis And Integrated Decision-Making in Quantitative Finance

Researchers introduce PandaAI, a neuro-symbolic AI agent combining Large Language Models with financial domain expertise to improve sequential decision-making in quantitative finance. The system demonstrates 18.2% higher Rank IC and 25.7% lower maximum drawdown than existing time-series models on Chinese stock data, addressing the challenge of applying deep learning to low signal-to-noise ratio financial markets.

AIBullisharXiv – CS AI · Jun 57/10

🧠

TAM: Torque Adaptation Module for Robust Motion Transfer in Manipulation

Researchers introduce Torque Adaptation Module (TAM), a learned module that adapts robot torque commands to compensate for dynamics differences across robot instances, payload variations, and sim-to-real gaps. TAM enables reusable policy adaptation without requiring robot-specific retraining or real-world data collection, demonstrating robust performance on dynamic manipulation tasks with a real Franka Panda robot.

AIBullisharXiv – CS AI · Jun 47/10

🧠

Who Needs Labels? Adapting Vision Foundation Models With the Metadata You Already Have

Researchers propose FINO, a label-free method for adapting vision foundation models to specialized scientific domains using existing metadata rather than expensive labeled datasets. The approach combines self-supervised learning with metadata guidance, demonstrating superior performance across microscopy, Earth observation, and medical imaging compared to both unsupervised and fully supervised alternatives.

AIBullisharXiv – CS AI · Jun 47/10

🧠

Parthenon Law: A Self-Evolving Legal-Agent Framework

Researchers introduce Parthenon, a self-evolving legal-agent framework that addresses critical limitations in deploying AI agents for complex legal work. Through analysis of 12,510 agent trajectories, the study reveals that even frontier LLMs struggle with end-to-end legal task completion, prompting the development of a modular architecture that learns from failures without retraining underlying models.

AIBullisharXiv – CS AI · Jun 27/10

🧠

MiCU: End-to-End Smart Home Command Understanding with Large Language Model

Xiaomi researchers have developed MiCU, a domain-specific large language model optimized for smart home command understanding that handles ambiguous user requests better than traditional systems. The model employs curriculum learning, reinforcement learning, and token compression techniques, achieving 20% average accuracy gains and reducing user correction rates by 1.57% in production deployment across 1.7 million daily active users in the Xiaomi Home app.

AIBullisharXiv – CS AI · Jun 17/10

🧠

Exploring Autonomous Agentic Data Engineering for Model Specialization

Researchers introduce Autonomous Agentic Data Engineering, a framework enabling LLMs to independently curate and optimize training data for model specialization. GPT-5.2 demonstrated the capability by improving a student model's performance by 57.29% through iterative, agent-driven data adaptation without human intervention.

🧠 GPT-5

AIBullisharXiv – CS AI · May 297/10

🧠

Domain-Specific Data Synthesis for LLMs via Minimal Sufficient Representation Learning

Researchers introduce DOMINO, a framework that synthesizes domain-specific training data for large language models by learning from reference examples rather than explicit domain descriptions. The approach combines prompt tuning with contrastive learning to generate diverse, high-quality synthetic data without manual prompt engineering, improving coding task performance by up to 4.63%.

AIBullisharXiv – CS AI · May 297/10

🧠

CityGen: Structure-Guided City-Style Synthesis for Cross-City Autonomous Driving

Researchers introduce CityGen, a diffusion-based framework that enables autonomous driving systems to generalize across different cities without labeled training data. The approach uses HD-map guidance and visual prompts to synthesize city-specific driving scenarios, addressing a critical scalability challenge in deploying autonomous vehicles to new geographic regions.

AIBullisharXiv – CS AI · May 277/10

🧠

VesselSim: learning 3D blood vessel segmentation without expert annotations

Researchers introduce VesselSim, a framework that trains 3D blood vessel segmentation models entirely on synthetic, unannotated data rather than requiring expert-labeled medical images. The system combines geometric vascular simulation with domain adaptation techniques to achieve competitive performance with state-of-the-art models on real clinical scans across multiple imaging modalities and anatomical regions.

AIBullisharXiv – CS AI · May 47/10

🧠

Lightweight Domain Adaptation of a Large Language Model for Legal Assistance in the Indian Context

Researchers developed Legal Assist AI, a framework using an 8-billion-parameter Llama 3.1 model enhanced with Retrieval-Augmented Generation to provide legal assistance tailored to Indian law. The system achieved 60.08% on the All-India Bar Examination benchmark, outperforming OpenAI's 175-billion-parameter GPT-3.5 Turbo while being 22 times more parameter-efficient.

🧠 Llama

AIBullisharXiv – CS AI · Mar 267/10

🧠

You only need 4 extra tokens: Synergistic Test-time Adaptation for LLMs

Researchers developed SyTTA, a test-time adaptation framework that improves large language models' performance in specialized domains without requiring additional labeled data. The method achieved over 120% improvement on agricultural question answering tasks using just 4 extra tokens per query, addressing the challenge of deploying LLMs in domains with limited training data.

🏢 Perplexity

AIBullisharXiv – CS AI · Mar 117/10

🧠

BiCLIP: Domain Canonicalization via Structured Geometric Transformation

Researchers introduce BiCLIP, a new framework that improves vision-language models' ability to adapt to specialized domains through geometric transformations. The approach achieves state-of-the-art results across 11 benchmarks while maintaining simplicity and low computational requirements.

AINeutralarXiv – CS AI · Jun 256/10

🧠

Domain-Specific Agents for Cherenkov Telescope Array Control Software and Gamma-Ray Data Analysis

Researchers have developed domain-adapted large language model agents to support the Cherenkov Telescope Array's operations and gamma-ray data analysis. These agents combine specialized knowledge with automated validation and error correction to improve reliability and reduce manual workload in astronomical research workflows.

AINeutralarXiv – CS AI · Jun 256/10

🧠

BCoughBench: Benchmarking Respiratory Acoustic Foundation Models Under Body-Coupled Wearable Sensor Conditions

BCoughBench introduces a standardized evaluation framework for respiratory acoustic foundation models deployed on body-coupled wearable sensors, revealing significant performance degradation compared to smartphone recordings. The study demonstrates that existing models fail to meet clinical thresholds for disease detection when adapted to wearable conditions, though demographic tasks like age regression remain robust.

AINeutralarXiv – CS AI · Jun 256/10

🧠

A cross-process welding penetration status prediction algorithm based on unsupervised domain adaptation in laser and TIG welding

Researchers have developed an unsupervised domain adaptation framework that enables deep learning models to predict weld penetration status across different welding processes without extensive relabeling. The approach achieves 80-81% accuracy in cross-process transfer between TIG and laser welding, significantly outperforming supervised baselines and reducing the cost of deploying AI systems to new welding environments.

AINeutralarXiv – CS AI · Jun 236/10

🧠

The Power of Light: Improving Synthetic-to-Real Domain Adaptation through Physically-Based Indirect Illumination

Researchers present SmartSDG, an automated pipeline using physically-based rendering to improve synthetic-to-real domain adaptation for object detection. The study demonstrates that indirect lighting and complex backgrounds significantly reduce the performance gap between synthetic training data and real-world applications, with implications for industrial automation and computer vision systems.

🏢 Nvidia

AINeutralarXiv – CS AI · Jun 236/10

🧠

Domain Adaptation Under Wireless Network Constraints: When Does It Become Green?

Researchers investigate the energy consumption trade-offs of Unsupervised Domain Adaptation (UDA) versus retraining in 6G wireless networks, proposing a framework to determine when UDA becomes more energy-efficient when accounting for labeling costs and multiple target domains.

AINeutralarXiv – CS AI · Jun 236/10

🧠

Speaker Identity in Non-Verbal Vocalizations: Conditional Distillation and Mixture of Experts Approach

Researchers present a novel framework for speaker verification in non-verbal vocalizations (NVVs) like laughter and sighs, combining Data2Vec features with ECAPA-TDNN and a Mixture of Experts module. The approach reduces speech-to-NVV error rates from 38.93% to 22.66% while maintaining speech verification accuracy, addressing a critical gap in voice authentication systems as TTS and voice conversion technologies become increasingly sophisticated.

AINeutralarXiv – CS AI · Jun 236/10

🧠

MixedPEFT: Combining Multiple PEFT Methods with Mixed Objectives for Unsupervised Domain Adaptation

Researchers present MixedPEFT, a parameter-efficient fine-tuning method combining multiple adaptation techniques to improve pre-trained language models' performance on new domains without full retraining. The approach achieves state-of-the-art results on domain adaptation benchmarks while using only 7% of trainable parameters, demonstrating that strategic architectural combinations can outperform both existing efficient methods and computationally expensive full fine-tuning.

AINeutralarXiv – CS AI · Jun 195/10

🧠

A Comparative Study of Pretrained Transformer Models for Quranic ASR: Speech Representations, Label Formats, and Dataset Composition

Researchers developed improved Automatic Speech Recognition (ASR) models for Quranic recitation using pretrained Transformer architectures (Wav2Vec2.0, HuBERT, XLS-R), achieving 8% word error rates compared to 16.3% baseline performance. The study demonstrates that domain-specific fine-tuning with 870+ hours of professional and user-recited Quranic audio, combined with Arabic text without diacritics, significantly enhances transcription accuracy while reducing training time by 71%.

Page 1 of 4Next →