AIBullisharXiv – CS AI · 2d ago7/10
🧠Researchers introduce DOMINO, a framework that synthesizes domain-specific training data for large language models by learning from reference examples rather than explicit domain descriptions. The approach combines prompt tuning with contrastive learning to generate diverse, high-quality synthetic data without manual prompt engineering, improving coding task performance by up to 4.63%.
AIBullisharXiv – CS AI · 2d ago7/10
🧠Researchers introduce CityGen, a diffusion-based framework that enables autonomous driving systems to generalize across different cities without labeled training data. The approach uses HD-map guidance and visual prompts to synthesize city-specific driving scenarios, addressing a critical scalability challenge in deploying autonomous vehicles to new geographic regions.
AIBullisharXiv – CS AI · 4d ago7/10
🧠Researchers introduce VesselSim, a framework that trains 3D blood vessel segmentation models entirely on synthetic, unannotated data rather than requiring expert-labeled medical images. The system combines geometric vascular simulation with domain adaptation techniques to achieve competitive performance with state-of-the-art models on real clinical scans across multiple imaging modalities and anatomical regions.
AIBullisharXiv – CS AI · May 47/10
🧠Researchers developed Legal Assist AI, a framework using an 8-billion-parameter Llama 3.1 model enhanced with Retrieval-Augmented Generation to provide legal assistance tailored to Indian law. The system achieved 60.08% on the All-India Bar Examination benchmark, outperforming OpenAI's 175-billion-parameter GPT-3.5 Turbo while being 22 times more parameter-efficient.
🧠 Llama
AIBullisharXiv – CS AI · Mar 267/10
🧠Researchers developed SyTTA, a test-time adaptation framework that improves large language models' performance in specialized domains without requiring additional labeled data. The method achieved over 120% improvement on agricultural question answering tasks using just 4 extra tokens per query, addressing the challenge of deploying LLMs in domains with limited training data.
🏢 Perplexity
AIBullisharXiv – CS AI · Mar 117/10
🧠Researchers introduce BiCLIP, a new framework that improves vision-language models' ability to adapt to specialized domains through geometric transformations. The approach achieves state-of-the-art results across 11 benchmarks while maintaining simplicity and low computational requirements.
AINeutralarXiv – CS AI · 2d ago6/10
🧠Researchers developed a specialized Named Entity Recognition model for identifying disease-related clinical entities in immunology and infectious disease texts, achieving 0.89 F1 score through transformer-based architecture with clinical embeddings. The model outperforms general-purpose NLP systems and LLMs in extracting granular biomedical concepts from unstructured medical narratives, enabling improved cohort identification and clinical decision support.
AINeutralarXiv – CS AI · 2d ago6/10
🧠Researchers introduce MineC2FNet, a deep learning framework that leverages abundant coarse-grained remote sensing data to improve fine-grained mining footprint segmentation in multispectral imagery. The approach uses domain incremental learning with attentive distillation to bridge the gap between coarse and fine datasets, addressing a critical gap in environmental monitoring of global mining operations.
AIBullisharXiv – CS AI · 2d ago6/10
🧠Researchers propose GiPL, a two-branch machine learning framework that combines iterative pseudo-labeling with generative data augmentation to improve cross-domain few-shot object detection using vision-language models. The method demonstrates significant performance improvements on three benchmark datasets, addressing critical challenges in fine-tuning with limited target-domain samples.
AINeutralarXiv – CS AI · 3d ago6/10
🧠Researchers introduce LearnWeak, a framework that improves small computer-use agents by having them learn from their own failures in specific domains rather than training on generic synthetic data. The approach achieves 11-12 percentage point improvements on benchmark tests, demonstrating that targeted, error-aware specialization is more efficient than broad data synthesis for adapting AI agents to particular software environments.
AINeutralarXiv – CS AI · 3d ago6/10
🧠Researchers present a framework for cross-domain generalization in machine learning that extends causal transportability theory to handle sequential prediction tasks. The work introduces module and circuit transportability, enabling models to compose learned mechanisms from source domains to make zero-shot predictions on target domains, with practical few-shot learning methods requiring minimal target domain data.
AINeutralarXiv – CS AI · 3d ago5/10
🧠Researchers propose REED, a post-training representation editing method that improves linguistic steganalysis detection across different domains without modifying model architecture or updating parameters. The technique uses domain-offset vectors and source-domain cover-to-stego directions to adapt detectors to unseen domains with different vocabularies and writing styles.
AINeutralarXiv – CS AI · 3d ago6/10
🧠Researchers have developed a new deepfake detection framework called T-AVFD that addresses a critical gap in audio-visual forgery detection by handling singing scenarios, where traditional cross-modal inconsistency methods fail. The study introduces the SHDF dataset and demonstrates improved detection performance across both talking and singing deepfakes through text-guided pattern learning.
AIBullisharXiv – CS AI · 3d ago6/10
🧠Researchers propose a reinforcement learning framework that enables safer and more efficient transfer of AI agents from simulation to real-world deployment by using probabilistic latent embeddings and dynamic policy adaptation. The approach addresses the critical sim-to-real gap problem in cyber-physical systems like autonomous vehicles by inferring environment context and adjusting risk levels during deployment.
AINeutralarXiv – CS AI · 4d ago6/10
🧠Researchers have extended LELA, an LLM-based entity linking framework, into a practical Python library that combines zero-shot Named Entity Recognition with entity disambiguation. The end-to-end pipeline addresses limitations in existing approaches by offering domain-agnostic capabilities and demonstrating robust performance across diverse entity linking tasks, making it more applicable to real-world usage scenarios.
AIBullisharXiv – CS AI · May 126/10
🧠Researchers introduced PolyLM, a 9-billion-parameter language model that predicts polymer physical and mechanical properties directly from scientific literature without requiring structural chemical data. The model achieved a median R² of 0.74 across 22 diverse properties by training on 185,000 papers and 276,400 polymer samples, demonstrating that natural language processing can effectively capture the experimental context that traditional structure-only models miss.
AINeutralarXiv – CS AI · May 125/10
🧠Researchers propose SimpleST, a lightweight prompt tuning framework that enhances spatio-temporal graph neural networks' ability to generalize across different traffic prediction scenarios. By keeping pre-trained model parameters fixed while adapting through efficient prompting, the approach reduces computational overhead while improving accuracy on real-world urban datasets.
AINeutralarXiv – CS AI · May 126/10
🧠A new study compares Retrieval-Augmented Generation (RAG) and fine-tuning approaches for adapting Large Language Models to enterprise question-answering tasks in the automotive industry. The research finds that RAG offers superior cost-efficiency while maintaining comparable answer quality, even enabling open-source models to match premium model performance.
AIBullisharXiv – CS AI · May 116/10
🧠Researchers propose STDA-Net, a deep learning framework for sleep stage classification that uses 2D spectrograms instead of traditional 1D EEG signals, combined with domain adaptation techniques to work across different datasets. The method achieves 89.03% accuracy and demonstrates superior stability compared to existing approaches, advancing automated sleep staging technology.
AIBullisharXiv – CS AI · May 116/10
🧠TimeLesSeg introduces a unified deep learning framework for segmenting Multiple Sclerosis lesions that works across different imaging contrasts and with or without temporal data. The model uses stochastic generative techniques and domain randomization to address the fragmentation between cross-sectional and longitudinal segmentation approaches, demonstrating superior performance on multiple datasets.
AIBullisharXiv – CS AI · Apr 206/10
🧠Researchers introduce MM-Telco, a comprehensive multimodal benchmark and model suite designed to adapt large language models for telecommunications applications. The framework addresses domain-specific challenges in network optimization, troubleshooting, and customer support, with fine-tuned models demonstrating significant performance improvements over baseline LLMs.
AINeutralarXiv – CS AI · Apr 146/10
🧠Researchers have developed PlantXpert, a multimodal AI benchmark for evaluating vision-language models on agricultural phenotyping tasks for soybean and cotton. The benchmark tests 11 state-of-the-art models across disease detection, pest control, weed management, and yield prediction, revealing that fine-tuned models achieve up to 78% accuracy but struggle with complex reasoning and cross-crop generalization.
AIBullisharXiv – CS AI · Apr 146/10
🧠Researchers present Data Mixing Agent, an AI framework that uses reinforcement learning to automatically optimize how large language models balance training data from source and target domains during continual pre-training. The approach outperforms manual reweighting strategies while generalizing across different models, domains, and fields without requiring retraining.
AINeutralarXiv – CS AI · Apr 106/10
🧠Researchers have developed Luwen, an open-source Chinese legal language model built on Baichuan that uses continual pre-training, supervised fine-tuning, and retrieval-augmented generation to excel at legal tasks. The model outperforms baselines on five legal benchmarks including judgment prediction, judicial examination, and legal reasoning, demonstrating effective domain adaptation for specialized legal applications.
AINeutralarXiv – CS AI · Apr 106/10
🧠Researchers introduce FedDAP, a federated learning framework that addresses domain shift challenges by constructing domain-specific global prototypes rather than single aggregated prototypes. The method aligns local features with prototypes from the same domain while encouraging separation from different domains, improving model generalization across heterogeneous client data.