AIBullisharXiv – CS AI · Mar 46/102
🧠Researchers developed SAE-based Transferability Score (STS), a new metric using sparse autoencoders to predict how well fine-tuned large language models will perform across different domains without requiring actual training. The method achieves correlation coefficients above 0.7 with actual performance changes and provides interpretable insights into model adaptation.
AINeutralarXiv – CS AI · 4d ago6/10
🧠Researchers benchmarked 22 embedding models on patent data, finding that optimal fine-tuning strategies vary by task and that single-landscape fine-tuning degrades cross-domain performance. The study reveals significant gaps between in-domain and out-of-domain retrieval that cannot be closed with hybrid approaches, challenging assumptions about universal embedding solutions.
🧠 Llama
AINeutralarXiv – CS AI · May 116/10
🧠Researchers introduce IntentGrasp, a comprehensive benchmark dataset for evaluating how well large language models understand user intent across 12 diverse domains. Testing 20 frontier LLMs reveals widespread performance gaps, with most models scoring below 60% accuracy and many performing worse than random chance on challenging subsets, while a proposed fine-tuning method achieves 20-30+ point improvements.
🧠 GPT-5🧠 Claude🧠 Gemini
AINeutralarXiv – CS AI · Mar 116/10
🧠Researchers introduce EgoCross, a new benchmark to evaluate multimodal AI models on egocentric video understanding across diverse domains like surgery, extreme sports, and industrial settings. The study reveals that current AI models, including specialized egocentric models, struggle with cross-domain generalization beyond common daily activities.
AINeutralarXiv – CS AI · Mar 27/1017
🧠Researchers conducted a benchmark study on IoT botnet intrusion detection systems, finding that models trained on one network domain suffer significant performance degradation when applied to different environments. The study evaluated three feature sets across four IoT datasets and provided guidelines for improving cross-domain robustness through better feature engineering and algorithm selection.
AIBullisharXiv – CS AI · Feb 276/105
🧠Researchers propose TAESAR, a new data-centric framework for improving recommendation models by transforming mixed-domain data into unified target-domain sequences. The approach uses contrastive decoding to address domain gaps and data sparsity issues, outperforming traditional model-centric solutions while generalizing across various sequential models.
AINeutralarXiv – CS AI · Feb 274/105
🧠Researchers introduce CGSA, a new framework for source-free domain adaptive object detection that integrates Object-Centric Learning into DETR-based detectors. The approach uses Hierarchical Slot Awareness and Class-Guided Slot Contrast modules to improve cross-domain object detection without retaining source data, demonstrating superior performance on multiple datasets.
AINeutralarXiv – CS AI · Mar 34/105
🧠Researchers developed TAR-FAS, a new AI framework that uses external visual tools to improve face anti-spoofing detection across different domains. The system employs a Chain-of-Thought approach with visual tools to detect subtle spoofing patterns that traditional methods miss, achieving state-of-the-art performance.
AINeutralarXiv – CS AI · Mar 24/105
🧠Researchers propose BDGxRL, a novel framework using Diffusion Schrödinger Bridge to enable reinforcement learning agents to transfer policies across different domains without direct target environment access. The method aligns source domain transitions with target dynamics through offline demonstrations and introduces reward modulation for consistent learning.