244 articles tagged with #deep-learning. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.
AI · Neutral · arXiv – CS AI · Mar 16 · 4/10
🧠 Researchers propose SERA, a new architecture for referring image segmentation that uses mixture-of-experts and expression-aware routing to improve pixel-level mask generation from natural language descriptions. The system introduces lightweight expert refinement stages and parameter-efficient tuning that updates less than 1% of backbone parameters while achieving superior performance on spatial localization and boundary delineation tasks.
AI · Neutral · arXiv – CS AI · Mar 16 · 5/10
🧠 Researchers introduce BoSS (Best-of-Strategies Selector), a new oracle strategy for active learning that outperforms existing methods by using an ensemble approach to select optimal data annotation batches. The study reveals that current state-of-the-art active learning strategies still significantly underperform compared to oracle performance, particularly on large-scale datasets.
AI · Neutral · arXiv – CS AI · Mar 11 · 4/10
🧠 Researchers have developed a comprehensive multi-model approach for autonomous driving that integrates deep learning and computer vision techniques for traffic sign classification, vehicle detection, lane detection, and behavioral cloning. The study utilizes pre-trained and custom neural networks with data augmentation and transfer learning techniques, testing on datasets including the German Traffic Sign Recognition Benchmark and Udacity simulator data.
AI · Neutral · arXiv – CS AI · Mar 11 · 5/10
🧠 Researchers introduce the Overfitting-Underfitting Indicator (OUI) to analyze learning rate sensitivity in PPO reinforcement learning systems. The metric can identify problematic learning rates early in training by measuring neural activation patterns, enabling more efficient hyperparameter screening without full training runs.
AI · Neutral · arXiv – CS AI · Mar 9 · 4/10
🧠 A research paper reviews molecular representations inspired by natural language processing for AI applications in chemistry and materials science. The paper serves as a guide for NLP researchers to understand chemical representations and their AI-based applications.
AI · Neutral · arXiv – CS AI · Mar 9 · 4/10
🧠 Researchers propose a novel Residual Masking Network that combines deep residual networks with attention mechanisms for facial expression recognition. The method achieves state-of-the-art accuracy on FER2013 and VEMO datasets by using segmentation networks to refine feature maps and focus on relevant facial information.
AI · Bullish · arXiv – CS AI · Mar 9 · 5/10
🧠 Researchers introduce CLAIRE, a deep learning framework that combines unsupervised autoencoders with supervised classification for fault detection in industrial manufacturing. The system transforms high-dimensional sensor data into compact representations and uses explainable AI techniques to identify key features contributing to fault predictions.
AI · Neutral · arXiv – CS AI · Mar 4 · 4/10 · 2
🧠 Researchers have identified temporal imbalance as a key factor causing catastrophic forgetting in Class-Incremental Learning (CIL) systems. They propose Temporal-Adjusted Loss (TAL), a new method that uses temporal decay kernels to reweight negative supervision, demonstrating significant improvements in reducing forgetting across multiple CIL benchmarks.
AI · Neutral · arXiv – CS AI · Mar 4 · 4/10 · 2
🧠 Researchers developed a transfer learning approach for detecting peatland fires using deep learning models adapted from conventional wildfire detection systems. The method addresses the unique challenges of peatland fires, which have distinct characteristics like low flame intensity and persistent smoke that make them difficult to detect with standard wildfire detection models.
AI · Neutral · arXiv – CS AI · Mar 4 · 4/10 · 4
🧠 Researchers have developed TVF (Time-Varying Filtering), a lightweight 1-million-parameter speech enhancement model that combines digital signal processing with deep learning for real-time speech denoising. The model uses a neural network to predict coefficients for a 35-band IIR filter cascade, offering interpretable processing while adapting dynamically to changing noise conditions.
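For readers unfamiliar with the filtering side of this summary, the following is a minimal sketch of an IIR filter cascade built from second-order sections (biquads). It is not the TVF implementation: the summary does not say how the 35 bands are parameterized, so the biquad form, the function names `biquad` and `cascade`, and the fixed coefficients are all assumptions for illustration. In a time-varying setup, a predictor would presumably supply a fresh set of coefficients for each audio frame.

```python
def biquad(signal, coeffs):
    """Apply one second-order IIR section (direct form I).

    coeffs = (b0, b1, b2, a1, a2), with the leading a0 normalized to 1:
    y[n] = b0*x[n] + b1*x[n-1] + b2*x[n-2] - a1*y[n-1] - a2*y[n-2]
    """
    b0, b1, b2, a1, a2 = coeffs
    x1 = x2 = y1 = y2 = 0.0  # filter state (previous inputs and outputs)
    out = []
    for x in signal:
        y = b0 * x + b1 * x1 + b2 * x2 - a1 * y1 - a2 * y2
        x2, x1 = x1, x
        y2, y1 = y1, y
        out.append(y)
    return out


def cascade(signal, sections):
    """Run the signal through each section in series.

    `sections` is a list of coefficient tuples; a 35-band cascade would
    pass 35 of them (hypothetical layout, not taken from the paper).
    """
    for coeffs in sections:
        signal = biquad(signal, coeffs)
    return signal


# Impulse through a single one-pole low-pass section (b0=1, a1=-0.5).
impulse_response = cascade([1.0, 0.0, 0.0], [(1.0, 0.0, 0.0, -0.5, 0.0)])
```

Because each section is a small, explicit difference equation, the per-band coefficients can be inspected directly, which is one plausible reading of the "interpretable processing" claim.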
AI · Neutral · arXiv – CS AI · Mar 4 · 4/10 · 2
🧠 Researchers propose a novel neural network training strategy that cycles models through multiple activation sparsity regimes using global top-k constraints. Preliminary experiments on CIFAR-10 show this approach outperforms dense baseline training, suggesting joint training across sparse and dense activation patterns may improve generalization.
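As a rough illustration of what a global top-k activation constraint can look like, the sketch below keeps the k largest values across a whole activation matrix and zeroes the rest, then cycles k through a list of budgets. The summary gives no details of the paper's schedule or of whether selection is by value or magnitude, so selection by value, the names `global_topk` and `sparsity_schedule`, and the example budgets are assumptions.

```python
def global_topk(activations, k):
    """Keep the k largest activations over the whole matrix; zero the rest.

    "Global" means the budget k is shared by all rows, not applied per row.
    Selection is by raw value (an assumption; reasonable after a ReLU).
    """
    flat = [(v, r, c)
            for r, row in enumerate(activations)
            for c, v in enumerate(row)]
    keep = sorted(flat, key=lambda t: t[0], reverse=True)[:max(k, 0)]
    out = [[0.0] * len(row) for row in activations]
    for v, r, c in keep:
        out[r][c] = v
    return out


def sparsity_schedule(step, budgets):
    """Cycle through a list of k budgets, one per training step (hypothetical)."""
    return budgets[step % len(budgets)]


acts = [[0.1, 0.9, 0.3],
        [0.7, 0.2, 0.5]]
budgets = [6, 3, 1]  # dense, medium, and very sparse regimes
sparse = global_topk(acts, 2)
```

With a budget equal to the number of activations, the function returns the dense input unchanged, so dense training is just one point on the same schedule.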
AI · Neutral · arXiv – CS AI · Mar 4 · 4/10 · 2
🧠 Researchers developed an AI diffusion model to reconstruct missing terrain data from Martian satellite imagery for Virtual Reality space exploration applications. The method, trained on 12,000 NASA HiRISE heightmaps, outperformed traditional interpolation techniques by 4-15% in accuracy and 29-81% in perceptual similarity.
AI · Neutral · arXiv – CS AI · Mar 4 · 4/10 · 2
🧠 Researchers developed CASR-Net, a deep learning pipeline for automated coronary artery segmentation in X-ray angiograms that combines image preprocessing, UNet-based segmentation, and refinement stages. The system achieved superior performance with 61.43% IoU and 76.10% DSC on public datasets, potentially improving clinical diagnosis of coronary artery disease.
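The two metrics quoted here, IoU (intersection over union) and DSC (Dice similarity coefficient), are standard overlap scores for binary masks. The sketch below computes both from flattened 0/1 masks; the function names and the toy masks are illustrative, and returning 1.0 when both masks are empty is a convention chosen here, not something stated in the summary.

```python
def iou_and_dice(pred, target):
    """Return (IoU, Dice) for two equal-length binary masks (lists of 0/1)."""
    inter = sum(1 for p, t in zip(pred, target) if p and t)
    pred_area = sum(1 for p in pred if p)
    target_area = sum(1 for t in target if t)
    union = pred_area + target_area - inter
    if union == 0:
        return 1.0, 1.0  # both masks empty: treated as perfect agreement
    iou = inter / union
    dice = 2 * inter / (pred_area + target_area)
    return iou, dice


def dice_from_iou(iou):
    """For a single pair of masks, Dice = 2*IoU / (1 + IoU)."""
    return 2 * iou / (1 + iou)


pred = [1, 1, 0, 0, 1]
target = [1, 0, 0, 1, 1]
iou, dice = iou_and_dice(pred, target)
```

Applying `dice_from_iou` to 0.6143 gives roughly 0.761, in line with the reported 76.10% DSC. The identity holds exactly only per image, so scores averaged over a dataset can differ slightly.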
AI · Neutral · arXiv – CS AI · Mar 3 · 5/10 · 7
🧠 Researchers developed SubstratumGraphEnv, a reinforcement learning framework that models Windows system attack paths using graph representations derived from Sysmon logs. The system combines Graph Convolutional Networks with Actor-Critic models to automate cybersecurity threat analysis and identify malicious process sequences.
AI · Neutral · arXiv – CS AI · Mar 3 · 4/10 · 3
🧠 Researchers have created MAC, the first public conversion rate prediction dataset featuring labels from multiple attribution mechanisms, along with PyMAL, an open-source library for multi-attribution learning approaches. The study introduces a new method called Mixture of Asymmetric Experts (MoAE) that significantly outperforms existing state-of-the-art multi-attribution learning methods.
AI · Neutral · arXiv – CS AI · Mar 3 · 4/10 · 3
🧠 Researchers introduce LieFlow, a machine learning framework that automatically discovers symmetries in data by treating symmetry discovery as a distribution learning problem on Lie groups. The approach can identify both continuous and discrete symmetries within a unified framework, significantly outperforming existing methods like LieGAN in experiments on synthetic and real datasets.
AI · Neutral · arXiv – CS AI · Mar 3 · 4/10 · 3
🧠 Researchers developed a two-stage method using Structural Causal Models in latent space to generate high-quality 3D brain MRI counterfactuals, addressing the challenge of limited training data in medical imaging. The approach combines VQ-VAE encoding with causal modeling to produce diverse, high-fidelity brain MRI data beyond the original training distribution.
AI · Neutral · arXiv – CS AI · Mar 3 · 4/10 · 4
🧠 Researchers developed improved out-of-distribution detection methods for wildlife classification, specifically focusing on Africa's Big Five animals to reduce human-wildlife conflict. The study found that feature-based methods using Nearest Class Mean with ImageNet pre-trained features achieved significant improvements of 2%, 4%, and 22% over existing out-of-distribution detection methods.
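A Nearest Class Mean (NCM) detector of the kind mentioned here can be sketched in a few lines: average the feature vectors of each known class, then flag an input as out-of-distribution when it is far from every class mean. The summary does not describe the paper's distance measure or threshold, so Euclidean distance, the threshold value, the helper names, and the toy 2-D features (standing in for ImageNet pre-trained embeddings) are all assumptions.

```python
import math


def class_means(features, labels):
    """Average the feature vectors belonging to each label."""
    sums, counts = {}, {}
    for x, y in zip(features, labels):
        if y not in sums:
            sums[y] = [0.0] * len(x)
            counts[y] = 0
        sums[y] = [s + v for s, v in zip(sums[y], x)]
        counts[y] += 1
    return {y: [s / counts[y] for s in sums[y]] for y in sums}


def nearest_class(x, means):
    """Return (label, distance) of the closest class mean (Euclidean)."""
    return min(((y, math.dist(x, m)) for y, m in means.items()),
               key=lambda t: t[1])


def is_ood(x, means, threshold):
    """Flag x as out-of-distribution if no class mean is within threshold."""
    return nearest_class(x, means)[1] > threshold


# Toy 2-D "features" for two known classes (illustrative values only).
feats = [[0.0, 0.0], [0.0, 2.0], [10.0, 10.0], [10.0, 12.0]]
labels = ["lion", "lion", "rhino", "rhino"]
means = class_means(feats, labels)
```

In practice the threshold would be tuned on held-out in-distribution data; a fixed value is used here only to keep the sketch short.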
AI · Bullish · arXiv – CS AI · Mar 3 · 4/10 · 4
🧠 Researchers propose TADSR, a Time-Aware one-step Diffusion Network that improves real-world image super-resolution by dynamically varying timesteps instead of using fixed ones. The method achieves state-of-the-art performance while allowing controllable trade-offs between image fidelity and realism in a single processing step.
AI · Neutral · arXiv – CS AI · Mar 3 · 4/10 · 3
🧠 Researchers propose Rejuvenated Cross-Entropy for Knowledge Distillation (RCE-KD) to improve knowledge distillation in recommender systems by addressing limitations of Cross-Entropy loss when distilling teacher model rankings. The method splits the teacher's top items into subsets and uses adaptive sampling to better align with theoretical assumptions.
AI · Neutral · arXiv – CS AI · Mar 3 · 4/10 · 3
🧠 Researchers propose iMOOE, a physics-guided invariant learning method for forecasting partial differential equation (PDE) dynamics with improved zero-shot generalization. The method addresses limitations in existing deep learning approaches that require test-time adaptation by incorporating fundamental physical invariance principles.
AI · Neutral · arXiv – CS AI · Mar 3 · 4/10 · 4
🧠 Researchers developed a data-augmented deep learning system for accurate downhole depth sensing in oil and gas wells using casing collar locator (CCL) technology. The system addresses limited real well data challenges through comprehensive preprocessing methods, achieving F1 score improvements of up to 0.057 for collar recognition models.
AI · Neutral · arXiv – CS AI · Mar 3 · 4/10 · 3
🧠 Researchers introduce CloDS (Cloth Dynamics Splatting), an unsupervised AI framework that learns cloth dynamics from visual observations without requiring known physical properties. The system uses a three-stage pipeline with dual-position opacity modulation to handle complex cloth deformations and self-occlusions through mesh-based Gaussian splatting.
AI · Neutral · arXiv – CS AI · Mar 3 · 4/10 · 3
🧠 Researchers developed GPEReg-Net, a new AI method for cross-domain image registration that eliminates the need for explicit deformation field estimation by decomposing images into domain-invariant scene representations and appearance statistics. The system achieves state-of-the-art performance on benchmarks while running 1.87x faster than existing methods, using position-encoded temporal attention for sequential image processing.
AI · Bullish · arXiv – CS AI · Mar 3 · 5/10 · 5
🧠 Researchers developed SMDIM, a new diffusion model for symbolic music generation that efficiently handles long sequences by combining global structure construction with local refinement. The model outperforms existing approaches in both generation quality and computational efficiency across various musical styles including Western classical, popular, and folk music.