y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#deep-learning News & Analysis

244 articles tagged with #deep-learning. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

244 articles
AINeutralarXiv โ€“ CS AI ยท Mar 164/10
๐Ÿง 

Spatio-Semantic Expert Routing Architecture with Mixture-of-Experts for Referring Image Segmentation

Researchers propose SERA, a new architecture for referring image segmentation that uses mixture-of-experts and expression-aware routing to improve pixel-level mask generation from natural language descriptions. The system introduces lightweight expert refinement stages and parameter-efficient tuning that updates less than 1% of backbone parameters while achieving superior performance on spatial localization and boundary delineation tasks.

AINeutralarXiv โ€“ CS AI ยท Mar 165/10
๐Ÿง 

BoSS: A Best-of-Strategies Selector as an Oracle for Deep Active Learning

Researchers introduce BoSS (Best-of-Strategies Selector), a new oracle strategy for active learning that outperforms existing methods by using an ensemble approach to select optimal data annotation batches. The study reveals that current state-of-the-art active learning strategies still significantly underperform compared to oracle performance, particularly on large-scale datasets.

AINeutralarXiv โ€“ CS AI ยท Mar 114/10
๐Ÿง 

Multi-model approach for autonomous driving: A comprehensive study on traffic sign-, vehicle- and lane detection and behavioral cloning

Researchers have developed a comprehensive multi-model approach for autonomous driving that integrates deep learning and computer vision techniques for traffic sign classification, vehicle detection, lane detection, and behavioral cloning. The study utilizes pre-trained and custom neural networks with data augmentation and transfer learning techniques, testing on datasets including the German Traffic Sign Recognition Benchmark and Udacity simulator data.

AINeutralarXiv โ€“ CS AI ยท Mar 115/10
๐Ÿง 

When Learning Rates Go Wrong: Early Structural Signals in PPO Actor-Critic

Researchers introduce the Overfitting-Underfitting Indicator (OUI) to analyze learning rate sensitivity in PPO reinforcement learning systems. The metric can identify problematic learning rates early in training by measuring neural activation patterns, enabling more efficient hyperparameter screening without full training runs.

AINeutralarXiv โ€“ CS AI ยท Mar 94/10
๐Ÿง 

Facial Expression Recognition Using Residual Masking Network

Researchers propose a novel Residual Masking Network that combines deep residual networks with attention mechanisms for facial expression recognition. The method achieves state-of-the-art accuracy on FER2013 and VEMO datasets by using segmentation networks to refine feature maps and focus on relevant facial information.

AIBullisharXiv โ€“ CS AI ยท Mar 95/10
๐Ÿง 

CLAIRE: Compressed Latent Autoencoder for Industrial Representation and Evaluation -- A Deep Learning Framework for Smart Manufacturing

Researchers introduce CLAIRE, a deep learning framework that combines unsupervised autoencoders with supervised classification for fault detection in industrial manufacturing. The system transforms high-dimensional sensor data into compact representations and uses explainable AI techniques to identify key features contributing to fault predictions.

AINeutralarXiv โ€“ CS AI ยท Mar 44/102
๐Ÿง 

Temporal Imbalance of Positive and Negative Supervision in Class-Incremental Learning

Researchers at arXiv have identified temporal imbalance as a key factor causing catastrophic forgetting in Class-Incremental Learning (CIL) systems. They propose Temporal-Adjusted Loss (TAL), a new method that uses temporal decay kernels to reweight negative supervision, demonstrating significant improvements in reducing forgetting across multiple CIL benchmarks.

AINeutralarXiv โ€“ CS AI ยท Mar 44/102
๐Ÿง 

Deep Learning Based Wildfire Detection for Peatland Fires Using Transfer Learning

Researchers developed a transfer learning approach for detecting peatland fires using deep learning models adapted from conventional wildfire detection systems. The method addresses the unique challenges of peatland fires, which have distinct characteristics like low flame intensity and persistent smoke that make them difficult to detect with standard wildfire detection models.

AINeutralarXiv โ€“ CS AI ยท Mar 44/104
๐Ÿง 

Differentiable Time-Varying IIR Filtering for Real-Time Speech Denoising

Researchers have developed TVF (Time-Varying Filtering), a lightweight 1 million parameter speech enhancement model that combines digital signal processing with deep learning for real-time speech denoising. The model uses a neural network to predict coefficients for a 35-band IIR filter cascade, offering interpretable processing while adapting dynamically to changing noise conditions.

AINeutralarXiv โ€“ CS AI ยท Mar 44/102
๐Ÿง 

Joint Training Across Multiple Activation Sparsity Regimes

Researchers propose a novel neural network training strategy that cycles models through multiple activation sparsity regimes using global top-k constraints. Preliminary experiments on CIFAR-10 show this approach outperforms dense baseline training, suggesting joint training across sparse and dense activation patterns may improve generalization.

AINeutralarXiv โ€“ CS AI ยท Mar 44/102
๐Ÿง 

CASR-Net: An Image Processing-focused Deep Learning-based Coronary Artery Segmentation and Refinement Network for X-ray Coronary Angiogram

Researchers developed CASR-Net, a deep learning pipeline for automated coronary artery segmentation in X-ray angiograms that combines image preprocessing, UNet-based segmentation, and refinement stages. The system achieved superior performance with 61.43% IoU and 76.10% DSC on public datasets, potentially improving clinical diagnosis of coronary artery disease.

AINeutralarXiv โ€“ CS AI ยท Mar 35/107
๐Ÿง 

SubstratumGraphEnv: Reinforcement Learning Environment (RLE) for Modeling System Attack Paths

Researchers developed SubstratumGraphEnv, a reinforcement learning framework that models Windows system attack paths using graph representations derived from Sysmon logs. The system combines Graph Convolutional Networks with Actor-Critic models to automate cybersecurity threat analysis and identify malicious process sequences.

AINeutralarXiv โ€“ CS AI ยท Mar 34/103
๐Ÿง 

MAC: A Conversion Rate Prediction Benchmark Featuring Labels Under Multiple Attribution Mechanisms

Researchers have created MAC, the first public conversion rate prediction dataset featuring labels from multiple attribution mechanisms, along with PyMAL, an open-source library for multi-attribution learning approaches. The study introduces a new method called Mixture of Asymmetric Experts (MoAE) that significantly outperforms existing state-of-the-art multi-attribution learning methods.

AINeutralarXiv โ€“ CS AI ยท Mar 34/103
๐Ÿง 

Discovering Symmetry Groups with Flow Matching

Researchers introduce LieFlow, a machine learning framework that automatically discovers symmetries in data by treating symmetry discovery as a distribution learning problem on Lie groups. The approach can identify both continuous and discrete symmetries within a unified framework, significantly outperforming existing methods like LieGAN in experiments on synthetic and real datasets.

AINeutralarXiv โ€“ CS AI ยท Mar 34/103
๐Ÿง 

Latent 3D Brain MRI Counterfactual

Researchers developed a two-stage method using Structural Causal Models in latent space to generate high-quality 3D brain MRI counterfactuals, addressing the challenge of limited training data in medical imaging. The approach combines VQ-VAE encoding with causal modeling to produce diverse, high-fidelity brain MRI data beyond the original training distribution.

AINeutralarXiv โ€“ CS AI ยท Mar 34/104
๐Ÿง 

Improving Wildlife Out-of-Distribution Detection: Africas Big Five

Researchers developed improved out-of-distribution detection methods for wildlife classification, specifically focusing on Africa's Big Five animals to reduce human-wildlife conflict. The study found that feature-based methods using Nearest Class Mean with ImageNet pre-trained features achieved significant improvements of 2%, 4%, and 22% over existing out-of-distribution detection methods.

AIBullisharXiv โ€“ CS AI ยท Mar 34/104
๐Ÿง 

Time-Aware One Step Diffusion Network for Real-World Image Super-Resolution

Researchers propose TADSR, a Time-Aware one-step Diffusion Network that improves real-world image super-resolution by dynamically varying timesteps instead of using fixed ones. The method achieves state-of-the-art performance while allowing controllable trade-offs between image fidelity and realism in a single processing step.

AINeutralarXiv โ€“ CS AI ยท Mar 34/103
๐Ÿง 

Rejuvenating Cross-Entropy Loss in Knowledge Distillation for Recommender Systems

Researchers propose Rejuvenated Cross-Entropy for Knowledge Distillation (RCE-KD) to improve knowledge distillation in recommender systems by addressing limitations of Cross-Entropy loss when distilling teacher model rankings. The method splits teacher's top items into subsets and uses adaptive sampling to better align with theoretical assumptions.

AINeutralarXiv โ€“ CS AI ยท Mar 34/103
๐Ÿง 

Towards Generalizable PDE Dynamics Forecasting via Physics-Guided Invariant Learning

Researchers propose iMOOE, a physics-guided invariant learning method for forecasting partial differential equations (PDEs) dynamics with improved zero-shot generalization. The method addresses limitations in existing deep learning approaches that require test-time adaptation by incorporating fundamental physical invariance principles.

AINeutralarXiv โ€“ CS AI ยท Mar 34/104
๐Ÿง 

Data-Augmented Deep Learning for Downhole Depth Sensing and Validation

Researchers developed a data-augmented deep learning system for accurate downhole depth sensing in oil and gas wells using casing collar locator (CCL) technology. The system addresses limited real well data challenges through comprehensive preprocessing methods, achieving F1 score improvements of up to 0.057 for collar recognition models.

AINeutralarXiv โ€“ CS AI ยท Mar 34/103
๐Ÿง 

CloDS: Visual-Only Unsupervised Cloth Dynamics Learning in Unknown Conditions

Researchers introduce CloDS (Cloth Dynamics Splatting), an unsupervised AI framework that learns cloth dynamics from visual observations without requiring known physical properties. The system uses a three-stage pipeline with dual-position opacity modulation to handle complex cloth deformations and self-occlusions through mesh-based Gaussian splatting.

AINeutralarXiv โ€“ CS AI ยท Mar 34/103
๐Ÿง 

Deformation-Free Cross-Domain Image Registration via Position-Encoded Temporal Attention

Researchers developed GPEReg-Net, a new AI method for cross-domain image registration that eliminates the need for explicit deformation field estimation by decomposing images into domain-invariant scene representations and appearance statistics. The system achieves state-of-the-art performance on benchmarks while running 1.87x faster than existing methods, using position-encoded temporal attention for sequential image processing.

AIBullisharXiv โ€“ CS AI ยท Mar 35/105
๐Ÿง 

Efficient Long-Sequence Diffusion Modeling for Symbolic Music Generation

Researchers developed SMDIM, a new diffusion model for symbolic music generation that efficiently handles long sequences by combining global structure construction with local refinement. The model outperforms existing approaches in both generation quality and computational efficiency across various musical styles including Western classical, popular, and folk music.

$NEAR