y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#deep-learning News & Analysis

257 articles tagged with #deep-learning. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

257 articles
AIBullishMIT News – AI · Dec 154/104
🧠

Deep-learning model predicts how fruit flies form, cell by cell

Researchers have developed a deep-learning model that can predict fruit fly development at the cellular level. The approach has potential applications for analyzing more complex tissues and organs, which could help identify early disease markers.

AIBullishHugging Face Blog · May 215/108
🧠

nanoVLM: The simplest repository to train your VLM in pure PyTorch

nanoVLM is introduced as a simplified repository for training Vision Language Models (VLMs) using pure PyTorch. The project aims to make VLM training more accessible by providing a streamlined approach without complex dependencies.

AINeutralLil'Log (Lilian Weng) · Feb 54/10
🧠

Thinking about High-Quality Human Data

The article discusses the critical importance of high-quality human-labeled data for training modern deep learning models, particularly for classification tasks and RLHF labeling used in LLM alignment. Despite the recognized value of quality data, there's a notable preference in the ML community for model development work over data collection and annotation work.

AIBullishHugging Face Blog · Jan 194/105
🧠

Universal Image Segmentation with Mask2Former and OneFormer

This article discusses Universal Image Segmentation techniques using Mask2Former and OneFormer architectures. These are advanced computer vision models that can perform multiple segmentation tasks in a unified framework, representing significant progress in AI image understanding capabilities.

AIBullishHugging Face Blog · Jan 175/105
🧠

Welcome PaddlePaddle to the Hugging Face Hub

Hugging Face has integrated PaddlePaddle, Baidu's deep learning framework, into their model hub platform. This integration expands Hugging Face's ecosystem by adding support for another major AI framework alongside existing options like PyTorch and TensorFlow.

AIBullishHugging Face Blog · Nov 194/105
🧠

Accelerating PyTorch distributed fine-tuning with Intel technologies

The article discusses methods for accelerating PyTorch distributed fine-tuning using Intel's hardware and software technologies. It focuses on optimizations for training deep learning models more efficiently on Intel infrastructure.

AINeutralOpenAI News · Feb 264/105
🧠

Spinning Up in Deep RL: Workshop review

OpenAI held its first Spinning Up Workshop on February 2 as part of a new education initiative. This represents OpenAI's effort to expand educational resources in deep reinforcement learning.

AINeutralLil'Log (Lilian Weng) · Oct 134/10
🧠

Flow-based Deep Generative Models

This article introduces flow-based deep generative models as a third type of generative AI model that, unlike GANs and VAEs, explicitly learns the probability density function of input data. The piece explains the mathematical challenges in calculating probability density functions due to the intractability of integrating over all possible latent variable values.

AINeutralOpenAI News · Oct 114/106
🧠

OpenAI Scholars 2019: Applications open

OpenAI is accepting applications for its second cohort of OpenAI Scholars, a program offering 6-10 stipends and mentorship to underrepresented individuals. The program allows participants to study deep learning full-time for 3 months while working on open-source projects.

AIBullishOpenAI News · Mar 64/104
🧠

OpenAI Scholars

OpenAI is launching a scholarship program offering 6-10 stipends and mentorship to underrepresented individuals to study deep learning full-time for 3 months. Participants will be required to open-source a project as part of the program.

AINeutralarXiv – CS AI · Mar 34/106
🧠

OrthoAI: A Lightweight Deep Learning Framework for Automated Biomechanical Analysis in Clear Aligner Orthodontics -- A Methodological Proof-of-Concept

Researchers have developed OrthoAI, an open-source lightweight AI framework that uses 3D dental segmentation and biomechanical analysis to automate orthodontic treatment plan evaluation. The system achieves 81.4% tooth identification accuracy and runs in under 4 seconds on consumer hardware, though it has only been tested on landmark-derived data rather than real intraoral scans.

AIBullisharXiv – CS AI · Mar 34/103
🧠

Disentangled Hierarchical VAE for 3D Human-Human Interaction Generation

Researchers have developed DHVAE (Disentangled Hierarchical Variational Autoencoder), a new AI model for generating realistic 3D human-human interactions. The system uses hierarchical latent diffusion and contrastive learning to create physically plausible interactions while maintaining computational efficiency.

AINeutralarXiv – CS AI · Mar 34/107
🧠

A Case Study on Concept Induction for Neuron-Level Interpretability in CNN

Researchers successfully applied a Concept Induction framework for neural network interpretability to the SUN2012 dataset, demonstrating the method's broader applicability beyond the original ADE20K dataset. The study assigns interpretable semantic labels to hidden neurons in CNNs and validates them through statistical testing and web-sourced images.

AIBullisharXiv – CS AI · Mar 34/106
🧠

AdURA-Net: Adaptive Uncertainty and Region-Aware Network

AdURA-Net is a new AI framework designed for medical image analysis that addresses uncertainty in clinical decision-making for thoracic disease classification. The system uses adaptive dilated convolution and a dual head loss function to handle uncertain diagnostic labels in medical datasets like CheXpert and MIMIC-CXR.

AINeutralarXiv – CS AI · Mar 34/105
🧠

Rooted Absorbed Prefix Trajectory Balance with Submodular Replay for GFlowNet Training

Researchers propose RapTB, a new training objective for Generative Flow Networks (GFlowNets) that addresses mode collapse issues in fine-tuning large language models. The method includes a submodular replay strategy (SubM) and demonstrates improved performance in molecule generation tasks while maintaining diversity and validity.

AINeutralarXiv – CS AI · Mar 34/105
🧠

Phys-Diff: A Physics-Inspired Latent Diffusion Model for Tropical Cyclone Forecasting

Researchers have developed Phys-Diff, a physics-inspired latent diffusion model for tropical cyclone forecasting that incorporates physical relationships between cyclone attributes. The model integrates multimodal data including historical cyclone data, ERA5 reanalysis, and FengWu forecast fields, achieving state-of-the-art performance on global and regional datasets.

AINeutralarXiv – CS AI · Mar 34/105
🧠

An Analysis of Multi-Task Architectures for the Hierarchic Multi-Label Problem of Vehicle Model and Make Classification

Researchers analyzed multi-task learning architectures for hierarchical classification of vehicle makes and models, testing CNN and Transformer models on StanfordCars and CompCars datasets. The study found that multi-task approaches improved performance for CNNs in almost all scenarios and yielded significant improvements for both model types on the CompCars dataset.

AINeutralarXiv – CS AI · Mar 24/106
🧠

Micro-expression Recognition Based on Dual-branch Feature Extraction and Fusion

Researchers developed a dual-branch neural network for micro-expression recognition that combines residual and Inception networks with parallel attention mechanisms. The method achieved 74.67% accuracy on the CASME II dataset, significantly outperforming existing approaches like LBP-TOP by over 11%.

← PrevPage 10 of 11Next →