#diffusion-models News & Analysis

173 articles tagged with #diffusion-models. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

173 articles

AIBullisharXiv – CS AI · Mar 44/103

🧠

Efficient Self-Evaluation for Diffusion Language Models via Sequence Regeneration

Researchers propose DiSE, a self-evaluation method for diffusion large language models (dLLMs) that quantifies confidence by computing token regeneration probabilities. The method enables more efficient quality assessment and introduces a flexible-length generation framework that adaptively controls sequence length based on the model's self-assessment.

AINeutralarXiv – CS AI · Mar 44/102

🧠

Diffusion-EXR: Controllable Review Generation for Explainable Recommendation via Diffusion Models

Researchers propose Diffusion-EXR, a new AI model that uses Denoising Diffusion Probabilistic Models (DDPM) to generate review text for explainable recommendation systems. The model corrupts review embeddings with Gaussian noise and learns to reconstruct them, achieving state-of-the-art performance on benchmark datasets for recommendation review generation.

AINeutralarXiv – CS AI · Mar 44/102

🧠

Inpainting the Red Planet: Diffusion Models for the Reconstruction of Martian Environments in Virtual Reality

Researchers developed an AI diffusion model to reconstruct missing terrain data from Martian satellite imagery for Virtual Reality space exploration applications. The method trained on 12,000 NASA HiRISE heightmaps outperformed traditional interpolation techniques by 4-15% in accuracy and 29-81% in perceptual similarity.

AIBullisharXiv – CS AI · Mar 35/105

🧠

Efficient Long-Sequence Diffusion Modeling for Symbolic Music Generation

Researchers developed SMDIM, a new diffusion model for symbolic music generation that efficiently handles long sequences by combining global structure construction with local refinement. The model outperforms existing approaches in both generation quality and computational efficiency across various musical styles including Western classical, popular, and folk music.

$NEAR

AINeutralarXiv – CS AI · Mar 34/104

🧠

MAGIC: Few-Shot Mask-Guided Anomaly Inpainting with Prompt Perturbation, Spatially Adaptive Guidance, and Context Awareness

MAGIC is a new AI framework for few-shot anomaly detection in industrial quality control that uses mask-guided inpainting to generate high-fidelity synthetic anomalies. The system introduces three key innovations: Gaussian prompt perturbation, spatially adaptive guidance, and context-aware mask alignment to improve anomaly generation while preserving normal regions.

AIBullisharXiv – CS AI · Mar 34/104

🧠

Time-Aware One Step Diffusion Network for Real-World Image Super-Resolution

Researchers propose TADSR, a Time-Aware one-step Diffusion Network that improves real-world image super-resolution by dynamically varying timesteps instead of using fixed ones. The method achieves state-of-the-art performance while allowing controllable trade-offs between image fidelity and realism in a single processing step.

AINeutralarXiv – CS AI · Mar 34/103

🧠

DistillKac: Few-Step Image Generation via Damped Wave Equations

DistillKac introduces a new fast image generation method using damped wave equations and Kac representation for finite-speed probability transport. Unlike diffusion models with potentially unstable reverse-time velocities, this approach enforces bounded kinetic energy and offers improved numerical stability with fewer function evaluations.

AINeutralarXiv – CS AI · Mar 34/104

🧠

GACA-DiT: Diffusion-based Dance-to-Music Generation with Genre-Adaptive Rhythm and Context-Aware Alignment

Researchers propose GACA-DiT, a new AI framework that generates music synchronized with dance movements using diffusion transformers. The system addresses limitations of existing methods by incorporating genre-adaptive rhythm extraction and context-aware temporal alignment for better synchronization between dance and music.

AIBullisharXiv – CS AI · Feb 274/105

🧠

AHBid: An Adaptable Hierarchical Bidding Framework for Cross-Channel Advertising

Researchers propose AHBid, a new hierarchical bidding framework for cross-channel advertising that combines generative planning with real-time control using diffusion models. The system achieved a 13.57% improvement in return on investment compared to existing methods in large-scale tests.

AINeutralarXiv – CS AI · Feb 274/103

🧠

TabDLM: Free-Form Tabular Data Generation via Joint Numerical-Language Diffusion

Researchers introduce TabDLM, a new AI framework that generates synthetic tabular data containing both numerical values and free-form text using joint numerical-language diffusion models. The approach addresses limitations of existing diffusion and LLM-based methods by combining masked diffusion for text with continuous diffusion for numbers, enabling better synthetic data generation for privacy and data augmentation applications.

AINeutralarXiv – CS AI · Feb 274/104

🧠

Instruction-based Image Editing with Planning, Reasoning, and Generation

Researchers propose a new multi-modality approach for instruction-based image editing that combines Chain-of-Thought planning, region reasoning, and generation capabilities. The method uses large language models and diffusion models to improve complex image editing tasks compared to existing single-modality approaches.

AIBullisharXiv – CS AI · Feb 274/105

🧠

DICArt: Advancing Category-level Articulated Object Pose Estimation in Discrete State-Spaces

Researchers introduced DICArt, a new AI framework for articulated object pose estimation that uses discrete diffusion processes instead of continuous space regression. The method incorporates kinematic constraints and hierarchical structure modeling to improve accuracy in estimating 6D poses of complex objects in embodied AI applications.

AINeutralHugging Face Blog · Feb 34/105

🧠

SegMoE: Segmind Mixture of Diffusion Experts

SegMoE (Segmind Mixture of Experts) represents a new approach to diffusion model architecture that combines multiple specialized expert models for improved image generation capabilities. This technical development in AI model design aims to enhance efficiency and quality in diffusion-based image synthesis.

AINeutralHugging Face Blog · Mar 34/107

🧠

ControlNet in 🧨 Diffusers

The article appears to be about ControlNet integration with Diffusers, a popular library for diffusion models in AI image generation. However, the article body is empty, making detailed analysis impossible.

AIBullisharXiv – CS AI · Mar 34/103

🧠

Disentangled Hierarchical VAE for 3D Human-Human Interaction Generation

Researchers have developed DHVAE (Disentangled Hierarchical Variational Autoencoder), a new AI model for generating realistic 3D human-human interactions. The system uses hierarchical latent diffusion and contrastive learning to create physically plausible interactions while maintaining computational efficiency.

AINeutralarXiv – CS AI · Mar 34/105

🧠

Physics-Consistent Diffusion for Efficient Fluid Super-Resolution via Multiscale Residual Correction

Researchers developed ReMD, a physics-consistent diffusion framework that improves fluid super-resolution by incorporating physical constraints and multiscale modeling. The approach addresses limitations of existing image and diffusion models when applied to fluid dynamics, achieving better accuracy and spectral fidelity with fewer sampling steps.

AINeutralarXiv – CS AI · Mar 34/105

🧠

Phys-Diff: A Physics-Inspired Latent Diffusion Model for Tropical Cyclone Forecasting

Researchers have developed Phys-Diff, a physics-inspired latent diffusion model for tropical cyclone forecasting that incorporates physical relationships between cyclone attributes. The model integrates multimodal data including historical cyclone data, ERA5 reanalysis, and FengWu forecast fields, achieving state-of-the-art performance on global and regional datasets.

AINeutralarXiv – CS AI · Mar 34/105

🧠

You Only Need One Stage: Novel-View Synthesis From A Single Blind Face Image

Researchers developed NVB-Face, a one-stage AI method that generates consistent novel-view face images directly from single low-quality images. The approach bypasses traditional two-stage restoration processes by using feature manipulation and diffusion models to create 3D-aware representations, significantly improving consistency and fidelity.

AINeutralarXiv – CS AI · Mar 24/105

🧠

Bridging Dynamics Gaps via Diffusion Schr\"odinger Bridge for Cross-Domain Reinforcement Learning

Researchers propose BDGxRL, a novel framework using Diffusion Schrödinger Bridge to enable reinforcement learning agents to transfer policies across different domains without direct target environment access. The method aligns source domain transitions with target dynamics through offline demonstrations and introduces reward modulation for consistent learning.

AINeutralGoogle Research Blog · Sep 193/107

🧠

Deep researcher with test-time diffusion

The article discusses 'Deep researcher with test-time diffusion' in the context of machine intelligence. However, the provided article body contains minimal content, making it difficult to extract specific technical details or implications.

AINeutralHugging Face Blog · Sep 133/104

🧠

Introducing Würstchen: Fast Diffusion for Image Generation

The article appears to introduce Würstchen, a new fast diffusion model for AI image generation. However, the article body is empty, preventing detailed analysis of the technology's capabilities or market impact.

AINeutralHugging Face Blog · Nov 253/106

🧠

Diffusion Models Live Event

The article title references a live event focused on diffusion models, which are AI technologies used for generating images, text, and other content. However, no article body content was provided to analyze specific details, speakers, or implications.

AINeutralHugging Face Blog · Nov 301/106

🧠

VQ-Diffusion

The article title references VQ-Diffusion, which appears to be related to AI diffusion models, but no article body content was provided for analysis.

← PrevPage 7 of 7