173 articles tagged with #diffusion-models. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.
AIBullisharXiv – CS AI · Mar 44/103
🧠Researchers propose DiSE, a self-evaluation method for diffusion large language models (dLLMs) that quantifies confidence by computing token regeneration probabilities. The method enables more efficient quality assessment and introduces a flexible-length generation framework that adaptively controls sequence length based on the model's self-assessment.
AINeutralarXiv – CS AI · Mar 44/102
🧠Researchers propose Diffusion-EXR, a new AI model that uses Denoising Diffusion Probabilistic Models (DDPM) to generate review text for explainable recommendation systems. The model corrupts review embeddings with Gaussian noise and learns to reconstruct them, achieving state-of-the-art performance on benchmark datasets for recommendation review generation.
AINeutralarXiv – CS AI · Mar 44/102
🧠Researchers developed an AI diffusion model to reconstruct missing terrain data from Martian satellite imagery for Virtual Reality space exploration applications. The method trained on 12,000 NASA HiRISE heightmaps outperformed traditional interpolation techniques by 4-15% in accuracy and 29-81% in perceptual similarity.
AIBullisharXiv – CS AI · Mar 35/105
🧠Researchers developed SMDIM, a new diffusion model for symbolic music generation that efficiently handles long sequences by combining global structure construction with local refinement. The model outperforms existing approaches in both generation quality and computational efficiency across various musical styles including Western classical, popular, and folk music.
$NEAR
AINeutralarXiv – CS AI · Mar 34/104
🧠MAGIC is a new AI framework for few-shot anomaly detection in industrial quality control that uses mask-guided inpainting to generate high-fidelity synthetic anomalies. The system introduces three key innovations: Gaussian prompt perturbation, spatially adaptive guidance, and context-aware mask alignment to improve anomaly generation while preserving normal regions.
AIBullisharXiv – CS AI · Mar 34/104
🧠Researchers propose TADSR, a Time-Aware one-step Diffusion Network that improves real-world image super-resolution by dynamically varying timesteps instead of using fixed ones. The method achieves state-of-the-art performance while allowing controllable trade-offs between image fidelity and realism in a single processing step.
AINeutralarXiv – CS AI · Mar 34/103
🧠DistillKac introduces a new fast image generation method using damped wave equations and Kac representation for finite-speed probability transport. Unlike diffusion models with potentially unstable reverse-time velocities, this approach enforces bounded kinetic energy and offers improved numerical stability with fewer function evaluations.
AINeutralarXiv – CS AI · Mar 34/104
🧠Researchers propose GACA-DiT, a new AI framework that generates music synchronized with dance movements using diffusion transformers. The system addresses limitations of existing methods by incorporating genre-adaptive rhythm extraction and context-aware temporal alignment for better synchronization between dance and music.
AIBullisharXiv – CS AI · Feb 274/105
🧠Researchers propose AHBid, a new hierarchical bidding framework for cross-channel advertising that combines generative planning with real-time control using diffusion models. The system achieved a 13.57% improvement in return on investment compared to existing methods in large-scale tests.
AINeutralarXiv – CS AI · Feb 274/103
🧠Researchers introduce TabDLM, a new AI framework that generates synthetic tabular data containing both numerical values and free-form text using joint numerical-language diffusion models. The approach addresses limitations of existing diffusion and LLM-based methods by combining masked diffusion for text with continuous diffusion for numbers, enabling better synthetic data generation for privacy and data augmentation applications.
AINeutralarXiv – CS AI · Feb 274/104
🧠Researchers propose a new multi-modality approach for instruction-based image editing that combines Chain-of-Thought planning, region reasoning, and generation capabilities. The method uses large language models and diffusion models to improve complex image editing tasks compared to existing single-modality approaches.
AIBullisharXiv – CS AI · Feb 274/105
🧠Researchers introduced DICArt, a new AI framework for articulated object pose estimation that uses discrete diffusion processes instead of continuous space regression. The method incorporates kinematic constraints and hierarchical structure modeling to improve accuracy in estimating 6D poses of complex objects in embodied AI applications.
AINeutralHugging Face Blog · Feb 34/105
🧠SegMoE (Segmind Mixture of Experts) represents a new approach to diffusion model architecture that combines multiple specialized expert models for improved image generation capabilities. This technical development in AI model design aims to enhance efficiency and quality in diffusion-based image synthesis.
AINeutralHugging Face Blog · Mar 34/107
🧠The article appears to be about ControlNet integration with Diffusers, a popular library for diffusion models in AI image generation. However, the article body is empty, making detailed analysis impossible.
AIBullisharXiv – CS AI · Mar 34/103
🧠Researchers have developed DHVAE (Disentangled Hierarchical Variational Autoencoder), a new AI model for generating realistic 3D human-human interactions. The system uses hierarchical latent diffusion and contrastive learning to create physically plausible interactions while maintaining computational efficiency.
AINeutralarXiv – CS AI · Mar 34/105
🧠Researchers developed ReMD, a physics-consistent diffusion framework that improves fluid super-resolution by incorporating physical constraints and multiscale modeling. The approach addresses limitations of existing image and diffusion models when applied to fluid dynamics, achieving better accuracy and spectral fidelity with fewer sampling steps.
AINeutralarXiv – CS AI · Mar 34/105
🧠Researchers have developed Phys-Diff, a physics-inspired latent diffusion model for tropical cyclone forecasting that incorporates physical relationships between cyclone attributes. The model integrates multimodal data including historical cyclone data, ERA5 reanalysis, and FengWu forecast fields, achieving state-of-the-art performance on global and regional datasets.
AINeutralarXiv – CS AI · Mar 34/105
🧠Researchers developed NVB-Face, a one-stage AI method that generates consistent novel-view face images directly from single low-quality images. The approach bypasses traditional two-stage restoration processes by using feature manipulation and diffusion models to create 3D-aware representations, significantly improving consistency and fidelity.
AINeutralarXiv – CS AI · Mar 24/105
🧠Researchers propose BDGxRL, a novel framework using Diffusion Schrödinger Bridge to enable reinforcement learning agents to transfer policies across different domains without direct target environment access. The method aligns source domain transitions with target dynamics through offline demonstrations and introduces reward modulation for consistent learning.
AINeutralGoogle Research Blog · Sep 193/107
🧠The article discusses 'Deep researcher with test-time diffusion' in the context of machine intelligence. However, the provided article body contains minimal content, making it difficult to extract specific technical details or implications.
AINeutralHugging Face Blog · Sep 133/104
🧠The article appears to introduce Würstchen, a new fast diffusion model for AI image generation. However, the article body is empty, preventing detailed analysis of the technology's capabilities or market impact.
AINeutralHugging Face Blog · Nov 253/106
🧠The article title references a live event focused on diffusion models, which are AI technologies used for generating images, text, and other content. However, no article body content was provided to analyze specific details, speakers, or implications.
AINeutralHugging Face Blog · Nov 301/106
🧠The article title references VQ-Diffusion, which appears to be related to AI diffusion models, but no article body content was provided for analysis.