#stable-diffusion News & Analysis

40 articles tagged with #stable-diffusion. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

40 articles

AIBullisharXiv – CS AI · 4d ago7/10

🧠

LIFT and PLACE: A Simple, Stable, and Effective Knowledge Distillation Framework for Lightweight Diffusion Models

Researchers propose LIFT and PLACE, a knowledge distillation framework that enables stable training of extremely lightweight diffusion models by decomposing the teacher's complex denoising process into coarse and fine stages with spatially adaptive guidance. The method achieves stable convergence even at extreme compression ratios (1.6% of teacher size) where conventional distillation fails, with potential applications across image generation, latent diffusion, and flow-based models.

AIBullisharXiv – CS AI · May 117/10

🧠

Flow-OPD: On-Policy Distillation for Flow Matching Models

Researchers introduce Flow-OPD, a post-training framework that applies on-policy distillation to Flow Matching text-to-image models, addressing reward sparsity and gradient interference problems. Built on Stable Diffusion 3.5 Medium, the method achieves significant performance gains—GenEval scores improve from 63 to 92 and OCR accuracy from 59 to 94—while maintaining image quality and surpassing individual teacher models.

🧠 Stable Diffusion

AIBearisharXiv – CS AI · May 47/10

🧠

The Algorithmic Gaze of Image Quality Assessment: An Audit and Trace Ethnography of the LAION-Aesthetics Predictor

Researchers audited LAION-Aesthetics Predictor (LAP), an algorithmic model widely used to filter training datasets for visual generative AI systems like Stable Diffusion. The audit reveals LAP systematically biases toward images of women while filtering out men and LGBTQ+ individuals, and reinforces Western artistic preferences, raising critical questions about whose aesthetic values shape AI-generated imagery.

🧠 Stable Diffusion

AIBearisharXiv – CS AI · Apr 207/10

🧠

Noise Aggregation Analysis Driven by Small-Noise Injection: Efficient Membership Inference for Diffusion Models

Researchers have developed a novel membership inference attack against diffusion models that uses noise aggregation analysis and small-noise injection to determine whether specific data samples were included in training datasets. The method significantly reduces computational costs while improving accuracy compared to existing approaches, highlighting emerging privacy vulnerabilities in widely-deployed generative AI systems like Stable Diffusion.

🧠 Stable Diffusion

AIBullisharXiv – CS AI · Mar 177/10

🧠

MapReduce LoRA: Advancing the Pareto Front in Multi-Preference Optimization for Generative Models

Researchers introduce MapReduce LoRA and Reward-aware Token Embedding (RaTE) to optimize multiple preferences in generative AI models without degrading performance across dimensions. The methods show significant improvements across text-to-image, text-to-video, and language tasks, with gains ranging from 4.3% to 136.7% on various benchmarks.

🧠 Llama🧠 Stable Diffusion

AIBullisharXiv – CS AI · Mar 47/102

🧠

Fine-Tuning Diffusion Models via Intermediate Distribution Shaping

Researchers present P-GRAFT, a new method for fine-tuning diffusion models by shaping distributions at intermediate noise levels, showing improved performance on text-to-image generation tasks. The framework achieved an 8.81% relative improvement over base Stable Diffusion v2 model on popular benchmarks.

AINeutralarXiv – CS AI · 3d ago6/10

🧠

Robust and Generalizable Safety Steering for Text-to-Image Diffusion Transformers

Researchers introduce SafeDIG, a safety steering framework designed to make text-to-image diffusion transformers like FLUX.1 and Stable Diffusion 3.5 resistant to generating harmful content. The method uses sparse autoencoders and adaptive decoding to maintain safety controls across different risk domains while preserving image quality.

🧠 Stable Diffusion

AINeutralarXiv – CS AI · 3d ago6/10

🧠

Alignment-Guided Score Matching for Text-to-Image Alignment in Diffusion Models

Researchers propose Alignment-Guided Score Matching (AGSM), a reward-free post-training method that improves text-to-image alignment in diffusion models by integrating contrastive guidance into the score-matching objective. The approach addresses failure cases like over-counting and repetition in existing methods, achieving 35% improvement in counting accuracy while remaining compatible with major diffusion model architectures.

AINeutralarXiv – CS AI · 4d ago6/10

🧠

Residualized Temporal Sparse Autoencoders for Interpreting Diffusion Models

Researchers introduce residualized temporal sparse autoencoders (SAEs) to interpret how text-to-image diffusion models generate images over time. By analyzing activation trajectories across the denoising process rather than static snapshots, the method captures interpretable features that go beyond simple linear predictability, enabling better understanding of model internals.

🧠 Stable Diffusion

AINeutralarXiv – CS AI · 5d ago6/10

🧠

CFG-OEC: Classifier Free Guidance with Orthogonal Error Correction

Researchers propose CFG-OEC, an improvement to classifier-free guidance in diffusion models that corrects structural sampling errors caused by misalignment between training objectives and sampling procedures. The method demonstrates improved image generation quality on Stable Diffusion models, achieving better FID and CLIP scores than existing approaches.

🧠 Stable Diffusion

AINeutralarXiv – CS AI · May 116/10

🧠

Supervised sparse auto-encoders for interpretable and compositional representations

Researchers have developed supervised sparse auto-encoders (SAEs) that improve mechanistic interpretability of neural networks by addressing non-smoothness issues in L1 penalties and aligning learned features with human semantics. Validated on Stable Diffusion 3.5, the method enables compositional generalization and feature-level interventions for semantic image editing without prompt modification.

🧠 Stable Diffusion

AINeutralarXiv – CS AI · Apr 146/10

🧠

GLEaN: A Text-to-image Bias Detection Approach for Public Comprehension

Researchers introduce GLEaN, a visual explainability method that transforms complex AI bias detection into understandable portrait composites, enabling non-technical audiences to grasp how text-to-image models like Stable Diffusion XL associate occupations and identities with specific demographic characteristics.

🧠 Stable Diffusion

AIBullisharXiv – CS AI · Apr 146/10

🧠

Closed-Form Concept Erasure via Double Projections

Researchers present a novel closed-form method for concept erasure in generative AI models that removes unwanted concepts without iterative training. The technique uses linear transformations and two sequential projection steps to safely edit pretrained models like Stable Diffusion and FLUX while preserving unrelated concepts, completing the process in seconds.

🧠 Stable Diffusion

AIBullisharXiv – CS AI · Mar 176/10

🧠

RAZOR: Ratio-Aware Layer Editing for Targeted Unlearning in Vision Transformers and Diffusion Models

Researchers introduce RAZOR, a new framework for efficiently removing sensitive information from AI models like CLIP and Stable Diffusion without requiring full retraining. The method selectively edits specific layers and attention heads in transformer models to achieve targeted 'unlearning' while preserving overall performance.

🧠 Stable Diffusion

AIBullisharXiv – CS AI · Mar 36/104

🧠

DragFlow: Unleashing DiT Priors with Region Based Supervision for Drag Editing

DragFlow introduces the first framework to leverage FLUX's DiT priors for drag-based image editing, addressing distortion issues that plagued earlier Stable Diffusion-based approaches. The system uses region-based editing with affine transformations instead of point-based supervision, achieving state-of-the-art results on benchmarks.

AINeutralarXiv – CS AI · Mar 37/107

🧠

Forgetting is Competition: Rethinking Unlearning as Representation Interference in Diffusion Models

Researchers introduce SurgUn, a surgical unlearning method for text-to-image diffusion models that enables precise removal of specific visual concepts while preserving other capabilities. The approach addresses challenges in copyright compliance and content policy enforcement by applying targeted weight-space updates based on retroactive interference theory.

AIBullishHugging Face Blog · Aug 16/106

🧠

Open-sourcing Knowledge Distillation Code and Weights of SD-Small and SD-Tiny

Stability AI has open-sourced knowledge distillation code and model weights for SD-Small and SD-Tiny, making smaller and more efficient versions of Stable Diffusion available to the community. This release enables developers to run image generation models with reduced computational requirements while maintaining reasonable quality.

AIBullishHugging Face Blog · Jun 156/105

🧠

Faster Stable Diffusion with Core ML on iPhone, iPad, and Mac

Apple has announced faster Stable Diffusion implementation using Core ML framework for iPhone, iPad, and Mac devices. This development enables on-device AI image generation with improved performance and efficiency across Apple's ecosystem.

AIBullishHugging Face Blog · May 256/106

🧠

Optimizing Stable Diffusion for Intel CPUs with NNCF and 🤗 Optimum

Intel has released optimization techniques for running Stable Diffusion AI models on CPUs using NNCF (Neural Network Compression Framework) and Hugging Face Optimum. These optimizations aim to improve performance and reduce computational requirements for AI image generation on Intel hardware without requiring expensive GPUs.

AIBullishHugging Face Blog · May 236/105

🧠

Instruction-tuning Stable Diffusion with InstructPix2Pix

The article discusses InstructPix2Pix, a method for instruction-tuning Stable Diffusion models to enable text-guided image editing. This technique allows users to provide natural language instructions to modify existing images rather than generating new ones from scratch.

AIBullisharXiv – CS AI · Mar 34/104

🧠

Time-Aware One Step Diffusion Network for Real-World Image Super-Resolution

Researchers propose TADSR, a Time-Aware one-step Diffusion Network that improves real-world image super-resolution by dynamically varying timesteps instead of using fixed ones. The method achieves state-of-the-art performance while allowing controllable trade-offs between image fidelity and realism in a single processing step.

AIBullishHugging Face Blog · Oct 225/105

🧠

Diffusers welcomes Stable Diffusion 3.5 Large

The article title indicates that Diffusers, a popular machine learning library, has added support for Stable Diffusion 3.5 Large model. However, no article body content was provided for analysis.

AIBullishHugging Face Blog · Jan 155/104

🧠

Accelerating SD Turbo and SDXL Turbo Inference with ONNX Runtime and Olive

The article discusses optimization techniques for accelerating SD Turbo and SDXL Turbo inference using ONNX Runtime and Olive. These tools provide performance improvements for running Stable Diffusion models more efficiently.

AIBullishHugging Face Blog · Oct 35/105

🧠

🧨 Accelerating Stable Diffusion XL Inference with JAX on Cloud TPU v5e

Google demonstrates accelerated inference performance for Stable Diffusion XL using JAX framework on their Cloud TPU v5e hardware. This technical advancement showcases improved efficiency for AI image generation workloads on Google's cloud infrastructure.

AINeutralHugging Face Blog · Sep 294/107

🧠

Finetune Stable Diffusion Models with DDPO via TRL

The article appears to be about finetuning Stable Diffusion models using DDPO (likely Denoising Diffusion Policy Optimization) via TRL (Transformer Reinforcement Learning). However, the article body is empty, preventing detailed analysis of the technical implementation or implications.

Page 1 of 2Next →