#generative-models News & Analysis

150 articles tagged with #generative-models. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

150 articles

AIBullisharXiv – CS AI · Apr 147/10

🧠

Bringing Value Models Back: Generative Critics for Value Modeling in LLM Reinforcement Learning

Researchers propose Generative Actor-Critic (GenAC), a new approach to value modeling in large language model reinforcement learning that uses chain-of-thought reasoning instead of one-shot scalar predictions. The method addresses a longstanding challenge in credit assignment by improving value approximation and downstream RL performance compared to existing value-based and value-free baselines.

AIBullisharXiv – CS AI · Mar 127/10

🧠

Gradient Flow Drifting: Generative Modeling via Wasserstein Gradient Flows of KDE-Approximated Divergences

Researchers introduce Gradient Flow Drifting, a new mathematical framework for generative AI models that connects the Drifting Model to Wasserstein gradient flows of KL divergence under kernel density estimation. The framework includes a mixed-divergence strategy to avoid mode collapse and extends to Riemannian manifolds for improved semantic space applications.

$KL

AIBullisharXiv – CS AI · Mar 97/10

🧠

Generative Predictive Control: Flow Matching Policies for Dynamic and Difficult-to-Demonstrate Tasks

Researchers introduce generative predictive control, a new AI framework that enables robots to perform fast, dynamic tasks without requiring expert demonstrations. The method uses flow matching policies that can handle high-frequency feedback and maintain temporal consistency, addressing key limitations of current robotics approaches.

AINeutralarXiv – CS AI · Mar 57/10

🧠

InEdit-Bench: Benchmarking Intermediate Logical Pathways for Intelligent Image Editing Models

Researchers introduced InEdit-Bench, the first evaluation benchmark specifically designed to test image editing models' ability to reason through intermediate logical pathways in multi-step visual transformations. Testing 14 representative models revealed significant shortcomings in handling complex scenarios requiring dynamic reasoning and procedural understanding.

AIBullisharXiv – CS AI · Mar 57/10

🧠

MPFlow: Multi-modal Posterior-Guided Flow Matching for Zero-Shot MRI Reconstruction

Researchers developed MPFlow, a new zero-shot MRI reconstruction framework that uses multi-modal data and rectified flow to improve medical imaging quality. The system reduces tumor hallucinations by 15% while using 80% fewer sampling steps compared to existing diffusion methods, potentially advancing AI applications in medical diagnostics.

AIBullisharXiv – CS AI · Mar 46/102

🧠

CoBELa: Steering Transparent Generation via Concept Bottlenecks on Energy Landscapes

Researchers introduce CoBELa, a new AI framework for interpretable image generation that uses concept bottlenecks on energy landscapes to enable transparent, controllable synthesis without requiring decoder retraining. The system achieves strong performance on benchmark datasets while allowing users to compositionally manipulate concepts through energy function combinations.

AIBullisharXiv – CS AI · Mar 46/102

🧠

Rigidity-Aware Geometric Pretraining for Protein Design and Conformational Ensembles

Researchers introduce RigidSSL, a new geometric pretraining framework for protein design that improves designability by up to 43% and enhances success rates in protein generation tasks. The two-phase approach combines geometric learning from 432K protein structures with molecular dynamics refinement to better capture protein conformational dynamics.

AINeutralarXiv – CS AI · Mar 47/103

🧠

Unsupervised Representation Learning -- an Invariant Risk Minimization Perspective

Researchers propose a new unsupervised framework for Invariant Risk Minimization (IRM) that learns robust representations without labeled data. The approach introduces two methods - Principal Invariant Component Analysis (PICA) and Variational Invariant Autoencoder (VIAE) - that can capture invariant structures across different environments using only unlabeled data.

AIBullisharXiv – CS AI · Mar 37/104

🧠

Dense-Jump Flow Matching with Non-Uniform Time Scheduling for Robotic Policies: Mitigating Multi-Step Inference Degradation

Researchers developed a new robotic policy framework using dense-jump flow matching with non-uniform time scheduling to address performance degradation in multi-step inference. The approach achieves up to 23.7% performance gains over existing baselines by optimizing integration scheduling during training and inference phases.

AIBullisharXiv – CS AI · Feb 277/103

🧠

Controlling Exploration-Exploitation in GFlowNets via Markov Chain Perspectives

Researchers introduce α-GFNs, an enhanced version of Generative Flow Networks that allows tunable control over exploration-exploitation dynamics through a parameter α. The method achieves up to 10× improvement in mode discovery across various benchmarks by addressing constraints in traditional GFlowNet objectives through Markov chain theory.

$LINK

AINeutralarXiv – CS AI · Jun 256/10

🧠

Attractive and Repulsive Pattern Control in Sequence Generation

Researchers introduce a signed pattern control mechanism for variable-order Markov sequence generation that reduces unwanted repetition and controls text generation quality through weighted recurrence automata and belief propagation sampling. Testing on musical sequences from Bach, Telemann, and jazz databases demonstrates the method effectively decreases self-reuse while maintaining coherence and training data fidelity.

AINeutralarXiv – CS AI · Jun 236/10

🧠

A Generative Model for Closed-Loop Microsimulation of Signalized Intersections

Researchers present Enactor, a generative AI model designed to simulate vehicle behavior at signalized intersections with improved accuracy over existing methods. The model uses transformer-based architecture to predict vehicle trajectories in closed-loop simulations, achieving significantly better performance on safety metrics and traffic flow distribution compared to baseline approaches.

AINeutralarXiv – CS AI · Jun 236/10

🧠

Unsupervised Disentanglement Without Compromises : How Functional Orthogonality Enforces Identifiability

Researchers present a novel approach to unsupervised disentangled representation learning using functional orthogonality constraints on the Jacobian of generative models. The method proves identifiability of nonlinear generative models without requiring statistical independence or causal assumptions, challenging previous impossibility claims in the field.

AIBullisharXiv – CS AI · Jun 236/10

🧠

Predictive Feature Caching for Training-free Acceleration of Molecular Geometry Generation

Researchers propose a training-free caching strategy that accelerates molecular geometry generation in flow matching models by predicting intermediate hidden states, achieving 2-7x speedups without quality degradation. The method is compatible with pretrained models and compounds with existing optimizations, addressing a critical inference bottleneck in computational chemistry workflows.

AINeutralarXiv – CS AI · Jun 235/10

🧠

The Impact of VAE Design on Latent Pose Representations for Diffusion-based Sign Language Production

Researchers investigate how variational autoencoder (VAE) design choices affect latent space properties in sign language production systems using diffusion models. Testing on the Phoenix14T dataset reveals that downstream generative performance correlates more strongly with latent space structure than with traditional reconstruction metrics, suggesting current evaluation methods may miss critical factors influencing model quality.

AINeutralarXiv – CS AI · Jun 236/10

🧠

Delta-Diffusion: Modeling Longitudinal Brain Amyloid-PET Trajectories via Conditional Poisson Diffusion Bridge

Researchers introduce Delta-Diffusion, a novel AI framework using conditional Poisson Diffusion Bridges to synthesize longitudinal brain PET imaging for tracking amyloid accumulation in neurodegenerative diseases. The method addresses limitations of existing generative models by anchoring predictions to baseline patient scans and incorporating clinical progression patterns, potentially reducing the need for costly repeated imaging procedures.

AINeutralarXiv – CS AI · Jun 236/10

🧠

Discrete State Diffusion Models: A Sample Complexity Perspective

Researchers present the first theoretical framework establishing sample complexity bounds for discrete-state diffusion models, a fundamental gap in AI research. The work provides an $\widetilde{\mathcal{O}}(\epsilon^{-2})$ sample complexity bound and decomposes score estimation error into four components, advancing understanding of how these models can be trained efficiently for text and combinatorial applications.

AINeutralarXiv – CS AI · Jun 236/10

🧠

Generative Robust Optimisation

Researchers introduce Generative Robust Optimisation (GRO), a framework using deep generative models to define uncertainty sets for optimization problems that better capture real-world data complexity than traditional geometric approaches. The method combines neural network decoders with a five-point evaluation framework and demonstrates practical applicability through production planning and facility location studies.

AINeutralarXiv – CS AI · Jun 236/10

🧠

Discovering Crystal Structure Prediction Algorithms with an AI Co-Scientist

Researchers introduced HACO, a Human-AI co-discovery system that identified MaskGIT, a vision-based masked generative model, as an effective framework for crystal structure prediction. The resulting MaskGXT model achieved 79.06% accuracy on MP-20 benchmarks, outperforming previous baselines by 8.19 percentage points, demonstrating how AI systems can transfer learning across scientific domains when guided by human expertise.

AINeutralarXiv – CS AI · Jun 196/10

🧠

RIVET: Robust Idempotent Voice Attribute Editing

Researchers introduce RIVET, a training framework that uses idempotency constraints to improve voice attribute editing models' robustness to noisy or inconsistent labels in large-scale speech datasets. By enforcing the property that repeated applications produce identical results, the method acts as an implicit regularizer that reduces sensitivity to mislabeled training data while preserving speaker identity.

AINeutralarXiv – CS AI · Jun 196/10

🧠

Co-policy: Responsive Human-Robot Co-Creation for Musical Performances

Researchers introduce Co-policy, a framework enabling robots to participate in real-time musical co-creation with humans by combining semantic understanding with physically executable performance. The system uses a fine-tuned vision-language model and a Gaussian-Mixture Visuomotor Policy to generate complementary musical responses rather than merely reproducing user input, demonstrating improved performance over existing diffusion-policy approaches.

AINeutralarXiv – CS AI · Jun 196/10

🧠

A Deep Generative Model for Resting-State EEG Synthesis and Transferable Representation Learning

REST-GAN introduces a generative adversarial network framework for synthesizing resting-state EEG signals while learning transferable representations without manual feature engineering. The model demonstrates strong performance in reproducing key EEG properties and outperforms direct raw-signal approaches on demographic classification tasks, offering a computationally efficient alternative to existing EEG analysis methods.

AINeutralarXiv – CS AI · Jun 196/10

🧠

BrainG3N: A Dual-Purpose Tokenizer for Controllable 3D Brain MRI Generation

Researchers introduced BrainG3N, a dual-purpose tokenizer combining a masked autoencoder encoder with a CNN decoder to generate clinically informative 3D brain MRI images. Pretrained on over 35,000 volumes across multiple disease categories and acquisition sites, the model simultaneously excels at downstream clinical tasks and enables controllable, conditional medical image generation.

AINeutralarXiv – CS AI · Jun 196/10

🧠

Residual-Space Evolutionary Optimization via Flow-based Generative Models

Researchers introduce residual-space evolutionary optimization, a framework combining flow-based generative models with evolutionary algorithms to enable data editing without requiring differentiable objectives or gradient-based optimization. The method separates local refinement and broad exploration through self-pollination and cross-pollination mechanisms, validated on image benchmarks and crystal structure data.

AINeutralarXiv – CS AI · Jun 116/10

🧠

FreeBridge: Variational Schr\"odinger Bridges for Cellular Transition Dynamics

FreeBridge, a new computational method based on Schrödinger Bridges, addresses a fundamental challenge in cellular biology by inferring continuous cell transition pathways from static snapshots. The approach constrains predicted intermediate cell states to geometrically valid regions observed in real data, improving both accuracy and biological interpretability in perturbation modeling across multiple imaging datasets.

← PrevPage 2 of 6Next →