#generative-models News & Analysis

150 articles tagged with #generative-models. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

150 articles

AINeutralarXiv – CS AI · Jun 116/10

🧠

ConsistencyPlanner: Real-time Planning with Fast-Sampling Consistency Models

ConsistencyPlanner introduces a real-time planning framework for autonomous driving that combines fast-sampling consistency models with heterogeneous feature fusion to balance multimodal driving behavior prediction and computational efficiency. The approach demonstrates improved safety metrics in the Waymax simulator compared to existing methods, addressing a key limitation in learning-based autonomous driving systems.

AINeutralarXiv – CS AI · Jun 116/10

🧠

A New Perspective on Precision and Recall for Generative Models

Researchers present a new statistical framework for evaluating generative models by estimating Precision-Recall curves through a binary classification approach. The work provides theoretical guarantees including minimax upper bounds on estimation risk and unifies several existing PR metrics under a single framework.

AINeutralarXiv – CS AI · Jun 106/10

🧠

Learning Where to Simulate: Generative Active Sampling for Online PDE Surrogate Training

Researchers introduce Online Generative Active Sampling (OGAS), an active learning method that improves PDE surrogate models by strategically sampling challenging configurations during training. Using a parallel diffusion model to steer data generation toward difficult regimes, OGAS reduces worst-case prediction errors across multiple PDE types without significant computational overhead.

AINeutralarXiv – CS AI · Jun 106/10

🧠

A Theory on Flow Matching with Neural Networks

Researchers develop theoretical foundations for flow matching, a generative modeling technique using neural networks, establishing convergence guarantees and generalization bounds that validate the approach through experiments. This work bridges the gap between practical flow-matching implementations and rigorous mathematical theory, demonstrating the reliability of neural network-based conditional velocity fields for generating high-quality samples.

AINeutralarXiv – CS AI · Jun 106/10

🧠

Flexible Flows for Biological Sequence Design

Researchers introduce Flexible Flows, an advanced generative framework for designing biological sequences using Discrete Flow Matching with structured couplings and latent edit-based parameterization. The method enables variable-length DNA and peptide sequence generation with fine-grained control while achieving state-of-the-art performance across multiple biological design tasks.

AINeutralarXiv – CS AI · Jun 106/10

🧠

Deep Generative Model for Human Mobility Behavior

Researchers introduce MobilityGen, a diffusion-based generative model that simulates detailed human mobility patterns across days to weeks at large spatial scales. The framework reproduces real-world mobility behaviors including location visit scaling laws, activity time allocation, and travel mode choices, enabling new analyses of urban accessibility and social segregation dynamics.

AINeutralarXiv – CS AI · Jun 96/10

🧠

No Free Lunch for Synthetic Images under Data Scarcity Conditions

Researchers evaluated trade-offs between fidelity, privacy, and utility in synthetic image generation across VAE, GAN, and DDPM models under data scarcity conditions. The study reveals that GANs and DDPMs maintain performance better than VAEs when differential privacy mechanisms are applied, suggesting no single generative model excels across all three dimensions simultaneously.

AINeutralarXiv – CS AI · Jun 96/10

🧠

TRACER: Token ReAssignment for Concept ERasure in Generative Recommendation

Researchers introduce TRACER, a novel framework for removing sensitive concepts from generative recommendation systems while preserving overall utility. The method uses token reassignment to handle the unique challenge that semantic IDs in recommendation systems are shared across items to forget and retain, unlike discrete tokens in language models.

AINeutralarXiv – CS AI · Jun 95/10

🧠

Generative Frontier Planning for Adaptive Peer-Referral Recruitment under Covariate-Dependent Arrivals

Researchers propose Generative Frontier Planning (GFP), a novel algorithm for optimizing peer-referral recruitment in hidden populations by modeling realistic homophily effects and covariate-dependent arrivals. The method outperforms existing baselines by using deterministic backups over generative models rather than Monte-Carlo sampling, achieving near-optimal resource allocation for public health interventions.

AINeutralarXiv – CS AI · Jun 96/10

🧠

BioVid: Autoregressive Video Generation with Biological Behavior Semantic Comprehension

BioVid introduces an autoregressive video generation framework that learns temporal structure from behavioral data rather than using fixed frame counts. The system uses a specialized tokenizer and transformer architecture to naturally determine when behavioral sequences end, matching real-world action duration distributions significantly better than existing methods.

AINeutralarXiv – CS AI · Jun 96/10

🧠

BSTabDiff: Block-Subunit Diffusion Priors for High-Dimensional Tabular Data Generation

Researchers introduce BSTabDiff, a generative framework designed to create synthetic high-dimensional tabular data with limited samples by partitioning features into latent blocks and using diffusion priors. The method addresses challenges in domains like genomics where data is sparse relative to feature count, producing more realistic synthetic data than existing approaches.

AINeutralarXiv – CS AI · Jun 96/10

🧠

MeCo: One-Step MeanFlow-based Corrector for Multi-Channel Speech Separation

Researchers propose MeCo, a MeanFlow-based generative corrector that improves multi-channel speech separation by refining discriminative model outputs in a single step. The method combines Data-Space Optimization with specialized loss functions to achieve state-of-the-art performance in both signal fidelity and human listening quality with minimal computational cost.

AINeutralarXiv – CS AI · Jun 96/10

🧠

Generation Properties of Stochastic Interpolation under Finite Training Set

Researchers derive closed-form expressions for optimal velocity fields in stochastic interpolation generative models trained on finite datasets, demonstrating that deterministic processes exactly recover training samples while stochastic processes add Gaussian noise. The work formalizes underfitting and overfitting for generative models, showing that estimation errors produce convex combinations of training samples with mixed noise corruption.

AINeutralarXiv – CS AI · Jun 96/10

🧠

The CIFAR Synthetic Evidence Corpus for Detecting AI-Generated Evidence

Researchers introduce CIFAR, a synthetic evidence corpus dataset designed to detect AI-generated fraudulent documents in legal proceedings. The dataset addresses a critical gap by providing training data for systems that can identify subtle, localized document alterations that preserve plausibility while changing legal meaning—a challenge existing detection tools cannot adequately handle.

AINeutralarXiv – CS AI · Jun 86/10

🧠

CAF-Gen: A Multi-Agent System for Enriching Argumentation Structures

CAF-Gen is a new multi-agent AI system that automatically enriches basic argument structures into complex, formally-structured argumentation models using the Carneades Argumentation Framework. The iterative Creator-Reviewer pipeline improves reasoning formalization in computational linguistics by validating outputs through collaborative feedback loops rather than single-pass generation.

AINeutralarXiv – CS AI · Jun 86/10

🧠

Breaking the Lock-in: Diversifying Text-to-Image Generation via Representation Modulation

Researchers present DAVE, a training-free method that enhances diversity in text-to-image generation by attenuating the DC (zero-frequency) component of intermediate Transformer features during early generation stages. The technique addresses the problem of identical outputs from the same prompt without requiring expensive sampling overhead or auxiliary optimization.

AINeutralarXiv – CS AI · Jun 86/10

🧠

Beyond Skeletons: Learning Animation Directly from Driving Videos with Same2X Training Strategy

DirectAnimator is a new AI framework that generates human animations from static images by learning directly from driving videos, eliminating reliance on potentially error-prone pose estimators. The system introduces a Same2X training strategy that improves cross-identity animation while maintaining computational efficiency and robustness to occlusions.

AIBullisharXiv – CS AI · Jun 86/10

🧠

MatterDoor: Sampling Zero-shot Spatio-semantic Priors using Generative Models

Researchers introduce MatterDoor, a method enabling autonomous robots to infer hidden room structure and semantics from doorway-occluded views using pretrained generative vision models without task-specific training. The approach combines VLM-guided outpainting, depth estimation, and semantic segmentation to generate 3D hypotheses of unobserved spaces, evaluated on a new Matterport3D-derived benchmark for robot navigation and object-reaching tasks.

AIBullisharXiv – CS AI · Jun 56/10

🧠

DiG-Plan: Mitigating Early Commitment for Tool-Graph Planning via Diffusion Guidance

Researchers introduce DiG-Plan, a novel framework addressing the early commitment problem in tool-graph planning by combining diffusion-based proposal generation with autoregressive refinement. The approach improves solution coverage from 32% to 94.3% and delivers 10% relative gains over traditional autoregressive baselines on TaskBench benchmarks.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Where Should Knowledge Enter? A Layered Framework for Knowledge Infusion in Multimodal Iterative Generative Mo

Researchers propose a four-layer framework for knowledge infusion in multimodal generative models, categorizing intervention points as surface, trajectory, latent, and parametric. Testing on diffusion models with safety constraints demonstrates that cumulative multi-layer approaches reduce knowledge-violating outputs by 71%, showing each layer addresses distinct failure modes.

AINeutralarXiv – CS AI · Jun 56/10

🧠

OneReason Technical Report

OneReason introduces a novel framework for improving reasoning capabilities in generative recommendation models by addressing perception and cognition limitations. The approach combines semantic grounding of item tokens with multi-level chain-of-thought sequences, demonstrating that effective reasoning requires both language understanding and coherent interest modeling rather than scaling alone.

AIBullisharXiv – CS AI · Jun 46/10

🧠

Scaling Novel Graph Generation via Lightweight Structure-Guided Autoregressive Models

Researchers propose a lightweight autoregressive framework for graph generation that achieves near log-linear complexity by using structure-guided topological ordering, addressing scalability limitations in current diffusion and autoregressive models. The two-phase training strategy reduces overfitting and promotes novel graph generation while maintaining validity, with applications spanning molecular discovery, circuit design, and cybersecurity.

AINeutralarXiv – CS AI · Jun 46/10

🧠

GeM-NR: Geometry-Aware Multi-View Editing for Nonrigid Scene Changes

GeM-NR is a new training-free method for multi-view consistent image editing that handles nonrigid scene changes—edits that significantly alter geometry and appearance. The approach works by using an edited anchor image to guide consistent edits across multiple viewpoints, addressing a major limitation in existing generative image editing systems.

AINeutralarXiv – CS AI · Jun 46/10

🧠

Markov Chain Decoders Overcome the Heavy-Tail Limitations of Lipschitz Generative Models

Researchers demonstrate that standard generative models cannot produce heavy-tailed distributions due to Gaussian decoder limitations and Lipschitz constraints. They propose replacing Gaussian decoders with Phase-Type distributions based on Markov chains, achieving up to 10x improvement in extreme quantile error for heavy-tailed data generation.

AINeutralarXiv – CS AI · Jun 36/10

🧠

When Helping Hurts and How to Fix It: Multi-Agent Debate for Data Cleaning

Researchers identify when multi-agent debate helps or hurts data cleaning tasks, finding it degrades generation quality but improves error detection. They establish a mathematical condition predicting debate effectiveness and demonstrate that adversarial separation with code-execution grounding can overcome critique-induced confusion, achieving the first significant improvement on generative tasks.

← PrevPage 3 of 6Next →