y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#non-autoregressive News & Analysis

8 articles tagged with #non-autoregressive. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

8 articles
AIBullisharXiv – CS AI · Jun 107/10
🧠

Whisfusion: Parallel ASR Decoding with Masked Diffusion

Whisfusion introduces a masked diffusion decoder that achieves faster speech-to-text processing than Whisper-large-v3 while matching or exceeding its accuracy across multilingual benchmarks. By replacing autoregressive decoding with parallel diffusion decoding, the system runs 4-5x faster while maintaining competitive performance with leading ASR systems, establishing non-autoregressive diffusion as a viable paradigm for high-throughput transcription.

AIBearisharXiv – CS AI · Jun 97/10
🧠

Hacking Generative Perplexity: Why Unconditional Text Evaluation Needs Distributional Metrics

Researchers demonstrate that generative perplexity (gen-PPL), the primary metric for evaluating non-autoregressive language models, is fundamentally flawed because it measures only predictability under frozen scorers, not actual text quality. They construct deliberately naive samplers that achieve state-of-the-art results while producing incoherent text, proving the metric's inadequacy and advocating for distributional divergence metrics instead.

🏢 Perplexity
AINeutralarXiv – CS AI · Jun 26/10
🧠

DiffuSent: Towards a Unified Diffusion Framework for Aspect-Based Sentiment Analysis

Researchers introduce DiffuSent, a non-autoregressive diffusion framework that reformulates seven aspect-based sentiment analysis (ABSA) subtasks as boundary denoising processes. The approach achieves significant improvements over existing generative models, particularly on multi-word expressions, while delivering up to 181x faster inference speeds through parallel decoding rather than sequential token generation.

AIBullisharXiv – CS AI · May 296/10
🧠

NaRA: Noise-Aware LoRA for Parameter-Efficient Fine-Tuning of Diffusion LLMs

Researchers introduce NaRA (Noise-aware Low-Rank Adaptation), a parameter-efficient fine-tuning method designed specifically for diffusion large language models that adapts to noise levels during the denoising process. Unlike existing methods like LoRA that use static parameters, NaRA employs a hypernetwork to dynamically adjust low-rank matrices based on noise, achieving better performance on reasoning and code generation tasks.

AINeutralarXiv – CS AI · May 96/10
🧠

Continuous Latent Diffusion Language Model

Researchers propose Cola DLM, a hierarchical latent diffusion language model that generates text through continuous semantic modeling rather than traditional left-to-right autoregressive decoding. The approach achieves comparable performance to autoregressive models while offering greater flexibility, better scaling properties, and a potential pathway for unified modeling across discrete and continuous modalities.

AINeutralarXiv – CS AI · Apr 146/10
🧠

Early Decisions Matter: Proximity Bias and Initial Trajectory Shaping in Non-Autoregressive Diffusion Language Models

Researchers identify a critical failure mode in non-autoregressive diffusion language models caused by proximity bias, where the denoising process concentrates on adjacent tokens, creating spatial error propagation. They propose a minimal-intervention approach using a lightweight planner and temperature annealing to guide early token selection, achieving substantial improvements on reasoning and planning tasks.

AIBullisharXiv – CS AI · Mar 176/10
🧠

SyncSpeech: Efficient and Low-Latency Text-to-Speech based on Temporal Masked Transformer

Researchers introduce SyncSpeech, a new text-to-speech model that combines autoregressive and non-autoregressive approaches using a Temporal Mask Transformer architecture. The model achieves 5.8x lower first-packet latency and 8.8x improved real-time performance while maintaining comparable speech quality to existing models.

AINeutralarXiv – CS AI · Feb 276/1011
🧠

Why Diffusion Language Models Struggle with Truly Parallel (Non-Autoregressive) Decoding?

Researchers identify why Diffusion Language Models (DLMs) struggle with parallel token generation, finding that training data structure forces autoregressive-like behavior. They propose NAP, a data-centric approach using multiple independent reasoning trajectories that improves parallel decoding performance on math benchmarks.