#constrained-decoding News & Analysis

5 articles tagged with #constrained-decoding. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

5 articles

AIBullisharXiv – CS AI · May 297/10

🧠

Accelerating Constrained Decoding with Token Space Compression

Researchers introduce CFGzip, a token space compression technique that dramatically accelerates constrained decoding for large language models using context-free grammars. The method achieves up to 100x latency reduction and 7.5x total speedup, making complex grammar-constrained generation feasible at scale.

AIBullisharXiv – CS AI · Mar 57/10

🧠

Draft-Conditioned Constrained Decoding for Structured Generation in LLMs

Researchers introduce Draft-Conditioned Constrained Decoding (DCCD), a training-free method that improves structured output generation in large language models by up to 24 percentage points. The technique uses a two-step process that first generates an unconstrained draft, then applies constraints to ensure valid outputs like JSON and API calls.

AIBullisharXiv – CS AI · Mar 47/102

🧠

$\texttt{SEM-CTRL}$: Semantically Controlled Decoding

Researchers introduce SEM-CTRL, a new approach that ensures Large Language Models produce syntactically and semantically correct outputs without requiring fine-tuning. The system uses token-level Monte Carlo Tree Search guided by Answer Set Grammars to enforce context-sensitive constraints, allowing smaller pre-trained LLMs to outperform larger models on tasks like reasoning and planning.

AINeutralarXiv – CS AI · May 296/10

🧠

Projectional Decoding: Towards Semantic-Aware LLM Generation

Researchers propose projectional decoding, a framework that integrates semantic validation directly into LLM generation by maintaining a partial graph model alongside text output. This approach aims to ensure semantic validity of software artifacts with provable guarantees, addressing a critical limitation of existing constrained decoding techniques that enforce syntax but struggle with broader semantic correctness.

AINeutralarXiv – CS AI · Mar 54/10

🧠

Token-Oriented Object Notation vs JSON: A Benchmark of Plain and Constrained Decoding Generation

A benchmark study compares Token-Oriented Object Notation (TOON) with JSON for structured data serialization in LLMs, finding that while TOON reduces token usage, plain JSON shows better accuracy overall. The research reveals that TOON's efficiency benefits may only emerge at scale where syntax savings offset the initial prompt overhead.