AI · Bullish · arXiv – CS AI · 6h ago · 7/10
TAPS: Target-Aware Prefix Tree Selection for Diffusion-Drafted Speculative Decoding
Researchers introduce TAPS, a target-aware prefix tree selection method that speeds up diffusion-drafted speculative decoding by optimizing how draft trees are verified. The technique achieves up to a 7.9x speedup over standard autoregressive decoding and outperforms competing methods by 1.36x to 1.74x. It addresses a fundamental inefficiency in existing approaches, which verify unreachable token sequences.