AI · Bullish · arXiv – CS AI · 10h ago · 7/10
Masked Auto-Regressive Variational Acceleration: Fast Inference Makes Practical Reinforcement Learning
Researchers introduce MARVAL, a distillation framework that accelerates masked auto-regressive diffusion models by compressing inference into a single step. The method achieves a 30x speedup on ImageNet at comparable quality, making RL post-training feasible with these models for the first time.