AIBullisharXiv โ CS AI ยท 6h ago1
๐ง
MetaState: Persistent Working Memory for Discrete Diffusion Language Models
Researchers introduce MetaState, a recurrent augmentation for discrete diffusion language models (dLLMs) that adds persistent working memory to improve text generation quality. The system addresses the 'Information Island' problem where intermediate representations are discarded between denoising steps, achieving improved accuracy on LLaDA-8B and Dream-7B models with minimal parameter overhead.