AIBullisharXiv – CS AI · Mar 37/103
🧠
UniWeTok: An Unified Binary Tokenizer with Codebook Size $\mathit{2^{128}}$ for Unified Multimodal Large Language Model
Researchers introduce UniWeTok, a unified binary tokenizer with a massive 2^128 codebook for multimodal large language models. The system achieves state-of-the-art image generation performance on ImageNet while requiring significantly less training compute than existing solutions.