AIBullisharXiv โ CS AI ยท 6d ago7/103
๐ง
UniWeTok: An Unified Binary Tokenizer with Codebook Size $\mathit{2^{128}}$ for Unified Multimodal Large Language Model
Researchers introduce UniWeTok, a unified binary tokenizer with a massive 2^128 codebook for multimodal large language models. The system achieves state-of-the-art image generation performance on ImageNet while requiring significantly less training compute than existing solutions.