AI · Bullish · arXiv – CS AI · Feb 27 · 7/10
AngelSlim: A more accessible, comprehensive, and efficient toolkit for large model compression
The Tencent Hunyuan team introduces AngelSlim, a toolkit for large model compression that covers quantization, speculative decoding, and pruning. The toolkit includes HY-1.8B-int2, described as the first industrially viable 2-bit large model, and reports 1.8x to 2.0x throughput gains while maintaining output quality.