y0news
#2-bit-model
1 article
AI · Bullish · arXiv – CS AI · Feb 27 · 7/10 · 5

AngelSlim: A more accessible, comprehensive, and efficient toolkit for large model compression

The Tencent Hunyuan team introduces AngelSlim, a toolkit for large model compression that covers quantization, speculative decoding, and pruning. The toolkit includes HY-1.8B-int2, described as the first industrially viable 2-bit large model, and reports 1.8x to 2.0x throughput gains while maintaining output quality.