AIBullisharXiv – CS AI · 10h ago7/10
🧠
UniRank: Unified Rank Allocation for Low-Rank LLM Compression
Researchers propose UniRank, a new method for efficiently allocating ranks in low-rank decomposition of large language models by scoring components via local singular energy and global functional importance. The approach achieves up to 50% perplexity reduction compared to baseline methods without additional fine-tuning, addressing a key bottleneck in LLM compression.
🏢 Perplexity