Return to Article Details Structured Compression of Large Language Models with Sensitivity-aware Pruning Mechanisms
Download