←
Return to Article Details
Structured Compression of Large Language Models with Sensitivity-aware Pruning Mechanisms
Download