Wang, Yichen. “Structured Compression of Large Language Models With Sensitivity-Aware Pruning Mechanisms.” Journal of Computer Technology and Software, vol. 3, no. 9, Dec. 2024, doi:10.5281/zenodo.15851638.