WANG, Yichen. Structured Compression of Large Language Models with Sensitivity-aware Pruning Mechanisms. Journal of Computer Technology and Software, [S. l.], v. 3, n. 9, 2024. DOI: 10.5281/zenodo.15851638. Available at: https://www.ashpress.org/index.php/jcts/article/view/187. Accessed: 8 Jul. 2026.