Zhu, Lin, Fan Guo, Guohui Cai, and Yumeng Ma. “Structured Preference Modeling for Reinforcement Learning-Based Fine-Tuning of Large Models”. Journal of Computer Technology and Software 4, no. 4 (April 30, 2025). Accessed June 7, 2025. https://www.ashpress.org/index.php/jcts/article/view/156.