Zhu L, Guo F, Cai G, Ma Y. Structured Preference Modeling for Reinforcement Learning-Based Fine-Tuning of Large Models. JCTS [Internet]. 2025 Apr. 30 [cited 2026 Jul. 8];4(4). Available from: https://www.ashpress.org/index.php/jcts/article/view/156