Journal of Computer Applications


Two-stage prompt tuning method for automated preference alignment


  • Received: 2024-08-02 Revised: 2024-09-02 Online: 2024-09-12 Published: 2024-09-12

FENG Tao, LIU Chen

  1. North China University of Technology
  • Corresponding author: FENG Tao
  • Supported by:
    Key Program of the National Natural Science Foundation of China; Guangzhou Science and Technology Plan Project - Key Research and Development Program

Abstract: User prompts often lack domain-specific professionalism and terminology, making it difficult for Large Language Models (LLMs) to accurately understand user intent and generate content that meets domain requirements. To address this, an Automated Preference Alignment Dual-Stage Prompt Tuning (APADPT) method is proposed to solve the preference alignment problem that LLMs face when applied in vertical domains. APADPT refines input prompts by constructing a supervised fine-tuning dataset that incorporates human preferences and by using LLMs to perform semantic analysis and pairwise evaluation of replies. After two-stage training, the model not only masters general prompt-optimization rules but also makes specialized adjustments for the characteristics of the target vertical domain. In experiments in the medical domain, APADPT significantly improved the preference alignment consistency of both API-based and open-source LLMs, raising the average win rate by 9.5% to 20.5%. The method also shows good robustness and generalization across open-source models of different parameter scales, providing a new optimization strategy for applying LLMs in specialized vertical domains and helping to achieve higher performance standards while preserving model generality and adaptability.
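The pipeline described in the abstract — build a preference dataset, use an LLM judge for pairwise reply evaluation, then train in two stages (general refinement rules, followed by vertical-domain specialization) — can be sketched as below. This is a minimal illustrative sketch, not the paper's implementation: the `PreferencePair` fields, the toy `judge` stand-in for an LLM evaluator, and the dataset-construction functions are all assumptions introduced here.

```python
# Hypothetical sketch of an APADPT-style data pipeline.
# The judge, field names, and staging logic are illustrative
# assumptions, not the paper's actual API.

from dataclasses import dataclass

@dataclass
class PreferencePair:
    raw_prompt: str      # original user prompt
    refined_prompt: str  # candidate rewritten prompt
    reply_a: str         # LLM reply generated from raw_prompt
    reply_b: str         # LLM reply generated from refined_prompt

def judge(pair: PreferencePair) -> str:
    """Toy stand-in for an LLM judge doing pairwise semantic
    evaluation of the two replies; a real system would query an
    LLM. Here, the longer reply is treated as more informative."""
    return "b" if len(pair.reply_b) > len(pair.reply_a) else "a"

def build_sft_dataset(pairs):
    """Keep only (raw -> refined) rewrite examples whose refined
    prompt produced the winning reply in the pairwise comparison."""
    return [
        {"input": p.raw_prompt, "target": p.refined_prompt}
        for p in pairs
        if judge(p) == "b"
    ]

def two_stage_dataset(general_pairs, domain_pairs):
    """Stage 1: general prompt-refinement examples.
    Stage 2: vertical-domain (e.g. medical) examples layered on
    top of the stage-1 data for specialized fine-tuning."""
    stage1 = build_sft_dataset(general_pairs)
    stage2 = stage1 + build_sft_dataset(domain_pairs)
    return stage1, stage2

if __name__ == "__main__":
    general = [PreferencePair(
        "fix my code",
        "Explain the bug in this Python snippet and propose a fix.",
        "ok", "Here is a detailed explanation of the bug ...")]
    medical = [PreferencePair(
        "headache pills?",
        "Which over-the-counter analgesics are indicated for tension headache?",
        "take something",
        "Common OTC options include ibuprofen and acetaminophen ...")]
    s1, s2 = two_stage_dataset(general, medical)
    print(len(s1), len(s2))  # 1 2
```

Separating the two stages this way mirrors the abstract's claim that the model first learns general prompt-optimization rules and only then specializes to the vertical domain.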

Key words: large language model, vertical domain optimization, preference alignment, prompt optimization

