Journal of Computer Applications

Chinese story ending generation model based on bidirectional contrastive training

SHUAI Qi 1, WANG Hairui 1, ZHU Guifu 2   

  1. Faculty of Information Engineering and Automation, Kunming University of Science and Technology; 2. Information Technology Construction Management Center, Kunming University of Science and Technology
  • Received: 2023-09-11  Revised: 2023-11-17  Online: 2024-03-21  Published: 2024-03-21
  • Corresponding author: ZHU Guifu
  • About author: SHUAI Qi, born in 1999 in Meishan, Sichuan, M.S. candidate. His research interests include natural language processing. WANG Hairui, born in 1969 in Kunming, Yunnan, Ph.D., professor. His research interests include fundamentals of computer applications and engineering technology. ZHU Guifu, born in 1984 in Ganzhou, Jiangxi, M.S., senior engineer. His research interests include computer vision.
  • Supported by:
    National Natural Science Foundation of China (61863016)

Abstract: Chinese Story Ending Generation (SEG) is a downstream task in natural language processing. CLSEG (Contrastive Learning of Story Ending Generation), which relies on entirely wrong endings, performs well in terms of story consistency. However, because a wrong ending also shares content with the original ending, contrastive training with wrong endings alone strips the correct main parts of the ending from the generated text. Therefore, positive-ending enhancement training was added on top of CLSEG to preserve the correct parts lost in contrastive training; at the same time, introducing positive endings makes the generated endings more diverse and more relevant. The proposed model consists of two main parts: 1) multi-ending sampling, which obtains positively enhanced endings and negatively contrasted wrong endings through different model-based methods; 2) contrastive training, which modifies the loss function during training so that the generated ending is drawn close to the positive ending and pushed away from the wrong ending. Experimental results on the public story dataset OutGen show that, compared with models such as Gpt2.ft and Della, the proposed model achieves better results on BERTScore, METEOR, and other metrics, and generates more diverse and relevant endings.
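The modified training objective described above can be illustrated with a minimal PyTorch sketch. Everything here is an assumption based on the abstract alone: the function names, the weights alpha and beta, and the use of average token log-probabilities are hypothetical, not the paper's exact formulation.

```python
import torch
import torch.nn.functional as F

def sequence_log_prob(logits, token_ids):
    """Average per-token log-probability of token_ids under logits.

    logits:    (B, T, V) model logits over the vocabulary
    token_ids: (B, T)    token ids of the ending being scored
    """
    log_probs = F.log_softmax(logits, dim=-1)                 # (B, T, V)
    token_lp = log_probs.gather(-1, token_ids.unsqueeze(-1))  # (B, T, 1)
    return token_lp.squeeze(-1).mean()

def bidirectional_contrastive_loss(gold_logits, gold_ids,
                                   pos_logits, pos_ids,
                                   neg_logits, neg_ids,
                                   alpha=0.5, beta=0.5):
    """Hypothetical sketch of the bidirectional objective: keep the
    standard likelihood term on the gold ending, pull the model toward
    the sampled positive ending, and push it away from the wrong one."""
    # Standard maximum-likelihood (cross-entropy) term on the gold ending;
    # cross_entropy expects (B, V, T), hence the transpose.
    nll = F.cross_entropy(gold_logits.transpose(1, 2), gold_ids)
    # Forward term: raise the likelihood of the positive (enhanced) ending.
    pos_term = sequence_log_prob(pos_logits, pos_ids)
    # Reverse term: lower the likelihood of the wrong (contrastive) ending.
    neg_term = sequence_log_prob(neg_logits, neg_ids)
    return nll - alpha * pos_term + beta * neg_term
```

In this sketch the wrong-ending term mirrors CLSEG's original contrastive direction, while the positive-ending term corresponds to the forward enhancement the abstract adds; alpha and beta are illustrative weights, not values reported by the authors.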

Key words: Chinese story ending generation, contrastive training, text sampling, text generation, natural language processing

