《计算机应用》唯一官方网站

• •    下一篇

基于知识增强和提示学习的小样本新闻主题分类方法

余新言 1,曾诚 1,2,3*,王乾 1,何鹏 2,3,4,丁晓玉 1
  

  1. 1.湖北大学 人工智能学院,武汉 4300622. 湖北省软件工程技术研究中心,武汉 4300623. 智慧政务与人工智能应用湖北省工程研究中心,武汉 4300624. 湖北大学 网络空间安全学院,武汉 430062
  • 收稿日期:2023-06-06 修回日期:2023-11-15 接受日期:2023-12-05 发布日期:2024-01-04 出版日期:2024-01-04
  • 作者简介:余新言(1995—),女,湖北荆州人,硕士研究生,主要研究方向:自然语言处理,小样本学习;曾诚(1976—),男,湖北武汉人,教授,博士,CCF 专业会员,主要研究方向:服务计算, 人工智能;王乾(1999—),男,河南洛阳人,硕士研究生,CCF学生会员,主要研究方向:自然语言处理,文本分类;何鹏 (1988—),男,江西宜春人,教授,博士, CCF 专业会员,主要研究方向:软件质量分析,缺陷预测;丁晓玉 (1998—),女,湖北孝感人,硕士研究生,主要研究方向:自然语言处理,知识图谱。
  • 基金资助:
    湖北省重点研发计划项目(2021BAA188,2021BAA184,2022BAA044); 国家自然科学基金(62102136)

Few-shot news topic classification method based on knowledge enhancement and prompt learning #br#

YU Xinyan1, ZENG Cheng1,2,3*, WANG Qian1, HE Peng2,3,4, DING Xiaoyu1   

  1. 1. School of Artificial Intelligence, Hubei University, Wuhan Hubei 430062, China;2. Engineering and Technical Research Center of Hubei Province in Software Engineering, Wuhan Hubei 430062, China;3. Engineering Research Center of Hubei Province in Intelligent Government Affairs and Application of Artificial Intelligence, Wuhan Hubei 430062, China;4. School of Cyber Science and Technology, Hubei University, Wuhan Hubei 430062, China
  • Received:2023-06-06 Revised:2023-11-15 Accepted:2023-12-05 Online:2024-01-04 Published:2024-01-04
  • About author:YU Xinyan, born in 1995, M. S. candidate. Her research interests include natural language processing, few-shot learning. ZENG Cheng, born in 1976, Ph. D., professor. His research interests include service computing, artificial intelligence. WANG Qian, born in 1999, M. S. candidate. His research interests include natural language processing, text classification. HE Peng, born in 1988, Ph. D., professor. His research interests include software quality analysis, defect prediction. DING Xiaoyu, born in 1998, M. S. candidate. Her research interests include natural language processing, knowledge graph
  • Supported by:

    This work is supported by Key R&D Projects in Hubei Province(2021BAA188,2021BAA184,2022BAA044),  National Natural Science Foundation of China (62102136).

摘要: 基于预训练微调的分类方法通常需要大量带标注的数据,导致无法在小样本分类任务上使用。因此,针对中文小样本新闻主题分类任务,提出一种基于知识增强和提示学习的分类方法(KPL)。首先,利用预训练模型在训练集上学习到最优的提示模板;其次,将提示模板与输入文本结合,使分类任务转化为完形填空任务;同时利用外部知识来扩充标签词空间,丰富标签词的语义信息;最后,对预测的标签词与原始的标签进行映射。通过在 THUCNewsSHNews Toutiao 三个新闻数据集上进行随机采样,形成小样本训练集和验证集,进行实验。实验结果表明,所提方法在上述数据集上的 1-shot5-shot10- shot20-shot 任务上整体表现有所提升,尤其在 1-shot 任务上提升效果突出,与基线小样本分类方法相比,准确率分别提高了7.952.11 3.1 百分点,验证了知识增强和提示学习在小样本新闻主题分类任务上的有效性。

Abstract: Classification methods based on fine-tuning pre-trained models usually require a large amount of annotated data, resulting in the inability to be used for few-shot classification tasks. Therefore, a Knowledge-Enhanced and Prompt Learning (KPL) method was proposed for Chinese few-shot news topic classification. Firstrly, an optimal prompt template was learned from the training set by using a pre-trained model. Then the template was integrated with the input text, effectively transforming the classification task into a cloze-filling task. External knowledge was simultaneously utilized to expand the label word space, enhancing the semantic richness of label words. Predicted label words were subsequently mapped back to the original labels. Experiments conducted by randomly sampling on three news datasets, THUCNews, SHNews and Toutiao, to form a small-sample training set and validation set. The experimental results shown that the proposed method improves the overall performance on the 1-shot, 5-shot, 10-shot, and 20-shot tasks on the above datasets.. Notably, a significant improvement is observed in the 1-shot task. Compared to baseline few-shot classification methods, the accuracy rate increased by 7.95%, 2.11%, and 3.1% respectively, confirming the effectiveness of the knowledge enhancement and prompt learning approach in few-shot news topic classification tasks.