Chinese character relation extraction model based on pre-training and multi-level information

doi:10.11772/j.issn.1001-9081.2021001090

Journal of Computer Applications

Artificial Intelligence and Simulation

YAO Bowen, ZENG Biqing, CAI Jian, DING Meirong

School of Software, South China Normal University

Received: 2021-01-18 Revised: 2021-04-27 Online: 2021-06-04 Published: 2021-06-04
Corresponding author: ZENG Biqing
About the authors: YAO Bowen, born in 1997 in Ganzhou, Jiangxi, M.S. candidate, CCF member. His research interests include natural language processing and relation extraction. ZENG Biqing, born in 1969 in Hengyang, Hunan, Ph.D., professor, CCF member. His research interests include natural language processing, artificial intelligence, text sentiment analysis, relation extraction and dialogue systems. CAI Jian, born in 1996 in Jieyang, Guangdong, M.S. candidate. His research interests include natural language processing and relation extraction. DING Meirong, born in 1972 in Hanggin Rear Banner, Inner Mongolia, M.S., associate professor, CCF member. Her research interests include natural language processing.
Supported by:
This work is partially supported by the National Natural Science Foundation of China (62076103), the Special Project in Key Fields of Artificial Intelligence of Guangdong Universities (2019KZDZX1033), and the Opening Project of Guangdong Province Key Laboratory of Cyber-Physical System (2020B1212060069).

Abstract

Abstract: Relation extraction aims to extract the relations between entity pairs from text and is one of the active research directions in natural language processing. Chinese person (character) relation extraction corpora have complex grammatical structures, which makes it difficult to learn the semantic features of the text effectively. To address this problem, a Chinese character relation extraction model based on pre-training and multi-level information (CCREPMI) is proposed. The model first generates word vectors using the strong semantic representation ability of a pre-trained model. It then processes the original sentence at three levels (sentence level, entity level and entity-adjacent level) and extracts features at each level. Finally, it fuses the sentence structure features, the entity meanings and the dependencies between entities and their adjacent words to predict the relation class. Experimental results on a Chinese person relation dataset show that the model achieves a precision of 81.5%, a recall of 82.3% and an F1 score of 81.9%, an improvement over baseline models such as BERT and BERT-LSTM. In addition, the model reaches an F1 score of 81.2% on the SemEval2010-task8 English dataset, which shows that it has some generalization ability on English corpora.

Key words: relation extraction, pre-training model, word embedding, feature fusion, semantic understanding
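
The abstract describes feature extraction at three levels followed by fusion and classification. The sketch below is one minimal, hypothetical way to picture that flow in plain Python; it is not the authors' CCREPMI implementation. Random vectors stand in for the pre-trained embeddings, mean pooling stands in for the unspecified feature extractors, and the function names, the window size, the example tokens and the untrained linear classifier are all assumptions made for illustration.

```python
import math
import random


def mean_pool(vectors):
    """Average a non-empty list of equal-length vectors element-wise."""
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def neighbourhood(span, window, length):
    """Token indices within `window` positions of the span, excluding the span."""
    start, end = span  # end is exclusive
    left = range(max(0, start - window), start)
    right = range(end, min(length, end + window))
    return list(left) + list(right)


def multi_level_features(token_vecs, head_span, tail_span, window=2):
    """Concatenate sentence-, entity- and entity-neighbourhood-level vectors."""
    n = len(token_vecs)
    features = mean_pool(token_vecs)  # sentence level
    for span in (head_span, tail_span):  # entity level
        features += mean_pool(token_vecs[span[0]:span[1]])
    for span in (head_span, tail_span):  # entity-adjacent level
        idx = neighbourhood(span, window, n)
        # Fall back to the entity tokens if the entity has no neighbours.
        vecs = [token_vecs[i] for i in idx] or token_vecs[span[0]:span[1]]
        features += mean_pool(vecs)
    return features


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def classify(features, weights, biases):
    """Linear layer followed by softmax over relation classes."""
    scores = [
        sum(w * x for w, x in zip(row, features)) + b
        for row, b in zip(weights, biases)
    ]
    return softmax(scores)


# Assumed toy input: 7 tokens, head entity at [0, 2), tail entity at [5, 7).
rng = random.Random(0)
dim, num_relations = 4, 3
tokens = ["Zhang", "San", "'s", "wife", "is", "Li", "Si"]
# Stand-in for pre-trained embeddings (a real system would use model outputs).
token_vecs = [[rng.uniform(-1, 1) for _ in range(dim)] for _ in tokens]

features = multi_level_features(token_vecs, (0, 2), (5, 7), window=2)
# Untrained, randomly initialised classifier weights (illustration only).
weights = [[rng.uniform(-0.1, 0.1) for _ in features] for _ in range(num_relations)]
biases = [0.0] * num_relations
probs = classify(features, weights, biases)
print(len(features), probs)
```

With embedding dimension d, the fused vector in this sketch has 5d entries: one sentence vector, two entity vectors and two neighbourhood vectors. A trained model would learn the extractor and classifier weights rather than sample them.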

CLC number:

TP399

YAO Bowen, ZENG Biqing, CAI Jian, DING Meirong. Chinese character relation extraction model based on pre-training and multi-level information[J]. Journal of Computer Applications, DOI: 10.11772/j.issn.1001-9081.2021001090.
