Journal of Computer Applications ›› 2021, Vol. 41 ›› Issue (9): 2517-2522.DOI: 10.11772/j.issn.1001-9081.2020111842

Special Issue: Artificial Intelligence

• Artificial intelligence •

Joint extraction method of entities and relations based on subject attention

LIU Yaxuan1,2, ZHONG Yong1,2   

  1. Chengdu Institute of Computer Application, Chinese Academy of Sciences, Chengdu Sichuan 610041, China;
    2. University of Chinese Academy of Sciences, Beijing 100049, China
  • Received:2020-11-24 Revised:2021-03-24 Online:2021-09-10 Published:2021-05-12
  • Supported by:
    This work is partially supported by the Science and Technology Achievement Transformation Platform Program of Sichuan Province (2020ZHCG0002).

  • Corresponding author: ZHONG Yong
  • About the authors: LIU Yaxuan (1997-), female, born in Shangrao, Jiangxi, M. S. candidate. Her research interests include natural language processing and big data. ZHONG Yong (1966-), male, born in Yuechi, Sichuan, Ph. D., research fellow, CCF member. His research interests include big data, artificial intelligence and software process.

Abstract: Entity and relation extraction is a crucial step in building large-scale knowledge graphs and in various information extraction tasks. Based on a pre-trained language model, an entity-oriented joint extraction method with subject attention was proposed. In this method, the key information of the subject (head entity) was extracted by a Convolutional Neural Network (CNN), and the dependency between the subject and the object was captured by an attention mechanism; on this basis, a Joint extraction model based on Subject Attention (JSA) was built. In experiments on the public dataset New York Times corpus (NYT) and an artificial intelligence domain dataset built by distant supervision, the F1 score of the proposed model was improved by 1.8 and 8.9 percentage points respectively compared with the Cascade binary tagging framework for Relational triple extraction (CasRel).
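
The abstract describes the architecture only at a high level, so the following PyTorch-style sketch is purely illustrative: it assumes BERT-style contextual token embeddings, a 1D CNN that distills the subject (head entity) span, a multi-head attention step in which the subject representation attends over the sentence to model subject-object dependencies, and CasRel-style per-relation start/end tagging of object spans. All layer sizes, the module name SubjectAttentionTagger, and the exact fusion are assumptions, not the published JSA design.

```python
# Minimal sketch under stated assumptions; not the authors' published JSA model.
import torch
import torch.nn as nn


class SubjectAttentionTagger(nn.Module):
    def __init__(self, hidden_size=768, num_relations=24, kernel_size=3):
        super().__init__()
        # CNN over the subject (head entity) span to distill its key information
        self.subject_cnn = nn.Conv1d(hidden_size, hidden_size,
                                     kernel_size, padding=kernel_size // 2)
        # attention: the subject representation attends over all sentence tokens
        self.attn = nn.MultiheadAttention(hidden_size, num_heads=8,
                                          batch_first=True)
        # per-relation start/end tagging of object spans (CasRel-style cascade)
        self.obj_start = nn.Linear(hidden_size, num_relations)
        self.obj_end = nn.Linear(hidden_size, num_relations)

    def forward(self, token_states, subject_mask):
        # token_states: (B, L, H) contextual embeddings from a pre-trained LM
        # subject_mask: (B, L) with 1 on the subject's token positions
        subj = token_states * subject_mask.unsqueeze(-1)   # keep subject tokens only
        subj = self.subject_cnn(subj.transpose(1, 2)).transpose(1, 2)
        subj = subj.max(dim=1, keepdim=True).values        # (B, 1, H) key subject info
        # subject-conditioned context: attention from the subject over the sentence
        ctx, _ = self.attn(query=subj.expand_as(token_states),
                           key=token_states, value=token_states)
        fused = token_states + ctx                          # subject-aware token states
        return (torch.sigmoid(self.obj_start(fused)),       # (B, L, R) start scores
                torch.sigmoid(self.obj_end(fused)))         # (B, L, R) end scores
```

In use, token_states would be the last hidden states of the pre-trained encoder and subject_mask would mark one candidate head-entity span; the two sigmoid maps score object start and end positions for each relation type, from which (subject, relation, object) triples can be decoded.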

Key words: entity and relation extraction, joint extraction, Natural Language Processing (NLP), attention mechanism, domain knowledge graph, subject


