Entity category enhanced nested named entity recognition in automotive domain

doi:10.11772/j.issn.1001-9081.2023020239

Abstract

Abstract:

Aiming at the problem of poor recognition of nested entities and long entities in the Chinese automotive domain entity extraction task， an Entity Category Enhanced nested Named Entity Recognition （ECE-NER） model was proposed. Firstly， the model’s perception of domain entity boundaries was improved based on feature fusion encoding. Then， the tail word recognition module was used to obtain the entity tail word set by multi-layer perceptron. Finally， the forward boundary recognition module was used to obtain entity category-enhanced entity representation of candidate tail words， based on the sememe-constructed entity category features and self-attention mechanism. By fusing domain entity category features， a biaffine encoder was used to calculate the entity span probabilities of the specific tail words in order to determine the named entities. The experimental evaluation was carried out on the failure dataset of the automobile production line， the failure extraction and evaluation dataset of the automobile industry CCL2022， and the Chinese medical text dataset CHIP2020. The experimental results on the first two datasets show that ECE-NER model increases F1 value by 4.1， 1.8， 1.6 percentage points and 9.0， 5.4， 7.3 percentage points respectively compared with the baseline models including the sequence labeling model （BERT+BiLSTM+CRF） and the span-based entity extraction models （PURE（Princeton University Relation Extraction）， SpERT（Span-based Entity and Relation Transformer））. Especially， ECE-NER model increases F1 value of nested entity recognition by 13.3， 8.3 and 21.7， 9.3 percentage points in the first and third datasets compared to PURE and SpERT models. The experimental results verify the effectiveness of the proposed model on the recognition of nested entities.

Key words: feature fusion, sememe characteristic, self-attention mechanism, biaffine encoder, Chinese nested named entity recognition

摘要：

针对中文汽车领域实体抽取任务中对嵌套实体、长实体识别效果差的问题，提出一种实体类别增强的嵌套实体抽取（ECE-NER）模型。首先，基于特征融合编码，提高模型对领域实体边界的感知能力；然后，尾词识别模块利用多层感知机得到实体尾词集合；最后，前向边界识别模块基于义原构造的实体类别特征和自注意力机制得到实体类别增强的候选尾词表征，融合领域实体类别特征，利用双仿射编码器计算特定尾词和实体类型的实体跨度概率，从而确定命名实体。在某汽车企业生产线故障数据集、汽车工业故障抽取评测数据集CCL2022和中文医学文本数据集CHIP2020上进行模型验证。实验结果表明，所提模型在前两个数据集上的实体识别F1值比序列标注模型（BERT+BiLSTM+CRF）、基于跨度的实体抽取模型（PURE（Princeton University Relation Extraction）、SpERT（Span-based Entity and Relation Transformer））分别提高了4.1、1.8、1.6个百分点和9.0、5.4、7.3个百分点；在第一个数据集和第三个数据集中嵌套实体识别F1值与PURE、SpERT模型相比提高了13.3、8.3个百分点和21.7、9.3个百分点，验证了所提模型在嵌套实体识别上的有效性。

关键词: 特征融合, 义原特征, 自注意力机制, 双仿射编码器, 中文嵌套命名实体识别

CLC Number:

TP391.1

Ziqi HUANG, Jianpeng HU. Entity category enhanced nested named entity recognition in automotive domain[J]. Journal of Computer Applications, 2024, 44(2): 377-384.

黄子麒, 胡建鹏. 实体类别增强的汽车领域嵌套命名实体识别[J]. 《计算机应用》唯一官方网站, 2024, 44(2): 377-384.

Figures/Tables 11

References 24

1	MIWA M， BANSAL M. End-to-end relation extraction using LSTMs on sequences and tree structures［C］// Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics （Volume 1： Long Papers）. Stroudsburg： ACL， 2016： 1105-1116. 10.18653/v1/p16-1105
2	HANIFAH A F， KUSUMANINGRUM R. Non-factoid answer selection in Indonesian science question answering system using Long Short-Term Memory （LSTM）［J］. Procedia Computer Science， 2021， 179： 736-746. 10.1016/j.procs.2021.01.062
3	BAI T， GE Y， GUO S， et al. Enhanced natural language interface for web-based information retrieval［J］. IEEE Access， 2020， 9： 4233-4241. 10.1109/access.2020.3048164
4	赵鹏飞，赵春江，吴华瑞，等. 基于BERT的多特征融合农业命名实体识别［J］. 农业工程学报， 2022， 38（3）： 112-118. 10.11975/j.issn.1002-6819.2022.03.013
	ZHAO P F， ZHAO C J， WU H R， et al. Recognition of the agricultural named entities with multi-feature fusion based on BERT［J］. Transactions of the Chinese Society of Agricultural Engineering， 2022， 38（3）： 112-118. 10.11975/j.issn.1002-6819.2022.03.013
5	刘文松，胡竹青，张锦辉，等. 基于文本特征增强的电力命名实体识别［J］. 电力系统自动化， 2022， 46（21）： 134-142. 10.7500/AEPS20210323003
	LIU W S， HU Z Q， ZHANG J H， et al. Named entity recognition for electric power industry based on enhanced text features ［J］. Automation of Electric Power Systems， 2022， 46： 134-142. 10.7500/AEPS20210323003
6	马建红，张炳斐，张少光，等. 基于主动MCNN-SCRF的新能源汽车命名实体识别［J］. 计算机工程与应用， 2019， 55（7）： 23-29.
	MA J H， ZHANG B F， ZHANG S G， et al. Named entity recognition for new energy vehicles based on active MCNN-SCRF［J］. Computer Engineering and Applications， 2019， 55（7）： 23-29.
7	张永平. 汽车领域命名实体识别方法研究［D］. 株洲：湖南工业大学， 2016： 2-7.
	ZHANG Y P. Research on named entity recognition method in automobile field［D］. Zhuzhou： Hunan University of Technology， 2016： 2-7.
8	BHATTACHARYA M， BHAT S， TRIPATHY S， et al. Improving biomedical named entity recognition through transfer learning and asymmetric tri-training［J］. Procedia Computer Science， 2023， 218： 2723-2733. 10.1016/j.procs.2023.01.244
9	DEVLIN J， CHANG M-W， LEE K， et al. BERT： pre-training of deep bidirectional Transformers for language understanding［C］ // Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics： Human Language Technologies（Volume 1： Long and Short Papers）. Stroudsburg： ACL， 2019： 4171-4186. 10.18653/v1/n18-2
10	LU W， ROTH D. Joint mention extraction and classification with mention hypergraphs［C］ // Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing. Stroudsburg： ACL， 2015： 857-867. 10.18653/v1/d15-1102
11	KATIYAR A， CARDIE C. Nested named entity recognition revisited［C］// Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics： Human Language Technologies（Volume 1： Long Papers）. Stroudsburg： ACL， 2018： 861-871. 10.18653/v1/n18-1079
12	LI H， YU L， LYU M， et al. Fusion deep learning and machine learning for multi-source heterogeneous military entity recognition［C］// Proceedings of the 2021 IEEE Conference on Telecommunications， Optics and Computer Science. Piscataway： IEEE， 2021： 535-539. 10.1109/tocs53301.2021.9688813
13	JU M， MIWA M， ANANIADOU S. A neural layered model for nested named entity recognition［C］// Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics： Human Language Technologies， Volume 1 （Long Papers）. Stroudsburg： ACL， 2018： 1446-1459. 10.18653/v1/n18-1131
14	SOHRAB M G， MIWA M. Deep exhaustive model for nested named entity recognition［C］// Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. Stroudsburg： ACL， 2018： 2843-2849. 10.18653/v1/d18-1309
15	ZHONG Z， CHEN D. A frustratingly easy approach for entity and relation extraction［C］// Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics： Human Language Technologies. Stroudsburg： ACL， 2021： 50-61. 10.18653/v1/2021.naacl-main.5
16	EBERTS M， ULGES A. Span-based joint entity and relation extraction with transformer pre-training［EB/OL］. （2019-09-17）［2023-03-06］. . 10.18653/v1/2021.eacl-main.319
17	黄昌宁，李玉梅，朱晓丹. 中文文本标注规范（5.0版）［EB/OL］. 微软亚洲研究院（2006-03-27）［2023-02-10］. .
	HUANG C N， LI Y M， ZHU X D. Chinese text annotation specification （Version 5.0）［EB/OL］. Microsoft Asia Research Institute（2006-03-27）［2023-02-10］. .
18	PAN S J， YANG Q. A survey on transfer learning［J］. IEEE Transactions on Knowledge Data Engineering， 2010， 22（10）： 1345-1359. 10.1109/tkde.2009.191
19	MIKOLOV T， CHEN K， CORRADO G， et al. Efficient estimation of word representations in vector space［EB/OL］. （2013-01-16）［2023-02-01］. . 10.3126/jiee.v3i1.34327
20	PENNINGTON J， SOCHER R， MANNING C. GloVe： global vectors for word representation［C］// Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing. Stroudsburg： ACL， 2014： 1532-1543. 10.3115/v1/d14-1162
21	CUI Y， CHE W， LIU T， et al. Pre-training with whole word masking for Chinese BERT［J］. IEEE/ACM Transactions on Audio， Speech， and Language Processing， 2021， 29： 3504-3514. 10.1109/taslp.2021.3124365
22	CHE W， FENG Y， QIN L， et al. N-LTP： an open-source neural language technology platform for Chinese［EB/OL］. （2021-09-23）［2023-03-06］. . 10.18653/v1/2021.emnlp-demo.6
23	LI X， FENG J， MENG Y， et al. A unified MRC framework for named entity recognition［C］// Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Stroudsburg： ACL， 2020： 5849-5859. 10.18653/v1/2020.acl-main.519
24	QI F， YANG C， LIU Z， et al. OpenHowNet： an open sememe-based lexical knowledge base［EB/OL］. （2019-01-28）［2023-03-06］. . 10.18653/v1/p19-1571

实体类别	备注	实例	实体数
总和			12 204
设备单元	设备的名称、型号	主轴、夹爪	4 592
设备功能	设备的功能表述	切割、对刀	475
检修动作	检修设备的动作	更换、紧固	3 746
检修工具	对设备检修的工具	电脑、图纸	60
失效模式	设备的故障描述	电压异常	2 734
设备属性	设备理化属性	电压13.5 V	597

实体类别	备注	实例	实体数
总和			12 204
设备单元	设备的名称、型号	主轴、夹爪	4 592
设备功能	设备的功能表述	切割、对刀	475
检修动作	检修设备的动作	更换、紧固	3 746
检修工具	对设备检修的工具	电脑、图纸	60
失效模式	设备的故障描述	电压异常	2 734
设备属性	设备理化属性	电压13.5 V	597

数据集	训练集	验证集	测试集
CCL2022	2 400	300	300
FDoAPL	2 543	318	318
CHIP2020	16 000	2 000	2 000

数据集	训练集	验证集	测试集
CCL2022	2 400	300	300
FDoAPL	2 543	318	318
CHIP2020	16 000	2 000	2 000

模型参数	参数值	模型参数	参数值
batch size	4	CNN窗大小	8.0
学习率	5×10^-5	α	0.5
优化器	AdamW	γ	0.5
tail threshold	0.6	max_span_size	11.0