Nested named entity recognition combined with boundary generation by multi-objective learning

doi:10.11772/j.issn.1001-9081.2024070980

Journal of Computer Applications, 2025, Vol. 45, Issue (7): 2229-2236.

Zhangjie XU¹,²,³, Yanping CHEN¹,²,³, Ying HU¹,²,³, Ruizhang HUANG¹,²,³, Yongbin QIN¹,²,³

1. Engineering Research Center of Ministry of Education for Text Computing and Cognitive Intelligence (Guizhou University), Guiyang Guizhou 550025, China
2. State Key Laboratory of Public Big Data (Guizhou University), Guiyang Guizhou 550025, China
3. College of Computer Science and Technology, Guizhou University, Guiyang Guizhou 550025, China

Received: 2024-07-10 Revised: 2024-10-08 Accepted: 2024-10-09 Online: 2025-07-10 Published: 2025-07-10
Corresponding author: Yanping CHEN
About author: XU Zhangjie, born in 2000 in Guiyang, Guizhou, M. S. candidate, CCF student member. Her research interests include natural language processing and information extraction.
HU Ying, born in 1996 in Chongqing, Ph. D. candidate. His research interests include natural language processing.
HUANG Ruizhang, born in 1979 in Tianjin, Ph. D., professor, CCF member. Her research interests include data fusion analysis, text mining, network mining, knowledge discovery, and machine learning.
QIN Yongbin, born in 1980 in Yantai, Shandong, Ph. D., professor, CCF senior member. His research interests include big data governance and application, multi-source data fusion, intelligent computing, machine learning, and algorithm design.
Supported by:
Major Science and Technology Project of Guizhou Province (Qiankehe [2024]003); National Key Research and Development Program of China (2023YFC3304500); National Natural Science Foundation of China (62166007)

Abstract:

Named Entity Recognition (NER) aims to identify entities of predefined types in unstructured text. Span-based NER methods enumerate all possible spans and classify each one. However, adjacent spans in a text share contextual semantics, which blurs the semantic information at span boundaries and makes it difficult for a model to capture dependencies among spans. To address this boundary ambiguity, a multi-objective learning NER model combined with boundary generation is proposed. The model trains the NER task and a boundary generation task jointly through multi-objective learning. The boundary generation task serves as an auxiliary task that guides the network to attend to the boundary information of spans, which strengthens their boundary semantics and in turn improves NER performance. On the ACE2004, ACE2005, and GENIA datasets, the proposed model achieves F1 scores of 87.83%, 86.90%, and 81.65%, respectively, which supports its effectiveness across datasets for the NER task.

Key words: Named Entity Recognition (NER), span classification, multi-objective learning, boundary generation, neural network
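
To make the span-based formulation in the abstract concrete, the following is a minimal sketch (not the authors' implementation) of how all candidate spans of a sentence can be enumerated and laid out in a two-dimensional span matrix, where cell (i, j) with i <= j stands for the span from token i to token j. The function names `enumerate_spans` and `span_matrix`, the example sentence, and its gold annotations are illustrative assumptions.

```python
from typing import Dict, List, Tuple

Span = Tuple[int, int]  # inclusive (start, end) token indices


def enumerate_spans(tokens: List[str]) -> List[Span]:
    """Return every contiguous span (i, j) with i <= j."""
    n = len(tokens)
    return [(i, j) for i in range(n) for j in range(i, n)]


def span_matrix(tokens: List[str], gold: Dict[Span, str]) -> List[List[str]]:
    """Build an n x n label matrix; only the upper triangle holds spans.

    Cells below the diagonal are marked "-" because start > end is invalid;
    spans without a gold label are marked "O" (non-entity).
    """
    n = len(tokens)
    matrix = [["-"] * n for _ in range(n)]
    for i, j in enumerate_spans(tokens):
        matrix[i][j] = gold.get((i, j), "O")
    return matrix


# Hypothetical example with a nested entity: the PER span "the mayor of
# Guiyang" contains the GPE span "Guiyang".
tokens = ["the", "mayor", "of", "Guiyang"]
gold = {(0, 3): "PER", (3, 3): "GPE"}

spans = enumerate_spans(tokens)
matrix = span_matrix(tokens, gold)
print(len(spans))  # n * (n + 1) / 2 candidate spans
for row in matrix:
    print(" ".join(f"{cell:>3}" for cell in row))
```

Neighbouring cells such as (0, 3) and (1, 3) cover almost the same tokens, which is the shared context the abstract identifies as the source of ambiguous boundary semantics.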

CLC number: TP391.1

Zhangjie XU, Yanping CHEN, Ying HU, Ruizhang HUANG, Yongbin QIN. Nested named entity recognition combined with boundary generation by multi-objective learning[J]. Journal of Computer Applications, 2025, 45(7): 2229-2236.

Figures and tables (8)

Fig. 1 Two-dimensional span matrix example

Fig. 2 Architecture of the proposed model

Fig. 3 Structure of the boundary generation task

Tab. 1 Dataset statistics

Split	Dataset	Sentences	Avg. sentence length	Entities	Avg. entity length	Nested entities	Nested entity ratio/%
Training	ACE2004	6 200	23.50	22 204	2.63	10 149	45.71
Training	ACE2005	7 194	19.21	24 441	2.42	9 389	38.41
Training	GENIA	15 023	25.27	45 144	1.95	7 997	17.71
Validation	ACE2004	745	23.02	2 514	2.67	1 092	46.69
Validation	ACE2005	969	18.93	3 200	2.26	1 112	34.75
Validation	GENIA	1 669	26.01	5 365	1.97	1 067	19.88
Test	ACE2004	812	23.05	3 035	2.68	1 417	45.61
Test	ACE2005	1 047	17.20	2 993	2.40	1 118	37.35
Test	GENIA	1 854	25.98	5 506	2.08	1 199	21.77
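
As a small worked example of how the last column relates to the counts, the sketch below recomputes the nested entity ratio for the three training splits. It assumes the ratio is the number of nested entities divided by the number of entities, expressed as a percentage; the helper name `nested_ratio` is illustrative.

```python
# Training-split counts copied from Tab. 1: (entities, nested entities).
train_counts = {
    "ACE2004": (22204, 10149),
    "ACE2005": (24441, 9389),
    "GENIA": (45144, 7997),
}


def nested_ratio(entities: int, nested: int) -> float:
    """Percentage of entities that are nested, rounded to two decimals."""
    return round(100.0 * nested / entities, 2)


ratios = {name: nested_ratio(e, n) for name, (e, n) in train_counts.items()}
print(ratios)
```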

Tab. 2 Parameter settings

Parameter	Value	Parameter	Value
Batch size	8	Dropout	0.5
Training epochs	50	Balance factor λ	1×10^-3
Learning rate	1×10^-5	Convolution dilation rates	{1, 2, 3, 4}
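
The settings above can be collected in a small configuration object, as in the sketch below. The field names and the `joint_loss` helper are hypothetical. In particular, this page names λ a balance factor but does not give the loss formula, so the additive form (NER loss plus λ times the boundary generation loss) is an assumption made only for illustration.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class TrainConfig:
    """Hyperparameters copied from Tab. 2 (field names are illustrative)."""
    batch_size: int = 8
    epochs: int = 50
    learning_rate: float = 1e-5
    dropout: float = 0.5
    balance_lambda: float = 1e-3
    dilation_rates: Tuple[int, ...] = (1, 2, 3, 4)


def joint_loss(ner_loss: float, boundary_loss: float, lam: float) -> float:
    """ASSUMPTION: total loss = NER loss + lam * boundary generation loss."""
    return ner_loss + lam * boundary_loss


cfg = TrainConfig()
# Hypothetical per-batch loss values, for illustration only.
total = joint_loss(ner_loss=0.42, boundary_loss=3.0, lam=cfg.balance_lambda)
print(f"{total:.3f}")
```

Under this assumed form, a small λ keeps the auxiliary task from dominating the main NER objective.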

Tab. 3 Results of different models on the datasets (%)

Model	ACE2004 P	ACE2004 R	ACE2004 F1	ACE2005 P	ACE2005 R	ACE2005 F1	GENIA P	GENIA R	GENIA F1
BoningKnife [24]	85.98	86.86	86.41	84.77	86.16	85.46	-	-	-
Biaffine [26]	87.30	86.00	86.70	85.20	85.60	85.40	81.80	79.30	80.50
Locate and Label [25]	87.44	87.38	87.41	86.09	87.27	86.67	80.19	80.89	80.54
Triaffine [28]	87.13	87.68	87.40	86.70	86.94	86.82	80.42	82.06	81.23
W2NER [27]	87.33	87.71	87.52	85.03	88.62	86.79	83.10	79.76	81.39
Local Future Boost [29]	86.96	86.36	86.66	84.94	86.73	85.83	82.35	80.33	81.33
Debiasing [17]	87.64	87.61	87.63	85.01	87.47	86.22	79.51	79.48	79.49
Biaffine and Triaffine [30]	87.91	87.41	87.66	85.80	87.95	86.86	83.02	78.88	80.90
Proposed model	87.33	88.34	87.83	85.37	88.48	86.90	81.26	82.05	81.65
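
The F1 column is the harmonic mean of precision (P) and recall (R). As a quick check, the sketch below recomputes F1 for the proposed model from the rounded P and R values in the table; the helper name `f1_score` is illustrative, and small deviations would be expected in general because the inputs are already rounded.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (both in percent)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (P, R, reported F1) for the proposed model, copied from Tab. 3.
proposed = {
    "ACE2004": (87.33, 88.34, 87.83),
    "ACE2005": (85.37, 88.48, 86.90),
    "GENIA": (81.26, 82.05, 81.65),
}

recomputed = {
    name: round(f1_score(p, r), 2) for name, (p, r, _) in proposed.items()
}
print(recomputed)
```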

Tab. 4 Selection of balance factor λ on GENIA (%)

λ	P	R	F1
10^-1	80.09	80.00	80.05
10^-2	81.39	81.11	81.25
10^-3	82.52	80.95	81.73
10^-4	81.67	80.82	81.24
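
Choosing λ from this table amounts to taking the setting with the highest F1. The sketch below does that with the values copied from the table; the variable names are illustrative.

```python
# (P, R, F1) on GENIA for each balance factor, copied from Tab. 4.
results = {
    1e-1: (80.09, 80.00, 80.05),
    1e-2: (81.39, 81.11, 81.25),
    1e-3: (82.52, 80.95, 81.73),
    1e-4: (81.67, 80.82, 81.24),
}

# Pick the balance factor whose F1 (third field) is highest.
best_lambda = max(results, key=lambda lam: results[lam][2])
print(best_lambda, results[best_lambda][2])
```

The selected value matches the balance factor of 1×10^-3 listed in Tab. 2.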

Tab. 5 F1 scores after removing each module (%)

Module	ACE2004	ACE2005	GENIA
Full model	87.83	86.90	81.65
- Boundary generation	87.67	86.56	81.48
- Dilated convolution	87.65	86.74	81.57
- Biaffine	87.58	86.73	81.51
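
To read the ablation more easily, the drop in F1 caused by removing each module can be computed directly from the table. The sketch below is only arithmetic on the reported numbers; the variable names are illustrative.

```python
# F1 scores copied from Tab. 5, ordered as (ACE2004, ACE2005, GENIA).
datasets = ("ACE2004", "ACE2005", "GENIA")
full = (87.83, 86.90, 81.65)
ablations = {
    "boundary generation": (87.67, 86.56, 81.48),
    "dilated convolution": (87.65, 86.74, 81.57),
    "Biaffine": (87.58, 86.73, 81.51),
}

# Drop in F1 (percentage points) when each module is removed.
drops = {
    module: {d: round(f - a, 2) for d, f, a in zip(datasets, full, scores)}
    for module, scores in ablations.items()
}
for module, per_dataset in drops.items():
    print(module, per_dataset)
```

On ACE2005 and GENIA the largest drop comes from removing boundary generation, while on ACE2004 removing Biaffine costs slightly more.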

References (36)

[1]	WANG Y J, ZHANG C Y, BAI F B, et al. Review of Chinese named entity recognition research [J]. Journal of Frontiers of Computer Science and Technology, 2023, 17(2): 324-341.
[2]	GUO J, XU G, CHENG X, et al. Named entity recognition in query [C]// Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval. New York: ACM, 2009: 267-274.
[3]	PETKOVA D, CROFT W B. Proximity-based document representation for named entity retrieval [C]// Proceedings of the 16th ACM Conference on Information and Knowledge Management. New York: ACM, 2007: 731-740.
[4]	MOLLÁ D, VAN ZAANEN M, SMITH D. Named entity recognition for question answering [C]// Proceedings of the Australasian Language Technology Association Workshop 2006. [S.l.]: Australasian Language Technology Association, 2006: 51-58.
[5]	ETZIONI O, CAFARELLA M, DOWNEY D, et al. Unsupervised named-entity extraction from the Web: an experimental study [J]. Artificial Intelligence, 2005, 165(1): 91-134.
[6]	ZHANG Z, HAN X, LIU Z, et al. ERNIE: enhanced language representation with informative entities [C]// Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Stroudsburg: ACL, 2019: 1441-1451.
[7]	CHENG P, ERK K. Attending to entities for better text understanding [C]// Proceedings of the 34th AAAI Conference on Artificial Intelligence. Palo Alto: AAAI Press, 2020: 7554-7561.
[8]	BABYCH B, HARTLEY A. Improving machine translation quality with automatic named entity recognition [C]// Proceedings of the 7th International EAMT workshop on MT and other language technology tools, Improving MT through other language technology tools, Resource and tools for building MT at EACL. Stroudsburg: ACL, 2003: 1-8.
[9]	CAI Y X, LUO D, GAN Y L, et al. Nested named entity recognition based on span boundary perception [J]. Journal of Software, 2024, 35(11): 5149-5162.
[10]	GENG R S, CHEN Y P, TANG R X, et al. Named entity recognition based on span semantic enhancement [J]. Journal of Xi’an Jiaotong University, 2022, 56(7): 118-126.
[11]	LU W, ROTH D. Joint mention extraction and classification with mention hypergraphs [C]// Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing. Stroudsburg: ACL, 2015: 857-867.
[12]	MUIS A O, LU W. Labeling gaps between words: recognizing overlapping mentions with mention separators [C]// Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing. Stroudsburg: ACL, 2017: 2608-2618.
[13]	KATIYAR A, CARDIE C. Nested named entity recognition revisited [C]// Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers). Stroudsburg: ACL, 2018: 861-871.
[14]	YAN Y, CAI B, SONG S. Nested named entity recognition as building local hypergraphs [C]// Proceedings of the 37th AAAI Conference on Artificial Intelligence. Palo Alto: AAAI Press, 2023: 13878-13886.
[15]	STRAKOVÁ J, STRAKA M, HAJIC J. Neural architectures for nested NER through linearization [C]// Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Stroudsburg: ACL, 2019: 5326-5331.
[16]	YAN H, GUI T, DAI J, et al. A unified generative framework for various NER subtasks [C]// Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). Stroudsburg: ACL, 2021: 5808-5822.
[17]	XIA Y, ZHAO Y, WU W, et al. Debiasing generative named entity recognition by calibrating sequence likelihood [C]// Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). Stroudsburg: ACL, 2023: 1137-1148.
[18]	JU M, MIWA M, ANANIADOU S. A neural layered model for nested named entity recognition [C]// Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers). Stroudsburg: ACL, 2018: 1446-1459.
[19]	WANG J, SHOU L, CHEN K, et al. Pyramid: a layered model for nested named entity recognition [C]// Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Stroudsburg: ACL, 2020: 5918-5928.
[20]	ROJAS M, BRAVO-MARQUEZ F, DUNSTAN J. Simple yet powerful: an overlooked architecture for nested named entity recognition [C]// Proceedings of the 29th International Conference on Computational Linguistics. [S.l.]: International Committee on Computational Linguistics, 2022: 2108-2117.
[21]	SOHRAB M G, MIWA M. Deep exhaustive model for nested named entity recognition [C]// Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. Stroudsburg: ACL, 2018: 2843-2849.
[22]	ZHENG C, CAI Y, XU J, et al. A boundary-aware neural model for nested named entity recognition [C]// Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing. Stroudsburg: ACL, 2019: 357-366.
[23]	TAN C, QIU W, CHEN M, et al. Boundary enhanced neural span classification for nested named entity recognition [C]// Proceedings of the 34th AAAI Conference on Artificial Intelligence. Palo Alto: AAAI Press, 2020: 9016-9023.
[24]	JIANG H, WANG G, CHEN W, et al. BoningKnife: joint entity mention detection and typing for nested NER via prior boundary knowledge [EB/OL]. [2024-05-10].
[25]	SHEN Y, MA X, TAN Z, et al. Locate and label: a two-stage identifier for nested named entity recognition [C]// Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). Stroudsburg: ACL, 2021: 2782-2794.
[26]	YU J, BOHNET B, POESIO M. Named entity recognition as dependency parsing [C]// Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Stroudsburg: ACL, 2020: 6470-6476.
[27]	LI J, FEI H, LIU J, et al. Unified named entity recognition as word-word relation classification [C]// Proceedings of the 36th AAAI Conference on Artificial Intelligence. Palo Alto: AAAI Press, 2022: 10965-10973.
[28]	YUAN Z, TAN C, HUANG S, et al. Fusing heterogeneous factors with triaffine mechanism for nested named entity recognition [C]// Findings of the Association for Computational Linguistics: ACL 2022. Stroudsburg: ACL, 2022: 3174-3186.
[29]	DENG J, LIU J, MA X, et al. Local feature enhancement for nested entity recognition using a convolutional block attention module [J]. Applied Sciences, 2023, 13(16): No.9200.
[30]	GUO Y, TANG T, SUN S, et al. Nested entity recognition fusing span relative position and region information [J]. Electronics, 2023, 12(11): No.2483.
[31]	DEVLIN J, CHANG M W, LEE K, et al. BERT: pre-training of deep bidirectional transformers for language understanding [C]// Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long and Short Papers). Stroudsburg: ACL, 2019: 4171-4186.
[32]	LEWIS M, LIU Y, GOYAL N, et al. BART: denoising sequence-to-sequence pre-training for natural language generation, translation, and comprehension [C]// Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Stroudsburg: ACL, 2020: 7871-7880.
[33]	DODDINGTON G, MITCHELL A, PRZYBOCKI M, RAMSHAW L, STRASSEL S, WEISCHEDEL R. The Automatic Content Extraction (ACE) program-tasks, data, and evaluation [C]// Proceedings of the 4th International Conference on Language Resources and Evaluation. Paris: European Language Resources Association, 2004: 837-840.
[34]	WALKER C, STRASSEL S, MEDERO J, MAEDA K. ACE 2005 multilingual training corpus [DS/OL]. [2024-05-15].
[35]	KIM J D, OHTA T, TATEISI Y, et al. GENIA corpus: a semantically annotated corpus for bio-text mining [J]. Bioinformatics, 2003, 19(S1): i180-i182.
[36]	LOSHCHILOV I, HUTTER F. Fixing weight decay regularization in Adam [EB/OL]. [2024-06-10].
