Journal of Computer Applications ›› 2025, Vol. 45 ›› Issue (11): 3547-3554. DOI: 10.11772/j.issn.1001-9081.2024111606
• Artificial intelligence •
Nested named entity recognition method for multi-directional gradient feature extraction
Xiaoman WANG1,2,3, Yanping CHEN1,2,3, Caiwei YANG1,2,3, Ruizhang HUANG1,2,3, Yongbin QIN1,2,3
Received: 2024-11-14
Revised: 2025-02-12
Accepted: 2025-02-17
Online: 2025-04-02
Published: 2025-11-10
Contact: Yanping CHEN
About author: WANG Xiaoman, born in 1999 in Taiyuan, Shanxi, M. S. candidate, CCF member. Her research interests include natural language processing and information extraction.
Xiaoman WANG, Yanping CHEN, Caiwei YANG, Ruizhang HUANG, Yongbin QIN. Nested named entity recognition method for multi-directional gradient feature extraction[J]. Journal of Computer Applications, 2025, 45(11): 3547-3554.
URL: https://www.joca.cn/EN/10.11772/j.issn.1001-9081.2024111606
| Dataset | Batch size | Epochs | Learning rate | BERT learning rate | Word embedding dimension | Optimizer |
|---|---|---|---|---|---|---|
| ACE 2005 | 8 | 17 | 0.001 | 0.00002 | 768 | Adam |
| GENIA | 8 | 11 | 0.001 | 0.000006 | 768 | Adam |
| CoNLL2003 | 12 | 10 | 0.001 | 0.00001 | 1024 | Adam |
Tab. 1 Parameter setting
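The settings in Tab. 1 can be captured as a small configuration object. The following is a minimal sketch, not the authors' code: the names `TrainConfig`, `CONFIGS`, and `param_group_lrs` are hypothetical, and it assumes that the "learning rate" column applies to all non-BERT parameters while the "BERT learning rate" column applies to the encoder.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TrainConfig:
    """Hyperparameters for one dataset, copied from Tab. 1 (field names are hypothetical)."""
    batch_size: int
    epochs: int
    lr: float           # assumed: learning rate for non-BERT parameters
    bert_lr: float      # learning rate for the BERT encoder
    embedding_dim: int  # word embedding dimension
    optimizer: str


# Values transcribed from Tab. 1.
CONFIGS = {
    "ACE 2005": TrainConfig(8, 17, 1e-3, 2e-5, 768, "Adam"),
    "GENIA": TrainConfig(8, 11, 1e-3, 6e-6, 768, "Adam"),
    "CoNLL2003": TrainConfig(12, 10, 1e-3, 1e-5, 1024, "Adam"),
}


def param_group_lrs(dataset: str) -> dict:
    """Return the two learning rates that separate parameter groups would use."""
    cfg = CONFIGS[dataset]
    return {"bert": cfg.bert_lr, "other": cfg.lr}
```

Keeping the two learning rates in one place makes it easy to hand them to an optimizer that supports per-group settings, should the training code use one.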
| Dataset | Model | P | R | F1 |
|---|---|---|---|---|
| ACE 2005 | Ref. [ | 78.85 | 81.34 | 80.07 |
| ACE 2005 | Ref. [ | 79.31 | 80.94 | 80.12 |
| ACE 2005 | Ref. [ | 79.31 | 80.94 | 85.12 |
| ACE 2005 | Ref. [ | 88.33 | 86.50 | 87.41 |
| ACE 2005 | Proposed model | 87.66 | 88.82 | 88.01 |
| GENIA | Ref. [ | 79.20 | 77.40 | 78.30 |
| GENIA | Ref. [ | 80.10 | 79.47 | 79.79 |
| GENIA | Ref. [ | 82.31 | 78.66 | 80.44 |
| GENIA | Ref. [ | 81.50 | 79.60 | 80.50 |
| GENIA | Ref. [ | 80.19 | 80.89 | 80.54 |
| GENIA | Ref. [ | 82.24 | 80.06 | 81.13 |
| GENIA | Ref. [ | 61.89 | 66.95 | 64.42 |
| GENIA | Proposed model | 81.19 | 81.55 | 81.23 |
Tab. 2 Performance comparison of different models on ACE 2005 and GENIA datasets
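The P, R, and F1 columns are related by a simple formula. Assuming the tables use the standard definition of F1 as the harmonic mean of precision and recall (the page does not state this explicitly), the relationship can be sketched as follows; `f1_score` is a hypothetical helper name.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (both in percent or both as fractions)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Example with one ACE 2005 row from Tab. 2: P = 88.33, R = 86.50.
print(round(f1_score(88.33, 86.50), 2))  # 87.41
```

Values recomputed this way can differ from a reported F1 in the last digit, because the published P and R are themselves rounded.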
| Model | P | R | F1 |
|---|---|---|---|
| Ref. [ | 88.85 | 88.34 | 88.63 |
| Ref. [ | 91.72 | 92.05 | 91.96 |
| Ref. [ | 92.31 | 92.14 | 92.28 |
| Ref. [ | 92.04 | 92.65 | 92.34 |
| Ref. [ | 92.48 | 92.33 | 92.41 |
| Proposed model | 91.71 | 93.25 | 92.52 |
Tab. 3 Performance comparison of different models on CoNLL2003 dataset
| Model setting | ACE 2005 P | ACE 2005 R | ACE 2005 F1 | GENIA P | GENIA R | GENIA F1 |
|---|---|---|---|---|---|---|
| w/ biaffine | 85.15 | 88.59 | 86.84 | 81.31 | 80.60 | 80.96 |
| w/ 4-direction 3×3 operator | 85.54 | 88.97 | 87.35 | 81.06 | 81.13 | 81.08 |
| Proposed model | 87.66 | 88.82 | 88.01 | 81.63 | 80.69 | 81.23 |
Tab. 4 Performance comparison between local and global models
| Model setting | P | R | F1 |
|---|---|---|---|
| w/o residual connection | 81.60 | 80.13 | 80.86 |
| w/o multi-directional gradient operator | 81.79 | 80.17 | 80.89 |
| w/o pointwise convolution | 81.39 | 80.60 | 80.96 |
| Proposed model | 81.63 | 80.69 | 81.23 |
Tab. 5 Ablation experimental results on GENIA dataset
| [1] | DENG Y Y, WU C X, WEI Y F, et al. A survey of named entity recognition based on deep learning[J]. Journal of Chinese Information Processing, 2021, 35(9): 30-45. |
| [2] | LIU C, YANG S. Using text mining to establish knowledge graph from accident/incident reports in risk assessment[J]. Expert Systems with Applications, 2022, 207: No.117991. |
| [3] | LAHIRI A K, HU Q V. Named entity-based question-answering pair generator[C]// Proceedings of the 31st ACM International Conference on Information and Knowledge Management. New York: ACM, 2022: 4902-4906. |
| [4] | FU Y, LIN N, CHEN B, et al. Cross-lingual named entity recognition for heterogeneous languages[J]. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023, 31: 371-382. |
| [5] | JU M, MIWA M, ANANIADOU S. A neural layered model for nested named entity recognition[C]// Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers). Stroudsburg: ACL, 2018: 1446-1459. |
| [6] | SOHRAB M G, MIWA M. Deep exhaustive model for nested named entity recognition[C]// Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. Stroudsburg: ACL, 2018: 2843-2849. |
| [7] | GENG R, CHEN Y, HUANG R, et al. Planarized sentence representation for nested named entity recognition[J]. Information Processing and Management, 2023, 60(4): No.103352. |
| [8] | YANG C W. Research on semantic edge enhancement method for entity recognition[D]. Guiyang: Guizhou University, 2024: 19-23. |
| [9] | ZOU X, ZHANG Y, ZHANG S, et al. FPGA implementation of edge detection for Sobel operator in eight directions[C]// Proceedings of the 2018 IEEE Asia Pacific Conference on Circuits and Systems. Piscataway: IEEE, 2018: 520-523. |
| [10] | LI Z, SONG M, ZHU Y, et al. Chinese nested named entity recognition based on boundary prompt[C]// Proceedings of the 2023 International Conference on Web Information Systems and Applications, LNCS 14094. Singapore: Springer, 2023: 331-343. |
| [11] | CHEN Y, WU Y, QIN Y, et al. Recognizing nested named entity based on the neural network boundary assembling model[J]. IEEE Intelligent Systems, 2020, 35(1): 74-81. |
| [12] | STRAKOVÁ J, STRAKA M, HAJIC J. Neural architectures for nested NER through linearization[C]// Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Stroudsburg: ACL, 2019: 5326-5331. |
| [13] | TAN Z, SHEN Y, ZHANG S, et al. A sequence-to-set network for nested named entity recognition[C]// Proceedings of the 30th International Joint Conference on Artificial Intelligence. California: IJCAI.org, 2021: 3936-3942. |
| [14] | LI J, FEI H, LIU J, et al. Unified named entity recognition as word-word relation classification[C]// Proceedings of the 36th AAAI Conference on Artificial Intelligence. Palo Alto: AAAI Press, 2022: 10965-10973. |
| [15] | SHEN Y, MA X, TAN Z, et al. Locate and label: a two-stage identifier for nested named entity recognition[C]// Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). Stroudsburg: ACL, 2021: 2782-2794. |
| [16] | ZHENG Q, WU Y, WANG G, et al. Exploring interactive and contrastive relations for nested named entity recognition[J]. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023, 31: 2899-2909. |
| [17] | CUI S, JOE I. A multi-head adjacent attention-based pyramid layered model for nested named entity recognition[J]. Neural Computing and Applications, 2023, 35(3): 2561-2574. |
| [18] | YAN H, SUN Y, LI X N, et al. An embarrassingly easy but strong baseline for nested named entity recognition[C]// Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). Stroudsburg: ACL, 2023: 1442-1452. |
| [19] | WALKER C, STRASSEL S, MEDERO J, et al. ACE 2005 multilingual training corpus[DS/OL]. [2024-03-13]. |
| [20] | KIM J D, OHTA T, TATEISI Y, et al. GENIA corpus — a semantically annotated corpus for bio-textmining[J]. Bioinformatics, 2003, 19(S1): i180-i182. |
| [21] | TJONG KIM SANG E F, DE MEULDER F. Introduction to the CoNLL-2003 shared task: language-independent named entity recognition[C]// Proceedings of the 7th Conference on Natural Language Learning at HLT-NAACL 2003. Stroudsburg: ACL, 2003: 142-147. |
| [22] | YAN H, GUI T, DAI J, et al. A unified generative framework for various NER subtasks[C]// Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). Stroudsburg: ACL, 2021: 5808-5822. |
| [23] | MA X, HOVY E. End-to-end sequence labeling via bi-directional LSTM-CNNs-CRF[C]// Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Stroudsburg: ACL, 2016: 1064-1074. |
| [24] | CHEN T H. Digital image processing[M]. 2nd ed. Beijing: Tsinghua University Press, 2014: 85. |
| [25] | VERMA H, BERGLER S, TAHAEI N. Comparing and combining some popular NER approaches on Biomedical tasks[C]// Proceedings of the 22nd Workshop on Biomedical Natural Language Processing and BioNLP Shared Tasks. Stroudsburg: ACL, 2023: 273-279. |
| [26] | ZHANG S, CHENG H, GAO J, et al. Optimizing bi-encoder for named entity recognition via contrastive learning[EB/OL]. [2024-05-09]. |
| [27] | WANG S H, SUN X F, LI X Y, et al. GPT-NER: named entity recognition via large language models[C]// Findings of the Association for Computational Linguistics: NAACL 2025. Stroudsburg: ACL, 2025: 4257-4275. |
| [28] | HANH T T H, DOUCET A, SIDERE N, et al. Named entity recognition architecture combining contextual and global features[C]// Proceedings of the 2021 International Conference on Asian Digital Libraries, LNCS 13133. Cham: Springer, 2021: 264-276. |
| [29] | LUO Y, XIAO F, ZHAO H. Hierarchical contextualized representation for named entity recognition[C]// Proceedings of the 34th AAAI Conference on Artificial Intelligence. Palo Alto: AAAI Press, 2020: 8441-8448. |
| [30] | XIA C, ZHANG C, YANG T, et al. Multi-grained named entity recognition[C]// Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Stroudsburg: ACL, 2019: 1430-1440. |
| [31] | CHEN H, LIN Z, DING G, et al. GRN: gated relation network to enhance convolutional neural network for named entity recognition[C]// Proceedings of the 33rd AAAI Conference on Artificial Intelligence. Palo Alto: AAAI Press, 2019: 6236-6243. |
| [32] | SHEN Y, TAN Z, WU S, et al. PromptNER: prompt locating and typing for named entity recognition[C]// Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Stroudsburg: ACL, 2023: 12492-12507. |
| [33] | HU X Y, WANG C, LI D. Research on edge detection algorithm based on improved Sobel operator[J]. Fujian Computer, 2018, 34(9): 13-15. |
| [34] | SHEN D H, ZHANG L C, E X. An strengthening gradient edge detection algorithm based on Sobel[J]. Electronic Design Engineering, 2015, 23(10): 162-165. |