Journal of Computer Applications ›› 2022, Vol. 42 ›› Issue (7): 2001-2008.DOI: 10.11772/j.issn.1001-9081.2021050861
Special Issue: 人工智能
• Artificial intelligence • Previous Articles Next Articles
Yayao ZUO(), Haoyu CHEN, Zhiran CHEN, Jiawei HONG, Kun CHEN
Received:
2021-05-25
Revised:
2021-09-09
Accepted:
2021-10-12
Online:
2021-09-09
Published:
2022-07-10
Contact:
Yayao ZUO
About author:
CHEN Haoyu, born in 1995, M.S. candidate. His research interests include natural language processing, deep learning.Supported by:
通讯作者:
左亚尧
作者简介:
陈皓宇(1995—),男,广东广州人,硕士研究生,主要研究方向:自然语言处理、深度学习基金资助:
CLC Number:
Yayao ZUO, Haoyu CHEN, Zhiran CHEN, Jiawei HONG, Kun CHEN. Named entity recognition method combining multiple semantic features[J]. Journal of Computer Applications, 2022, 42(7): 2001-2008.
左亚尧, 陈皓宇, 陈致然, 洪嘉伟, 陈坤. 融合多语义特征的命名实体识别方法[J]. 《计算机应用》唯一官方网站, 2022, 42(7): 2001-2008.
Add to citation manager EndNote|Ris|BibTeX
URL: https://www.joca.cn/EN/10.11772/j.issn.1001-9081.2021050861
命名实体类别 | 首字符 | 非首字符 |
---|---|---|
人名 | B-PER | I-PER |
地点 | B-LOC | I-LOC |
组织机构 | B-ORG | I-ORG |
Tab.1 MSRA entity labels
命名实体类别 | 首字符 | 非首字符 |
---|---|---|
人名 | B-PER | I-PER |
地点 | B-LOC | I-LOC |
组织机构 | B-ORG | I-ORG |
命名实体类别 | 首字符 | 非首字符 |
---|---|---|
protein | B-protein | I-protein |
DNA | B-DNA | I-DNA |
RNA | B-RNA | I-RNA |
cell line | B-line | I-line |
cell type | B-type | I-type |
Tab.2 BioNLP/NLPBA 2004 entity labels
命名实体类别 | 首字符 | 非首字符 |
---|---|---|
protein | B-protein | I-protein |
DNA | B-DNA | I-DNA |
RNA | B-RNA | I-RNA |
cell line | B-line | I-line |
cell type | B-type | I-type |
数据集 | 训练集 | 验证集 | 测试集 |
---|---|---|---|
MSRA | 37 091 | 9 273 | 4 365 |
BioNLP/NLPBA 2004 | 18 241 | 4 561 | 4 256 |
Tab.3 Dataset division condition
数据集 | 训练集 | 验证集 | 测试集 |
---|---|---|---|
MSRA | 37 091 | 9 273 | 4 365 |
BioNLP/NLPBA 2004 | 18 241 | 4 561 | 4 256 |
参数名 | 值 | 单位 |
---|---|---|
文本长度 | 60 | 字符 |
向量维度 | 100 | 维 |
自注意力的头数 | 6 | — |
LSTM个数 | 2 | — |
GCN层数 | 2 | — |
Dropout参数 | 0.5 | — |
训练批次大小 | 40 | 轮次 |
滑动窗口大小 | 5 | — |
Tab.4 Model parameter setting
参数名 | 值 | 单位 |
---|---|---|
文本长度 | 60 | 字符 |
向量维度 | 100 | 维 |
自注意力的头数 | 6 | — |
LSTM个数 | 2 | — |
GCN层数 | 2 | — |
Dropout参数 | 0.5 | — |
训练批次大小 | 40 | 轮次 |
滑动窗口大小 | 5 | — |
项目 | 环境 |
---|---|
系统 | Windows 10 |
GPU | NVIDIA 3090Ti |
硬盘 | 1 TB |
内存 | 16 GB |
Python版本 | Python3.6 |
Pytorch版本 | Pytorch1.1 |
Tab.5 Software and hardware environments
项目 | 环境 |
---|---|
系统 | Windows 10 |
GPU | NVIDIA 3090Ti |
硬盘 | 1 TB |
内存 | 16 GB |
Python版本 | Python3.6 |
Pytorch版本 | Pytorch1.1 |
模型 | 精确率 | 召回率 | F1值 |
---|---|---|---|
文献[ | 88.94 | 84.20 | 86.51 |
文献[ | 91.22 | 81.71 | 86.20 |
文献[ | 91.86 | 88.75 | 90.28 |
文献[ | 92.20 | 90.18 | 91.18 |
文献[ | 91.28 | 90.62 | 90.95 |
本文模型 | 94.40 | 93.15 | 93.76 |
Tab.6 Comparison results on MSRA dataset
模型 | 精确率 | 召回率 | F1值 |
---|---|---|---|
文献[ | 88.94 | 84.20 | 86.51 |
文献[ | 91.22 | 81.71 | 86.20 |
文献[ | 91.86 | 88.75 | 90.28 |
文献[ | 92.20 | 90.18 | 91.18 |
文献[ | 91.28 | 90.62 | 90.95 |
本文模型 | 94.40 | 93.15 | 93.76 |
模型 | 精确率 | 召回率 | F1值 |
---|---|---|---|
文献[ | 69.16 | 69.48 | 69.32 |
文献[ | — | — | 72.82 |
文献[ | 67.82 | 64.80 | 66.28 |
本文模型 | 79.67 | 78.70 | 79.00 |
Tab.7 Comparison results on BioNLP/NLPBA dataset
模型 | 精确率 | 召回率 | F1值 |
---|---|---|---|
文献[ | 69.16 | 69.48 | 69.32 |
文献[ | — | — | 72.82 |
文献[ | 67.82 | 64.80 | 66.28 |
本文模型 | 79.67 | 78.70 | 79.00 |
1 | HUANG Z H, XU W, YU K. Bidirectional LSTM-CRF models for sequence tagging[EB/OL]. (2015-08-09) [2020-05-01].. |
2 | SONG H J, JO B C, PARK C Y, et al. Comparison of named entity recognition methodologies in biomedical documents[J]. Biomedical Engineering Online, 2018, 17: No.158. 10.1186/s12938-018-0573-6 |
3 | LUO Y, XIAO F S, ZHAO H. Hierarchical contextualized representation for named entity recognition[C]// Proceedings of the 2020 AAAI Conference on Artificial Intelligence. Palo Alto, CA: AAAI Press, 2020: 8441-8448. 10.1609/aaai.v34i05.6363 |
4 | GAJENDRAN S, MANJULA D, SUGUMARAN V. Character level and word level embedding with bidirectional LSTM-Dynamic recurrent neural network for biomedical named entity recognition from literature[J]. Journal of Biomedical Informatics, 2020, 112(1): No.103609. |
5 | DONG C H, ZHANG J J, ZONG C Q, et al. Character-based LSTM-CRF with radical-level features for Chinese named entity recognition[C]// Proceedings of the 2016 International Conference on Computer Processing of Oriental Languages/2016 National CCF Conference on Natural Language Processing and Chinese Computing, LNCS 10102. Cham: Springer, 2016: 239-250. |
6 | 张海楠,伍大勇,刘悦,等. 基于深度神经网络的中文命名实体识别[J]. 中文信息学报, 2017, 31(4):28-35. 10.3969/j.issn.1003-0077.2017.04.005 |
ZHANG H N, WU D Y, LIU Y, et al. Chinese named entity recognition based on deep neural network[J]. Journal of Chinese Information Processing, 2017, 31(4):28-35. 10.3969/j.issn.1003-0077.2017.04.005 | |
7 | 刘宇瀚,刘常健,徐睿峰,等. 结合字形特征与迭代学习的金融领域命名实体识别[J]. 中文信息学报, 2020, 34(11): 74-83. 10.3969/j.issn.1003-0077.2020.11.010 |
LIU Y H, LIU C J, XU R F, et al. Utilizing glyph feature and iterative learning for named entity recognition in finance text[J]. Journal of Chinese Information Processing, 2020, 34(11): 74-83. 10.3969/j.issn.1003-0077.2020.11.010 | |
8 | ZHANG Y, YANG J. Chinese NER using lattice LSTM[C]// Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Stroudsburg, PA: Association for Computational Linguistics, 2018: 1554-1564. 10.18653/v1/p18-1144 |
9 | YU X M, FENG W Z, WANG H, et al. An attention mechanism and multi-granularity-based Bi-LSTM model for Chinese Q&A system[J]. Soft Computing, 2020, 24(8):5831-5845. 10.1007/s00500-019-04367-8 |
10 | DAI J H, FENG C, BAI X F, et al. AERNs: attention-based entity region networks for multi-grained named entity recognition[C]// Proceedings of the IEEE 31st International Conference on Tools with Artificial Intelligence. Piscataway: IEEE, 2019: 408-415. 10.1109/ictai.2019.00064 |
11 | 张晗,郭渊博,李涛. 结合GAN与BiLSTM-Attention-CRF的领域命名实体识别[J]. 计算机研究与发展, 2019, 56(9):1851-1858. 10.7544/issn1000-1239.2019.20180733 |
ZHANG H, GUO Y B, LI T. Domain named entity recognition combining GAN and BiLSTM-Attention-CRF[J]. Journal of Computer Research and Development, 2019, 56(9):1851-1858. 10.7544/issn1000-1239.2019.20180733 | |
12 | VASWANI A, SHAZEER N, PARMAR N, et al. Attention is all you need[C]// Proceedings of the 31st International Conference on Neural Information Processing Systems. Red Hook, NY: Curran Associates Inc., 2017: 6000-6010. 10.1016/s0262-4079(17)32358-8 |
13 | 王得贤,王素格,裴文生,等. 基于JCWA-DLSTM的法律文书命名实体识别方法[J]. 中文信息学报, 2020, 34(10): 51-58. 10.3969/j.issn.1003-0077.2020.10.007 |
WANG D X, WANG S G, PEI W S, et al. Named entity recognition based on JCWA-DLSTM for legal instruments[J]. Journal of Chinese Information Processing, 2020, 34(10): 51-58. 10.3969/j.issn.1003-0077.2020.10.007 | |
14 | CHEN X Y, SHI S H, ZHAN S Y, et al. Named entity recognition of Chinese electronic medical records based on cascaded conditional random field[C]// Proceedings of the IEEE 4th International Conference on Big Data Analytics. Piscataway: IEEE, 2019: 364-368. 10.1109/icbda.2019.8713244 |
15 | SUN X L, SUN S L, YIN M Z, et al. Hybrid neural conditional random fields for multi-view sequence labeling[J]. Knowledge-Based Systems, 2020, 189: No.105151. 10.1016/j.knosys.2019.105151 |
16 | LIU J M, SUN C, YUAN Y. The BERT-BiLSTM-CRF question event information extraction method[C]// Proceedings of the IEEE 3rd International Conference on Electronic Information and Communication Technology. Piscataway: IEEE, 2020: 729-733. 10.1109/iceict51264.2020.9334197 |
17 | HU J M, ZHENG X. Opinion extraction of government microblog comments via BiLSTM-CRF model [C]// Proceedings of the 2020 ACM/IEEE Joint Conference on Digital Libraries. New York: ACM, 2020: 473-475. 10.1145/3383583.3398570 |
18 | YANG X M, GAO Z H, LI Y M, et al. Bidirectional LSTM-CRF for biomedical named entity recognition[C]// Proceedings of the 14th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery. Piscataway: IEEE, 2018: 239-242. 10.1109/fskd.2018.8687117 |
19 | 胡慧君,王聪,代建华,等.基于BiLSTM-CRF 的社会突发事件研判方法[J]. 中文信息学报, 2022, 36(3):154-161. |
HU H J, WANG C, DAI J H,et al. Social emergency event judgement based on BiLSTM-CRF[J]. Journal of Chinese Information Processing, 2022, 36(3):154-161. | |
20 | 古雪梅,刘嘉勇,程芃森,等. 基于增强 BiLSTM-CRF 模型的推文恶意软件名称识别[J]. 计算机科学, 2020, 47(2): 245-250. |
GU X M, LIU J Y, CHENG P S, et al. Malware name recognition in tweets based on enhanced BiLSTM-CRF model[J]. Computer Science, 2020, 47(2): 245-250. | |
21 | WU Z H, PAN S R, CHEN F W, et al. A comprehensive survey on graph neural networks[J]. IEEE Transactions on Neural Networks and Learning Systems, 2021, 32(1): 4-24. 10.1109/tnnls.2020.2978386 |
22 | YAO L, MAO C S, LUO Y. Graph convolutional networks for text classification[C]// Proceedings of the 33rd AAAI Conference on Artificial Intelligence. Palo Alto, CA: AAAI Press, 2019: 7370-7377. 10.1609/aaai.v33i01.33017370 |
23 | ZHOU J S, HE L, DAI X Y, et al. Chinese named entity recognition with a multi-phase model[C]// Proceedings of the 5th SIGHAN Workshop on Chinese Language Processing. Stroudsburg, PA: Association for Computational Linguistics, 2006: 213-216. |
24 | CHEN A T, PENG F C, SHAN R, et al. Chinese named entity recognition with conditional probabilistic models[C]// Proceedings of the 5th SIGHAN Workshop on Chinese Language Processing. Stroudsburg, PA: Association for Computational Linguistics, 2006: 173-176. |
25 | ZHOU J H, QU W G, ZHANG F. Chinese named entity recognition via joint identification and categorization[J]. Chinese Journal of Electronics, 2013, 22(2):225-230. |
26 | YIMAM S M, BIEMANN C, MAJNARIC L, et al. An adaptive annotation approach for biomedical entity and relation recognition[J]. Brain Informatics, 2016, 3:157-168. 10.1007/s40708-016-0036-4 |
27 | SONG Y, KIM E, LEE G G, et al. POSBIOTM-NER in the shared task of BioNLP/NLPBA2004[C]// Proceedings of the 2004 International Joint Workshop on Natural Language Processing in Biomedicine and its Applications. [S.l.]: COLING, 2004:103-106. 10.3115/1567594.1567617 |
[1] | Jing QIN, Zhiguang QIN, Fali LI, Yueheng PENG. Diagnosis of major depressive disorder based on probabilistic sparse self-attention neural network [J]. Journal of Computer Applications, 2024, 44(9): 2970-2974. |
[2] | Liting LI, Bei HUA, Ruozhou HE, Kuang XU. Multivariate time series prediction model based on decoupled attention mechanism [J]. Journal of Computer Applications, 2024, 44(9): 2732-2738. |
[3] | Huanliang SUN, Siyi WANG, Junling LIU, Jingke XU. Help-seeking information extraction model for flood event in social media data [J]. Journal of Computer Applications, 2024, 44(8): 2437-2445. |
[4] | Zexin XU, Lei YANG, Kangshun LI. Shorter long-sequence time series forecasting model [J]. Journal of Computer Applications, 2024, 44(6): 1824-1831. |
[5] | Youren YU, Yangsen ZHANG, Yuru JIANG, Gaijuan HUANG. Chinese named entity recognition model incorporating multi-granularity linguistic knowledge and hierarchical information [J]. Journal of Computer Applications, 2024, 44(6): 1706-1712. |
[6] | Yue LIU, Fang LIU, Aoyun WU, Qiuyue CHAI, Tianxiao WANG. 3D object detection network based on self-attention mechanism and graph convolution [J]. Journal of Computer Applications, 2024, 44(6): 1972-1977. |
[7] | Rong HUANG, Junjie SONG, Shubo ZHOU, Hao LIU. Image aesthetic quality evaluation method based on self-supervised vision Transformer [J]. Journal of Computer Applications, 2024, 44(4): 1269-1276. |
[8] | Yongfeng DONG, Jiaming BAI, Liqin WANG, Xu WANG. Chinese named entity recognition combining prior knowledge and glyph features [J]. Journal of Computer Applications, 2024, 44(3): 702-708. |
[9] | Xinran LUO, Tianrui LI, Zhen JIA. Chinese medical named entity recognition based on self-attention mechanism and lexicon enhancement [J]. Journal of Computer Applications, 2024, 44(2): 385-392. |
[10] | Ziqi HUANG, Jianpeng HU. Entity category enhanced nested named entity recognition in automotive domain [J]. Journal of Computer Applications, 2024, 44(2): 377-384. |
[11] | Liqing QIU, Xiaopan SU. Personalized multi-layer interest extraction click-through rate prediction model [J]. Journal of Computer Applications, 2024, 44(11): 3411-3418. |
[12] | Xingyao YANG, Hongtao SHEN, Zulian ZHANG, Jiong YU, Jiaying CHEN, Dongxiao WANG. Sequential recommendation based on hierarchical filter and temporal convolution enhanced self-attention network [J]. Journal of Computer Applications, 2024, 44(10): 3090-3096. |
[13] | Yanbo LI, Qing HE, Shunyi LU. Aspect sentiment triplet extraction integrating semantic and syntactic information [J]. Journal of Computer Applications, 2024, 44(10): 3275-3280. |
[14] | Jia CHEN, Hong ZHANG. Image text retrieval method based on feature enhancement and semantic correlation matching [J]. Journal of Computer Applications, 2024, 44(1): 16-23. |
[15] | Hanxiao SHI, Leichun WANG. Short-term power load forecasting by graph convolutional network combining LSTM and self-attention mechanism [J]. Journal of Computer Applications, 2024, 44(1): 311-317. |
Viewed | ||||||
Full text |
|
|||||
Abstract |
|
|||||