基于记忆增强和跨度筛选的实体关系联合抽取模型

doi:10.11772/j.issn.1001-9081.2024111567

《计算机应用》唯一官方网站

• • 下一篇

基于记忆增强和跨度筛选的实体关系联合抽取模型

刘爽,罗桂君,孟佳娜

大连民族大学

收稿日期:2024-11-05 修回日期:2025-03-16 接受日期:2025-03-20 发布日期:2025-04-02 出版日期:2025-04-02
通讯作者: 刘爽
基金资助:
2023年教育部人文社会科学研究与规划基金

Entity relation joint extraction model based on memory enhancement and span screening

Received:2024-11-05 Revised:2025-03-16 Accepted:2025-03-20 Online:2025-04-02 Published:2025-04-02
Supported by:
2023 Humanities and Social Sciences Research and Planning Fund of the Ministry of Education

摘要/Abstract

摘要： 实体和关系抽取(ERE)通常采用流水线的方式进行处理，但这种流水线方法仅依赖于前一个任务的输出，导致实体识别和关系抽取之间出现信息交互问题，且容易引发误差传播问题。针对以上问题，提出一种基于跨度筛选且具备双向依赖的记忆增强(MEERE)模型。该模型引入类似记忆的机制，使每个任务不仅能利用前一任务的输出，还能够反向影响前一任务，从而捕获实体和关系间的复杂交互。其次为进一步减轻误差传播，引入实体跨度筛选机制。该机制通过在联合模块中动态筛选和验证实体跨度，确保只有高质量的实体被用于关系抽取，从而提升模型的鲁棒性和准确性。最后利用表格解码方式很好地处理关系重叠问题。在3个广泛使用的基准数据集(ACE05、SciERC和CoNLL04)上进行的实验结果表明，MEERE模型在ERE任务上表现出了显著的优势，与现有方法相比，它在实体识别和关系抽取的准确性上均有提升，特别是在减少误差传播和提高整体模型稳定性方面表现突出。与Tab-Seq在CoNLL04数据集上比较，MEERE在实体和关系抽取上都有显著提升，实体F1提升了0.4个百分点，关系严格评估F1提升了2.7个百分点。相比PURE-F，MEERE实现了超过10倍的加速效果，并且关系抽取性能更佳。这些结果验证了所提出的记忆增强模型在探索实体和关系交互作用方面的有效性。

关键词: 实体关系抽取, 记忆增强, 跨度筛选, 预训练语言模型, 跨句子上下文

Abstract: Entity and relation extraction (ERE) is usually processed in a pipeline manner, but this pipeline method only relies on the output of the previous task, which leads to information interaction problems between entity recognition and relation extraction, and easily causes error propagation problems. A memory enhancement model based on span screening and bidirectional dependency (MEERE) is proposed to address the above issues. This model introduces a memory-like mechanism so that each task can not only utilize the output of the previous task but also reversely affect the previous task, thereby capturing the complex interaction between entities and relations. Secondly, an entity span screening mechanism is introduced to further alleviate error propagation. This mechanism ensures that only high-quality entities are used for relation extraction by dynamically screening and verifying entity spans in the joint module, thereby improving the robustness and accuracy of the model. Finally, the table decoding method is used to deal with the relationship overlap problem well. Experimental results on three widely used benchmark datasets (ACE05, SciERC, and CoNLL04) show that the MEERE model shows significant advantages in the ERE task. Compared with existing methods, it has improved the accuracy of both entity recognition and relation extraction, especially in reducing error propagation and improving overall model stability. Compared with Tab-Seq on the CoNLL04 dataset, MEERE has significant improvements in both entity and relation extraction, with an increase of 0.4 percentage points in entity F1 and 2.7 percentage points in strict evaluation F1. Compared with PURE-F, MEERE achieves more than 10 times and better relation extraction performance. These results verify the effectiveness of the proposed memory-enhanced model in exploring the interaction between entities and relations.

Key words: Entity relation extraction, Memory enhancement, Span screening, Pre-trained language models, Cross-sentence context

中图分类号:

TP391.1

刘爽罗桂君孟佳娜. 基于记忆增强和跨度筛选的实体关系联合抽取模型[J]. 计算机应用, DOI: 10.11772/j.issn.1001-9081.2024111567.

[1]	王利琴, 耿智雷, 李英双, 董永峰, 边萌. 基于路径和增强三元组文本的开放世界知识推理模型[J]. 《计算机应用》唯一官方网站, 2025, 45(4): 1177-1183.
[2]	李斌, 林民, 斯日古楞null, 高颖杰, 王玉荣, 张树钧. 基于提示学习和全局指针网络的中文古籍实体关系联合抽取方法[J]. 《计算机应用》唯一官方网站, 2025, 45(1): 75-81.
[3]	吴相岚, 肖洋, 刘梦莹, 刘明铭. 基于语义增强模式链接的Text-to-SQL模型[J]. 《计算机应用》唯一官方网站, 2024, 44(9): 2689-2695.
[4]	毛典辉, 李学博, 刘峻岭, 张登辉, 颜文婧. 基于并行异构图和序列注意力机制的中文实体关系抽取模型[J]. 《计算机应用》唯一官方网站, 2024, 44(7): 2018-2025.
[5]	魏超, 陈艳平, 王凯, 秦永彬, 黄瑞章. 基于掩码提示与门控记忆网络校准的关系抽取方法[J]. 《计算机应用》唯一官方网站, 2024, 44(6): 1713-1719.
[6]	郭安迪, 贾真, 李天瑞. 基于伪实体数据增强的高精准率医学领域实体关系抽取[J]. 《计算机应用》唯一官方网站, 2024, 44(2): 393-402.
[7]	黄梦林, 段磊, 张袁昊, 王培妍, 李仁昊. 基于Prompt学习的无监督关系抽取模型[J]. 《计算机应用》唯一官方网站, 2023, 43(7): 2010-2016.
[8]	高永兵, 高军甜, 马蓉, 杨立东. 用户粒度级的个性化社交文本生成模型[J]. 《计算机应用》唯一官方网站, 2023, 43(4): 1021-1028.
[9]	许亮, 张春, 张宁, 田雪涛. 融合多Prompt模板的零样本关系抽取模型[J]. 《计算机应用》唯一官方网站, 2023, 43(12): 3668-3675.
[10]	江静, 陈渝, 孙界平, 琚生根. 融合后验概率校准训练的文本分类算法[J]. 《计算机应用》唯一官方网站, 2022, 42(6): 1789-1795.
[11]	张海丰, 曾诚, 潘列, 郝儒松, 温超东, 何鹏. 结合BERT和特征投影网络的新闻主题文本分类方法[J]. 《计算机应用》唯一官方网站, 2022, 42(4): 1116-1124.
[12]	王小鹏, 孙媛媛, 林鸿飞. 基于刑事Electra的编-解码关系抽取模型[J]. 《计算机应用》唯一官方网站, 2022, 42(1): 87-93.
[13]	刘雅璇, 钟勇. 基于头实体注意力的实体关系联合抽取方法[J]. 计算机应用, 2021, 41(9): 2517-2522.
[14]	崔博文, 金涛, 王建民. 自由文本电子病历信息抽取综述[J]. 计算机应用, 2021, 41(4): 1055-1063.
[15]	李志超, 吐尔地·托合提, 艾斯卡尔·艾木都拉. 基于动态注意力和多角度匹配的答案选择模型[J]. 《计算机应用》唯一官方网站, 2021, 41(11): 3156-3163.

基于记忆增强和跨度筛选的实体关系联合抽取模型

Entity relation joint extraction model based on memory enhancement and span screening

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics