Journal of Computer Applications ›› 2025, Vol. 45 ›› Issue (4): 1177-1183.DOI: 10.11772/j.issn.1001-9081.2024030265

• Artificial intelligence • Previous Articles     Next Articles

Open-world knowledge reasoning model based on path and enhanced triplet text

Liqin WANG1,2,3, Zhilei GENG1, Yingshuang LI4(), Yongfeng DONG1,2,3, Meng BIAN5   

  1. 1.School of Artificial Intelligence,Hebei University of Technology,Tianjin 300401,China
    2.Hebei Province Key Laboratory of Big Data Calculation (Hebei University of Technology),Tianjin 300401,China
    3.Hebei Engineering Research Center of Data-Driven Industrial Intelligence (Hebei University of Technology),Tianjin 300401,China
    4.Information Security and Technology Service Center,Hebei University of Technology,Tianjin 300401,China
    5.Tianjin Nankai District Teacher Development Center,Tianjin 300401,China
  • Received:2024-03-13 Revised:2024-04-19 Accepted:2024-04-22 Online:2024-07-01 Published:2025-04-10
  • Contact: Yingshuang LI
  • About author:WANG Liqin, born in 1980, Ph. D., senior experimentalist. Her research interests include intelligent information processing, knowledge graph.
    GENG Zhilei, born in 1998, M. S. candidate. His research interests include natural language processing, knowledge graph.
    DONG Yongfeng, born in 1977, Ph. D., professor. His research interests include artificial intelligence, knowledge graph.
    BIAN Meng, born in 1980, M. S., senior teacher. Her research interests include modern education.
  • Supported by:
    Science and Technology Research Program of Higher Education Institutions in Hebei Province(ZD2022082);Higher Education Teaching Reform Research and Practice Project of Hebei Province(2022GJJG049)

基于路径和增强三元组文本的开放世界知识推理模型

王利琴1,2,3, 耿智雷1, 李英双4(), 董永峰1,2,3, 边萌5   

  1. 1.河北工业大学 人工智能与数据科学学院,天津 300401
    2.河北省大数据计算重点实验室(河北工业大学),天津 300401
    3.河北省数据驱动工业智能工程研究中心(河北工业大学),天津 300401
    4.河北工业大学 信息安全与技术服务中心,天津 300401
    5.天津市南开区教师发展中心,天津 300110
  • 通讯作者: 李英双
  • 作者简介:王利琴(1980—),女,河北张北人,高级实验师,博士,CCF会员,主要研究方向:智能信息处理、知识图谱;
    耿智雷(1998—),男,河南商丘人,硕士研究生,主要研究方向:自然语言处理、知识图谱;
    董永峰(1977—),男,河北定州人,教授,博士,CCF高级会员,主要研究方向:人工智能、知识图谱;
    边萌(1980—),女,天津人,高级教师,硕士,主要研究方向:现代教育。
  • 基金资助:
    河北省高等学校科学技术研究项目(ZD2022082);河北省高等教育教学改革研究与实践项目(2022GJJG049)

Abstract:

Traditional knowledge reasoning methods based on representation learning can only be used for closed-world knowledge reasoning. Conducting open-world knowledge reasoning effectively is a hot issue currently. Therefore, a knowledge reasoning model based on path and enhanced triplet text, named PEOR (Path and Enhanced triplet text for Open world knowledge Reasoning), was proposed. First, multiple paths generated by structures between entity pairs and enhanced triplets generated by individual entity neighborhood structures were utilized. Among then, the path text was obtained by concatenating the text of triplets in the path, and the enhanced triplet text was obtained by concatenating the text of head entity neighborhood, relation, and tail entity neighborhood. Then, BERT (Bidirectional Encoder Representations from Transformers) was employed to encode the path text and enhanced triplet text separately. Finally, semantic matching attention calculation was performed using path vectors and triplet vectors, followed by aggregation of semantic information from multiple paths using semantic matching attention. Comparison experimental results on three open-world knowledge graph datasets: WN18RR, FB15k-237, and NELL-995 show that compared with suboptimal model BERTRL (BERT-based Relational Learning), the proposed model has Hits@10 (Hit ratio) metric improved by 2.6, 2.3 and 8.5 percentage points, respectively, validating the effectiveness of the proposed model.

Key words: Knowledge Graph (KG), text information, pre-trained language model, open-world knowledge reasoning, attention mechanism

摘要:

传统的基于表示学习的知识推理方法只能用于封闭世界的知识推理,有效进行开放世界的知识推理是目前的热点问题。因此,提出一种基于路径和增强三元组文本的开放世界知识推理模型PEOR(Path and Enhanced triplet text for Open world knowledge Reasoning)。首先,使用由实体对间结构生成的多条路径和单个实体周围结构生成的增强三元组,其中路径文本通过拼接路径中的三元组文本得到,而增强三元组文本通过拼接头实体邻域文本、关系文本和尾实体邻域文本得到;其次,使用BERT(Bidirectional Encoder Representations from Transformers)分别编码路径文本和增强三元组文本;最后,使用路径向量和三元组向量计算语义匹配注意力,再使用语义匹配注意力聚合多条路径的语义信息。在3个开放世界知识图谱数据集WN18RR、FB15k-237和NELL-995上的对比实验结果表明,与次优模型BERTRL(BERT-based Relational Learning)相比,所提模型的命中率(Hits@10)指标分别提升了2.6、2.3和8.5个百分点,验证了所提模型的有效性。

关键词: 知识图谱, 文本信息, 预训练语言模型, 开放世界知识推理, 注意力机制

CLC Number: