《计算机应用》唯一官方网站

• •    下一篇

基于大语言模型的超关系知识图谱限定符增强方法

李佳航,韩启龙,李丽洁,张慧   

  1. 哈尔滨工程大学
  • 收稿日期:2025-07-09 修回日期:2025-09-01 发布日期:2025-09-12 出版日期:2025-09-12
  • 通讯作者: 李佳航
  • 基金资助:
    动态时空依赖感知的空气质量数据预测关键技术研究;面向工业企业的可信数据空间关键技术与系统研发

Large language model-driven method for qualifier enhancement in hyper-relational knowledge graphs

  • Received:2025-07-09 Revised:2025-09-01 Online:2025-09-12 Published:2025-09-12

摘要: 针对超关系知识图谱(HKG)中限定符稀疏现象导致的超关系事实语义表征不完整、任务精度和泛化能力不足的问题,提出基于大语言模型(LLM)的HKG限定符增强方法L-EQs(Large language model-driven method for Qualifier Enhancement in hyper-relational knowledge graphS)。首先,通过关联知识库获取语义丰富的描述信息,以减轻实体和关系中因相同标签而导致的语义混淆问题。其次,结合提示词模板引导大语言模型生成匹配的限定符及其语义解释作为外源知识。为保证限定符的质量,采用迭代式提示策略对生成的限定符进行筛选,并通过多次迭代生成直至达到预定数量。再次,通过语义解释构建外源知识嵌入,并利用外源知识聚合模块抑制噪声干扰,将外源知识与原始知识嵌入融合,为下游任务提供高质量的语义支持。最后,通过解码器预测任务结果,并在开源数据集WikiPeople和WD50K上进行广泛实验,以验证L-EQs方法的有效性。实验结果表明,在WikiPeople数据集上,L-EQs方法在MRR、Hit@1、Hit@5和Hit@10相较于基线中的最优结果提升了8.4、4.7、5.3和6.8个百分点,在WD50K数据集上分别提升了4.8、2.3、2.7和3.0个百分点。这一显著提升证明L-EQs方法解决HKG中限定符稀疏现象的有效性,改善了超关系事实语义表征不完整带来的问题。

关键词: 超关系知识图谱, 大语言模型, 超关系知识图谱补全, 超关系事实, 限定符

Abstract: Aiming at the problems of incomplete semantic representation in hyper-relational facts, insufficient task accuracy and generalization ability caused by the qualifier sparsity in Hyper-relational Knowledge Graph (HKG), a Large Language Model (LLM) driven method L-EQs (Large language model-driven method for Qualifier Enhancement in hyper-relational knowledge graphS) was proposed to alleviate qualifier sparsity and improve semantic completeness. First, semantically rich descriptions were extracted from knowledge bases to mitigate semantic ambiguity caused by identical labels of entities and relations. Then, prompt templates were used to guide the LLM to generate qualifiers and their semantic explanations as external knowledge. To ensure the quality of the qualifiers, an iterative prompting strategy was employed to filter the generated qualifiers, with multiple iterations conducted until the predetermined quantity was reached. Third, embeddings of external knowledge were constructed, and an external knowledge aggregation module was introduced to suppress noise during the aggregation process, thereby providing high-quality semantic support for the completion task. Finally, the task results were predicted through a decoder, and extensive experiments were conducted on the open-source datasets WikiPeople and WD50K to validate the effectiveness of the L-EQs method. Experimental results show that, compared to the optimal results of the baseline method, the MRR, Hit@1, Hit@5, and Hit@10 of L-EQs on the WikiPeople dataset increased by 8.4, 4.7, 5.3, and 6.8 percentage points, respectively. On the WD50K dataset, corresponding improvements of 4.8, 2.3, 2.7, and 3.0 percentage points were also observed. This significant improvement proves the effectiveness of the L-EQs in addressing the qualifier sparsity and alleviating the issues caused by the incomplete semantic representations.

Key words: Hyper-relational knowledge graph (HKG), Large language mode (LLM), hyper-relational knowledge graph completion, hyper-relational fact, qualifier

中图分类号: