《计算机应用》唯一官方网站 ›› 2025, Vol. 45 ›› Issue (12): 3764-3770.DOI: 10.11772/j.issn.1001-9081.2024121806

• 人工智能 • 上一篇    下一篇

基于分层强化学习的知识图谱约束问答模型

许浩翔1, 余敦辉1,2, 邓怡辰3, 肖奎1,2   

  1. 1.湖北大学 计算机学院,武汉 430062
    2.大数据智能分析与行业应用湖北省重点实验室(湖北大学),武汉 430062
    3.武昌首义学院 信息科学与工程学院,武汉 430064
  • 收稿日期:2024-12-24 修回日期:2025-04-14 接受日期:2025-04-18 发布日期:2025-04-25 出版日期:2025-12-10
  • 通讯作者: 邓怡辰
  • 作者简介:许浩翔(1999—),男,江苏扬州人,硕士研究生,主要研究方向:知识图谱、知识推理
    余敦辉(1974—),男,湖北武汉人,教授,博士,CCF会员,主要研究方向:群体智能、知识图谱
    邓怡辰(1993—),女,湖北武汉人,讲师,硕士,主要研究方向:群体智能、知识图谱
    肖奎(1981—),男,湖北武汉人,副教授,博士,CCF会员,主要研究方向:数据挖掘、智慧教育。
  • 基金资助:
    国家自然科学基金资助项目(62377009)

Knowledge graph constrained question answering model based on hierarchical reinforcement learning

Haoxiang XU1, Dunhui YU1,2, Yichen DENG3, Kui XIAO1,2   

  1. 1.School of Computer Science,Hubei University,Wuhan Hubei 430062,China
    2.Hubei Key Laboratory of Big Data Intelligent Analysis and Industrial Application (Hubei University),Wuhan Hubei 430062,China
    3.College of Information Science and Engineering,Wuchang Shouyi University,Wuhan Hubei 430064,China
  • Received:2024-12-24 Revised:2025-04-14 Accepted:2025-04-18 Online:2025-04-25 Published:2025-12-10
  • Contact: Yichen DENG
  • About author:XU Haoxiang, born in 1999, M. S. candidate. His research interests include knowledge graph, knowledge reasoning.
    YU Dunhui, born in 1974, Ph. D., professor. His research interests include swarm intelligence, knowledge graph.
    DENG Yichen, born in 1993, M. S., lecturer. Her research interests include swarm intelligence, knowledge graph.
    XIAO Kui, born in 1981, Ph. D., associate professor. His research interests include data mining, smart education.
  • Supported by:
    National Natural Science Foundation of China(62377009)

摘要:

针对知识图谱问答(KGQA)中忽略约束信息和长路径推理的维数灾难问题,提出一种基于分层强化学习(HRL)的知识图谱约束问答(KGQA-HRL)模型。首先,深度融合HRL理念,拆解知识图谱中的三元组,并设计出上层策略和底层策略,从而化解推理路径的维数灾难隐患;其次,为了提高路径选择的准确性,提出基于注意力机制的动作选择策略和融合约束信息的实体选择策略,从而有效压缩推理的搜索范围;再次,在动作选择与实体选择策略之间嵌入问题更新环节,从而使每跳问题进行二次更新;最后,在实体选择策略中构建约束集并计算约束得分,以考虑问题中的约束信息,从而提高实体选择的准确性。基于4个KGQA基准数据集,对KGQA-HRL模型的性能进行实验的结果显示:KGQA-HRL模型在所有数据集上均达到最优准确率,较之前最佳模型约束路径推理(COPAR)提升了2.9%,且在复杂的3跳查询任务中表现突出(PQ (PathQuestion)数据集上提升了3.6%,MetaQA数据集上提升了2.5%),验证了KGQA-HRL模型优异的推理能力。

关键词: 知识图谱问答, 分层强化学习, 复杂约束问题, 注意力机制, 维度灾难

Abstract:

To address the issues of ignoring constraint information and the curse of dimensionality in long-path reasoning in Knowledge Graph Question Answering (KGQA), a Knowledge Graph constrained Question Answering based on Hierarchical Reinforcement Learning (HRL) (KGQA-HRL) model was proposed. Firstly, the concept of HRL was integrated deeply, triples in the knowledge graph were decomposed, and a high-level policy as well as a low-level policy was designed, so as to mitigate the curse of dimensionality risk in reasoning paths. Secondly, to improve the accuracy of path selection, an attention-based action selection strategy and an entity selection strategy incorporating constraint information were introduced, thereby narrowing the search space for reasoning effectively. Thirdly, a question update phase was embedded between the action selection and entity selection strategies, thereby enabling secondary update of the question at each hop. Finally, in the entity selection strategy, a constraint set was constructed and constraint scores were calculated, so as to incorporate constraint information from the question, thereby enhancing the accuracy of entity selection. Experimental results on four KGQA benchmark datasets to evaluate the performance of KGQA-HRL model demonstrate that KGQA-HRL model achieves the optimal accuracy on all datasets, with an average improvement of 2.9% over the previous best model reinforcement learning based COnstrained PAth Reasoning (COPAR). At the same time, KGQA-HRL model has outstanding performance in complex three-hop query tasks (3.6% improvement on the PQ (PathQuestion) dataset and 2.5% improvement on MetaQA dataset), validating good reasoning capability of KGQA-HRL model.

Key words: Knowledge Graph Question Answering (KGQA), Hierarchical Reinforcement Learning (HRL), complex constrained problem, attention mechanism, curse of dimensionality

中图分类号: