Journal of Computer Applications ›› 2026, Vol. 46 ›› Issue (2): 341-353.DOI: 10.11772/j.issn.1001-9081.2025030273

• Artificial intelligence •    

Review of multi-modal knowledge graph completion methods

Xue WANG, Liping ZHANG(), Sheng YAN, Na LI, Xuefei ZHANG   

  1. College of Computer Science and Technology,Inner Mongolia Normal University,Hohhot Inner Mongolia 010022,China
  • Received:2025-03-18 Revised:2025-05-22 Accepted:2025-05-26 Online:2025-06-12 Published:2026-02-10
  • Contact: Liping ZHANG
  • About author:WANG Xue, born in 2001, M. S. candidate. Her research interests include educational data mining.
    ZHANG Liping, born in 1974, M. S., professor. Her research interests include smart education, software engineering. Email:ciezlp@imnu.edu.cn
    YAN Sheng, born in 1984, M. S., lecturer. His research interests include computer education.
    LI Na, born in 1980, M. S., associate professor. Her research interests include education informatization, information technology education, digital learning resource design and development.
    ZHANG Xuefei, born in 1997, M. S. candidate. His research interests include computer education.
  • Supported by:
    Inner Mongolia Natural Science Foundation(2023LHMS06009);2023 Project of the 14th Five-Year Plan for Educational Science Research of Inner Mongolia Autonomous Region(2023NGHZXZH119);Basic Research Funds of Inner Mongolia Normal University(2022JBQN108)

多模态知识图谱补全方法综述

王雪, 张丽萍(), 闫盛, 李娜, 张学飞   

  1. 内蒙古师范大学 计算机科学技术学院,呼和浩特 010022
  • 通讯作者: 张丽萍
  • 作者简介:王雪(2001—),女,内蒙古巴彦淖尔人,硕士研究生,主要研究方向:教育数据挖掘
    张丽萍(1974—),女,内蒙古呼和浩特人,教授,硕士,主要研究方向:智慧教育、软件工程 Email:ciezlp@imnu.edu.cn
    闫盛(1984—),男,内蒙古包头人,讲师,硕士,CCF会员,主要研究方向:计算机教育
    李娜(1980—),女,湖北黄陂人,副教授,硕士,主要研究方向:教育信息化、信息技术教育、数字化学习资源设计与开发
    张学飞(1997—),男,内蒙古乌兰察布人,硕士研究生,CCF会员,主要研究方向:计算机教育。
  • 基金资助:
    内蒙古自然科学基金资助项目(2023LHMS06009);内蒙古自然科学基金资助项目(2024MS06020);内蒙古自治区教育科学研究“十四五”规划2023年度课题(2023NGHZXZH119);内蒙古自治区教育科学研究“十四五”规划2023年度课题(NGJGH2023234);内蒙古师范大学基本科研业务费专项(2022JBQN108);内蒙古师范大学基本科研业务费专项(2022JBQN008)

Abstract:

Traditional Knowledge Graph (KG) provides a unified and machine-interpretable representation of information on the web, but its limitations in handling multimodal applications are increasingly recognized. To address these limitations, Multi-Modal Knowledge Graph (MMKG) was proposed as an effective solution. However, the integration of multi-modal data into KG often leads to problems such as inadequate modality fusion and reasoning difficulties, which constrain the application and development of MMKG. Therefore, Multi-Modal Knowledge Graph Completion (MMKGC) techniques were introduced to integrate cross-modal information fully in the construction phase and to predict missing links after construction, thereby solving issues in modality fusion and reasoning. Subsequently, an overview of MMKGC methods were presented. Firstly, the basic concepts, widely used benchmark datasets and evaluation metrics of MMKGC were elaborated in detail. Secondly, the existing methods were classified into fusion tasks during the MMKG construction phase and reasoning tasks after construction. The former focused on key techniques such as entity alignment and entity linking, while the latter encompassed three techniques: relation inference, missing information completion, and multi-modal expansion. Thirdly, various MMKGC methods in each category were introduced thoroughly and their characteristics were analyzed. Finally, the problems and challenges faced by MMKGC methods were examined, and a summary of the above was provided.

Key words: multi-modal data, Multi-Modal Knowledge Graph (MMKG), Multi-Modal Knowledge Graphs Completion (MMKGC), entity alignment, relationship inference

摘要:

传统知识图谱(KG)虽然为网络中的信息提供了一种统一的且机器可理解的表示方式,但在处理多模态应用时逐渐暴露出局限性。为了应对这些局限性,研究者提出多模态知识图谱(MMKG)作为有效解决方案。然而,KG引入多模态数据后广泛存在模态融合不充分和推理困难的问题,这制约了MMKG的应用和发展。而多模态知识图谱补全(MMKGC)技术不仅能够在构建阶段充分融合跨模态信息,还能够在构建完成阶段预测缺失的链接,从而解决在模态融合和推理时遇到的问题。因此,对MMKG方法进行综述。首先,详尽阐述MMKGC的基本概述以及常用的基准数据集和评价指标;其次,将现有方法分为针对MMKG构建阶段的融合任务和构建完成阶段的推理任务,前者聚焦于关键技术如实体对齐和实体链接,后者则涵盖关系推理、信息缺失补全及多模态扩展这3类技术;再次,详细介绍了各类MMKGC方法,并分析它们的特点;最后,分析MMKGC方法面临的问题与挑战并总结前面的内容。

关键词: 多模态数据, 多模态知识图谱, 多模态知识图谱补全, 实体对齐, 关系推理

CLC Number: