《计算机应用》唯一官方网站

• •    下一篇

不完整多视图聚类综述

董瑶,付怡雪,董永峰,史进,陈晨   

  1. 河北工业大学
  • 收稿日期:2023-06-25 修回日期:2023-07-18 发布日期:2023-08-21 出版日期:2023-08-21
  • 通讯作者: 付怡雪
  • 基金资助:
    多源异构校园行为数据的知识图谱构建技术研究;基于逻辑规则与强化学习的知识图谱推理技术研究;基于学生行为分析的新工科人才学习质量提升路径的探索与实践;基于知识图谱的计算机专业教学资源构建及应用

Survey of incomplete multi-view clustering

  • Received:2023-06-25 Revised:2023-07-18 Online:2023-08-21 Published:2023-08-21

摘要: 多视图聚类是近年来图数据挖掘领域的研究热点。但由于数据采集技术的限制或人为因素等原因常导致视图或样本缺失问题。如何降低多视图的不完整性对聚类效果的影响是多视图聚类目前面临的重大挑战。因此,综合研究不完整多视图聚类近年的发展具有重要的理论意义和实践价值。首先,归纳分析不完整多视图数据缺失类型;其次,详细比较了基于多核学习、矩阵分解学习、深度学习、图学习的四类不完整多视图聚类方法,分析代表性方法的技术特点和区别;再次,从数据集类型、视图和类别数量、应用领域等角度总结22个公开不完整多视图数据集;然后,总结评价指标,并系统分析现有不完整多视图聚类方法在同构和异构数据集上的性能表现;最后,归纳分析不完整多视图聚类目前存在的问题、未来的发展方向及现有应用领域。

关键词: 不完整性, 多视图聚类, 图数据挖掘, 缺失视图, 多视图学习

Abstract: Multi-view clustering has recently been a hot topic in graph data mining. However, due to the limitations of data collection technology or human factors, multi-view data often has the problem of missing views or samples. Incompleteness of multi-views is one of the major challenges facing multi-view clustering. In order to better understand the development of incomplete multi-view clustering in recent years, a comprehensive review is of great theoretical and practical importance. Firstly, the missing types of incomplete multi-view data were summarized and analyzed. Secondly, four types of incomplete multi-view clustering methods, based on multiple kernel learning, matrix factorization learning, deep learning, and graph learning were compared, and the technical characteristics and differences among the methods were analyzed. Thirdly, from the perspectives of dataset types, the number of views and categories, and application fields, 22 public incomplete multi-view datasets were conducted. Then, the evaluation metrics were outlined, and the performance of existing incomplete multi-view clustering methods on homogeneous and heterogeneous datasets were evaluated. Finally, the existing problems, future research directions, and existing application fields of incomplete multi-view clustering were discussed.

Key words: incompleteness, multi-view clustering, graph data mining, missing view, multi-view learning

中图分类号: