Journal of Computer Applications ›› 2021, Vol. 41 ›› Issue (12): 3438-3446.DOI: 10.11772/j.issn.1001-9081.2021061056

• The 18th China Conference on Machine Learning • Previous Articles    

Robust multi-view subspace clustering based on consistency graph learning

Zhenjun PAN, Cheng LIANG(), Huaxiang ZHANG   

  1. School of Information Science and Engineering,Shandong Normal University,Jinan Shandong 250358,China
  • Received:2021-05-12 Revised:2021-07-16 Accepted:2021-07-26 Online:2021-12-28 Published:2021-12-10
  • Contact: Cheng LIANG
  • About author:PAN Zhenjun, born in 1996, M. S. candidate. Her research interests include machine learning, biomedical big data analysis.
    ZHANG Huaxiang, born in 1966, Ph. D., professor. His research interests include pattern recognition, multimodal data retrieval, pedestrian re-identification.
  • Supported by:
    the Joint Funds of National Natural Science Foundation of China(U1836216);the Surface Program of National Natural Science Foundation of China(61873089);the Major Fundamental Research Project of Shandong Province(ZR2019ZD03)

基于一致图学习的鲁棒多视图子空间聚类

潘振君, 梁成(), 张化祥   

  1. 山东师范大学 信息科学与工程学院,济南 250358
  • 通讯作者: 梁成
  • 作者简介:潘振君(1996—),女,山东潍坊人,硕士研究生,CCF会员,主要研究方向:机器学习、生物医学大数据分析
    张化祥(1966—),男,山东汶上人,教授,博士,CCF会员,主要研究方向:模式识别、多模态数据检索、行人重识别。
  • 基金资助:
    国家自然科学基金联合基金资助项目(U1836216);国家自然科学基金面上项目(61873089);山东省重大基础研究项目(ZR2019ZD03)

Abstract:

Concerning that the multi-view data analysis is susceptible to the noise of the original dataset and requires additional steps to calculate the clustering results, a Robust Multi-view subspace clustering based on Consistency Graph Learning (RMCGL) algorithm was proposed. Firstly, the potential robust representation of data in the subspace was learned in each view, and the similarity matrix of each view was obtained based on these representations. Then, a unified similarity graph was learned based on the obtained multiple similarity matrices. Finally, by adding rank constraints to the Laplacian matrix corresponding to the similarity graph, the obtained similarity graph had the optimal clustering structure, and the final clustering results were able to be obtained directly by using this similarity graph. The process was completed in a unified optimization framework, in which potential robust representations, similarity matrices and consistency graphs could be learned simultaneously. The clustering Accuracy (ACC) of RMCGL algorithm is 3.36 percentage points, 5.82 percentage points and 5.71 percentage points higher than that of Graph-based Multi-view Clustering (GMC) algorithm on BBC, 100leaves and MSRC datasets, respectively. Experimental results show that the proposed algorithm has a good clustering effect.

Key words: multi-view, consistency graph, subspace, clustering, self-weighting, graph learning

摘要:

针对多视图数据分析易受原始数据集噪声干扰,以及需要额外的步骤计算聚类结果的问题,提出一种基于一致图学习的鲁棒多视图子空间聚类(RMCGL)算法。首先,在各个视图下学习数据在子空间中的潜在鲁棒表示,并基于该表示得到各视图的相似度矩阵。随后,基于得到的多个相似度矩阵学习一个统一的相似度图。最后,通过对相似度图对应的拉普拉斯矩阵添加秩约束,确保得到的相似度图具有最优的聚类结构,并可直接得到最终的聚类结果。该过程在一个统一的优化框架中完成,能同时学习潜在鲁棒表示、相似度矩阵和一致图。RMCGL算法的聚类精度(ACC)在BBC、100leaves和MSRC数据集上比基于图的多视图聚类(GMC)算法分别提升了3.36个百分点、5.82个百分点和5.71个百分点。实验结果表明,该算法具有良好的聚类效果。

关键词: 多视图, 一致图, 子空间, 聚类, 自加权, 图学习

CLC Number: