Incomplete multi-view clustering algorithm based on attention mechanism

doi:10.11772/j.issn.1001-9081.2023121866

Journal of Computer Applications

Received:2024-01-08 Revised:2024-03-14 Online:2024-03-22 Published:2024-03-22

基于注意力机制的不完备多视图聚类算法

杨成昊¹,胡节¹,王红军²,彭博³

1. 西南交通大学计算机与人工智能学院
2. 西南交通大学
3. 西南交通大学信息科学与技术学院, 成都 610031

通讯作者: 胡节
基金资助:
国家自然科学基金;四川省重点研发项目;2023年西南交通大学国际学生教育管理研究项目

Abstract

Abstract: Abstract: In order to solve the problems of uncertainty in completing missing view data, lack of robustness of embedding learning and low model generalization in traditional deep incomplete multi-view clustering algorithms, an Incomplete Multi-View Clustering algorithm based on Attention Mechanism (IMVCAM) was proposed. First, K-Nearest Neighbors (KNN) was used to complete the missing data in the view, making the training data complementary. Then, after passing the linear encoding layer, the obtained embedding was passed through the attention layer to improve the quality of the embedding. Finally, the embedding obtained from the training of each view was clustered using the k-means clustering algorithm (k-means), and the weights of the views were determined by the Pearson correlation coefficient. The experiments were conducted on five classic datasets, and the best results were achieved on the Fashion dataset. Experimental results on the Fashion dataset showed that compared with the suboptimal DSIMVC (Deep Safe Incomplete Multi-View Clustering), the proposed algorithm IMVCAM improved the clustering accuracy by 2.85 and 4.35 percentage points when the data missing rate was 0.1 and 0.3 respectively. In addition, on the Caltech101-20 dataset, the clustering accuracy increased by 7.68 and 3.48 percentage points compared to the suboptimal IMVCSAF (Incomplete Multi-View Clustering algorithm based on Self-Attention Fusion) when the missing rate was 0.1 and 0.3.

Key words: Keywords: Incomplete multi-view clustering, Attention mechanism, K-Nearest Neighbors (KNN), k-means clustering algorithm (k-means), Pearson correlation coefficient

摘要： 摘要: 针对传统深度不完备多视图聚类算法中补全缺失视图数据的不确定性，嵌入学习缺乏鲁棒性以及模型泛化性低的问题，提出了基于注意力机制的不完备多视图聚类算法(IMVCAM)。首先，通过K最近邻(KNN)补全了视图中缺失的数据，使得训练数据具有互补性；然后，经过线性编码层，再将获得的嵌入通过注意力层，提高嵌入的质量；最后，对每个视图训练得到的嵌入使用k均值聚类算法(k-means)，视图的权重通过皮尔逊相关系数进行确定。实验在五个经典的数据集上进行，在Fashion数据集上取得最优的结果。在Fashion数据集上的实验结果表明，所提算法IMVCAM相较于次优的DSIMVC(Deep Safe Incomplete Multi-View Clustering)在数据缺失率为0.1，0.3的情况下聚类精度提升了2.85，4.35个百分点。此外，在Caltech101-20数据集上，缺失率为0.1，0.3的情况下相比于次优的IMVCSAF(Incomplete Multi-View Clustering algorithm based on Self-Attention Fusion)聚类精度提升了7.68，3.48个百分点。

关键词: 关键词: 不完备多视图聚类, K最近邻, 注意力机制, 皮尔逊相关系数, k均值聚类算法

CLC Number:

TP391

杨成昊胡节王红军彭博. 基于注意力机制的不完备多视图聚类算法[J]. 《计算机应用》唯一官方网站, DOI: 10.11772/j.issn.1001-9081.2023121866.

[1]	Rui JIANG, Wei LIU, Cheng CHEN, Tao LU. Asymmetric unsupervised end-to-end image deraining network [J]. Journal of Computer Applications, 2024, 44(3): 922-930.
[2]	Tao SUN, Zhangtian DUAN, Haonan ZHU, Peihao GUO, Heli SUN. Social event recommendation method based on unexpectedness metric [J]. Journal of Computer Applications, 2024, 44(3): 760-766.
[3]	Yongfeng DONG, Jiaming BAI, Liqin WANG, Xu WANG. Chinese named entity recognition combining prior knowledge and glyph features [J]. Journal of Computer Applications, 2024, 44(3): 702-708.
[4]	Zijie HUANG, Yang OU, Degang JIANG, Cailing GUO, Bailin LI. Lightweight deep learning algorithm for weld seam surface quality detection of traction seat [J]. Journal of Computer Applications, 2024, 44(3): 983-988.
[5]	Aiguo SHANG, Xinjuan ZHU. Joint approach of intent detection and slot filling based on multi-task learning [J]. Journal of Computer Applications, 2024, 44(3): 690-695.
[6]	Yuliang ZHENG, Yunhua CHEN, Weijie BAI, Pinghua CHEN. Vehicle target detection by fusing event data and image frames [J]. Journal of Computer Applications, 2024, 44(3): 931-937.
[7]	Kui ZHAO, Huiqi QIU, Xu LI, Zhifei XU. Real-time pulmonary nodule detection algorithm combining attention and multipath fusion [J]. Journal of Computer Applications, 2024, 44(3): 945-952.
[8]	Xinran LUO, Tianrui LI, Zhen JIA. Chinese medical named entity recognition based on self-attention mechanism and lexicon enhancement [J]. Journal of Computer Applications, 2024, 44(2): 385-392.
[9]	Fuqin DENG, Huifeng GUAN, Chaoen TAN, Lanhui FU, Hongmin WANG, Tinlun LAM, Jianmin ZHANG. Multi-robot reinforcement learning path planning method based on request-response communication mechanism and local attention mechanism [J]. Journal of Computer Applications, 2024, 44(2): 432-438.
[10]	Weichao DANG, Lei ZHANG, Gaimei GAO, Chunxia LIU. Weakly supervised action localization method with snippet contrastive learning [J]. Journal of Computer Applications, 2024, 44(2): 548-555.
[11]	Ziqi HUANG, Jianpeng HU. Entity category enhanced nested named entity recognition in automotive domain [J]. Journal of Computer Applications, 2024, 44(2): 377-384.
[12]	Zhiping ZHU, Yan YANG, Jie WANG. Scene graph-aware cross-modal image captioning model [J]. Journal of Computer Applications, 2024, 44(1): 58-64.
[13]	Junhao LUO, Yan ZHU. Multi-dynamic aware network for unaligned multimodal language sequence sentiment analysis [J]. Journal of Computer Applications, 2024, 44(1): 79-85.
[14]	Mu LI, Yuheng YANG, Xizheng KE. Emotion recognition model based on hybrid-mel gama frequency cross-attention transformer modal [J]. Journal of Computer Applications, 2024, 44(1): 86-93.
[15]	Jia WANG-ZHU, Zhou YU, Jun YU, Jianping FAN. Video dynamic scene graph generation model based on multi-scale spatial-temporal Transformer [J]. Journal of Computer Applications, 2024, 44(1): 47-57.