《计算机应用》唯一官方网站 ›› 2023, Vol. 43 ›› Issue (S2): 117-121.DOI: 10.11772/j.issn.1001-9081.2023040468

• 数据科学与技术 • 上一篇    

基于语义最优化的图像聚类算法

张凯, 宋承云()   

  1. 重庆理工大学 计算机科学与工程学院,重庆 400054
  • 收稿日期:2023-04-24 修回日期:2023-05-31 接受日期:2023-06-01 发布日期:2024-01-09 出版日期:2023-12-31
  • 通讯作者: 宋承云
  • 作者简介:张凯(1999—),男,安徽亳州人,硕士研究生,主要研究方向:深度聚类
    宋承云(1988—),男,甘肃白银人,副教授,博士,主要研究方向:人工智能、机器学习。
  • 基金资助:
    重庆理工大学研究生教育高质量发展行动计划项目(gzlcx20232065)

Image clustering algorithm based on semantic optimization

Kai ZHANG, Chengyun SONG()   

  1. College of Computer Science and Engineering,Chongqing University of Technology,Chongqing 400054,China
  • Received:2023-04-24 Revised:2023-05-31 Accepted:2023-06-01 Online:2024-01-09 Published:2023-12-31
  • Contact: Chengyun SONG

摘要:

针对深度聚类中采用对比学习方式得到的语义特征信息不足的问题,提出一种优化语义特征的算法。在预训练阶段,采用重构损失作为正则化项,增加特征表示和输入之间的互信息,从而近似引入更多与聚类任务相关的信息,降低对比学习过拟合共享信息的风险;在微调阶段,抛弃传统的聚类算法与聚类网络同时更新的方式,采用图像近邻之间的相似性差异作为损失更新聚类网络,以最大限度地利用图像之间的近邻语义信息。在CIFAR10、CIFAR100和STL10数据集上的实验结果表明,所提算法在STL10数据集上的准确率比次优的SCAN(Semantic Clustering by Adopting Nearest neighbors)算法提高了2.7个百分点,并且在标准化互信息(NMI)和调整兰德系数(ARI)指标上均取得了领先,验证了所提算法的有效性。

关键词: 深度聚类, 对比学习, 语义特征, 过拟合, 正则化

Abstract:

Aiming at the problem of insufficient information of semantic features obtained by using contrastive learning in deep clustering, an algorithm for optimizing semantic features was proposed. In the pre-training stage, Reconstruction loss was used as a regularization term to increase the mutual information between the feature representation and the input, thus approximating the introduction of more information relevant to the clustering task and reducing the risk of overfitting shared information by contrastive learning. In the fine-tuning stage, the traditional method that the clustering algorithm and the clustering network were updated simultaneously was abandoned, and the similarity difference between the nearest neighbors of the image was used as the loss to update the clustering network to maximize the use of the semantic information of the nearest neighbors of the image. Experiments results on the CIFAR10, CIFAR100 and STL10 datasets show that the proposed algorithm improves the accuracy on the STL10 dataset by 2.7 percentage points compared to the suboptimal SCAN (Semantic Clustering by Adopting Nearest neighbors) algorithm, and achieves a lead in both the Normalized Mutual Information (NMI) and Adjusted Rand Index (ARI) metrics, which validates the effectiveness of the proposed algorithm.

Key words: deep clustering, contrastive learning, semantic feature, overfitting, regularization

中图分类号: