Journal of Computer Applications ›› 2026, Vol. 46 ›› Issue (6): 1836-1843.DOI: 10.11772/j.issn.1001-9081.2025050647

• Data science and technology • Previous Articles    

Multi-level neighborhood contrastive attribute graph clustering based on adaptive learning

Jinghong WANG1,2,3,4,5, Xiao CHEN1, Yingmei MA2,5(), Bi LI6, Jusheng MI7, Wei WANG1   

  1. 1.College of Computer and Cyber Security,Hebei Normal University,Shijiazhuang Hebei 050024,China
    2.College of Artificial Intelligence,Hebei University of Engineering Science,Shijiazhuang Hebei 050091,China
    3.Hebei Key Laboratory of Network and Information Security (Hebei Normal University),Shijiazhuang Hebei 050024,China
    4.Hebei Provincial Engineering Research Center for Supply Chain Big Data Analytics and Data Security (Hebei Normal University),Shijiazhuang Hebei 050024,China
    5.Shijiazhuang Intelligent IoT Technology Innovation Center,Shijiazhuang Hebei 050299,China
    6.School of Business,Hebei Normal University,Shijiazhuang Hebei 050024,China
    7.School of Mathematical Sciences,Hebei Normal University,Shijiazhuang Hebei 050024,China
  • Received:2025-06-12 Revised:2025-08-16 Accepted:2025-09-01 Online:2025-09-15 Published:2026-06-10
  • Contact: Yingmei MA
  • About author:WANG Jinghong, born in 1967, Ph. D., professor. Her research interests include artificial intelligence, data mining.
    CHEN Xiao, born in 2000, M. S. candidate. Her research interests include data mining, graph representation learning.
    LI Bi, born in 1966, M. S., associate professor. His research interests include fuzzy systems.
    MI Jusheng, born in 1966, Ph. D., professor. His research interests include mathematical foundation of artificial intelligence, rough set theory.
    WANG Wei, born in 1982, Ph. D., associate professor. His research interests include computer vision.
    First author contact:MA Yingmei, born in 1989, M. S., lecturer. Her research interests include big data analysis.
  • Supported by:
    Natural Science Foundation of Hebei Province(F2024205028);Post-graduate’s Innovation Fund Project of Hebei Province(CXZZSS2025049);Key Development Fund of Hebei Normal University(L2024ZD06)

基于自适应学习的多层次邻域对比属性图聚类

王静红1,2,3,4,5, 陈潇1, 马迎梅2,5(), 李笔6, 米据生7, 王威1   

  1. 1.河北师范大学 计算机与网络空间安全学院,石家庄 050024
    2.河北工程技术学院 人工智能学院,石家庄 050091
    3.河北省网络与信息安全重点实验室(河北师范大学),石家庄 050024
    4.供应链大数据分析与数据安全河北省工程研究中心(河北师范大学),石家庄 050024
    5.石家庄市智能物联技术创新中心,石家庄 050299
    6.河北师范大学 商学院,石家庄 050024
    7.河北师范大学 数学科学学院,石家庄 050024
  • 通讯作者: 马迎梅
  • 作者简介:王静红(1967—),女,河北石家庄人,教授,博士,CCF会员,主要研究方向:人工智能、数据挖掘
    陈潇(2000—),女,河北沧州人,硕士研究生,CCF会员,主要研究方向:数据挖掘、图表示学习
    李笔(1966—),男,湖南湘乡人,副教授,硕士,主要研究方向:模糊系统
    米据生(1966—),男,河北宁晋人,教授,博士,主要研究方向:人工智能的数学基础、粗糙集理论
    王威(1982—),男,河北邯郸人,副教授,博士,主要研究方向:计算机视觉。
    第一联系人:马迎梅(1989—),女,河北石家庄人,讲师,硕士,主要研究方向:大数据分析
  • 基金资助:
    河北省自然科学基金资助项目(F2024205028);河北省研究生创新资助项目(CXZZSS2025049);河北师范大学重点发展基金资助项目(L2024ZD06)

Abstract:

Recently, deep graph clustering methods have outstanding performance in graph clustering studies. However, most existing deep graph clustering methods are based on the auto-encoder framework, and are vulnerable to reconstruction strategies and graph enhancement strategies. Therefore, a deep graph clustering method based on contrastive learning was proposed, namely Multi-level Neighborhood Contrastive attribute Graph Clustering based on adaptive learning (MNCGC). Firstly, a dual masking strategy was designed to generate an adaptive augmented graph, which combined the node importance to generate edge weights, that is, edge masking probabilities, and a fixed masking probability was set for node features for node feature masking, so as to remove redundant information in the graph and provide rich sample pairs for neighborhood contrastive learning. Then, the edge weights were introduced into the neighborhood contrastive learning, so that the enhanced neighborhood contrastive learning was used to the original graph and the augmented graph at coding level and projection level, thereby emphasizing the local information learning and the global high-level semantic information learning. Finally, self-supervised clustering and code level representation were used to promote each other, thereby further improving the clustering effect. Experimental results on three benchmark datasets including Cora, CiteSeer and PubMed show that compared with fourteen advanced methods, MNCGC method achieves optimal values in most cases across four indicators: accuracy, Normalized Mutual Information (NMI), Adjusted Rand Index (ARI) and F1-score, fully verifying the effectiveness of the proposed method.

Key words: attribute graph clustering, adaptive learning, contrastive learning, self-supervised clustering, deep graph clustering

摘要:

在近期的图聚类研究中,深度图聚类方法表现优异;然而,现有的深度图聚类方法多数基于自编码器框架,容易受到重建策略和图增强策略的影响。因此,提出一种基于对比学习的深度图聚类方法——基于自适应学习的多层次邻域对比属性图聚类(MNCGC)。首先,设计一种双重掩蔽策略生成自适应增广图,结合节点重要性生成边权重,即边屏蔽概率,并对节点特征设置固定屏蔽概率,进行节点特征屏蔽,去除图中的冗余信息,为邻域对比学习提供丰富的样本对;其次,将边权重引入邻域对比学习,以对原图和增广图使用加强邻域对比学习并把它们应用于编码级和投影级,从而强调局部信息学习和全局高级语义信息学习;最后,采用自监督聚类与编码级表示相互促进,进一步提升聚类效果。在3个基准数据集(Cora、CiteSeer和全文PubMed)上的实验结果表明,相较于14种先进方法,MNCGC方法的准确率、标准互信息(NMI)、调整兰德指数(ARI)以及F1分数在大多数情况下都取得了最优值,充分验证了所提方法的有效性。

关键词: 属性图聚类, 自适应学习, 对比学习, 自监督聚类, 深度图聚类

CLC Number: