Journal of Computer Applications ›› 2026, Vol. 46 ›› Issue (6): 1855-1862.DOI: 10.11772/j.issn.1001-9081.2025060793

• Data science and technology • Previous Articles    

Graph neural network node classification model incorporating clustering coefficients

Yasong ZHANG1, Bihui CONG2, Shuang XU1()   

  1. 1.College of Information and Communication Engineering,Dalian Minzu University,Dalian Liaoning 116600,China
    2.Academy of the Zhonghuaminzu Community,Dalian Minzu University,Dalian Liaoning 116600,China
  • Received:2025-07-17 Revised:2025-09-20 Accepted:2025-09-25 Online:2025-10-16 Published:2026-06-10
  • Contact: Shuang XU
  • About author:ZHANG Yasong, born in 2001, M. S. candidate. Her research interests include node classification, attention mechanism.
    CONG Bihui, born in 1989, Ph. D. candidate, lecturer. Her research interests include communication studies.
    First author contact:XU Shuang, born in 1978, Ph. D., professor. Her research interests include big data analysis and processing of complex network.
  • Supported by:
    Research Project on Humanities and Social Sciences by the Ministry of Education in 2023(23YJCZH028);Fundamental Research Funds for the Central Universities(044420250008)

引入聚类系数的图神经网络节点分类模型

张雅淞1, 丛碧辉2, 许爽1()   

  1. 1.大连民族大学 信息与通信工程学院,辽宁 大连 116600
    2.大连民族大学 中华民族共同体研究院,辽宁 大连 116600
  • 通讯作者: 许爽
  • 作者简介:张雅淞(2001—),女,辽宁阜新人,硕士研究生,CCF会员,主要研究方向:节点分类、注意力机制
    丛碧辉(1989—),女,吉林四平人,讲师,博士研究生,主要研究方向:传播学
    第一联系人:许爽(1978—),女,辽宁大连人,教授,博士,主要研究方向:复杂网络的大数据分析与处理。
  • 基金资助:
    2023年度教育部人文社会科学研究项目(23YJCZH028);中央高校基本科研业务费专项资金资助项目(044420250008)

Abstract:

To address the issues of structural unfairness and classification inaccuracy of Graph ATtention network (GAT) model in node classification tasks, a Graph Neural Network (GNN) node classification model incorporating clustering coefficients, named GATcc(GAT with clustering coefficient), was proposed. Firstly, by introducing the clustering coefficients of neighboring nodes as structural information, and combining trainable weight parameters, the representation ability of the topological structure in the attention mechanism was enhanced. Then, feature scaling was employed to optimize node embeddings, and residual connections were added to mitigate the risk of feature over-smoothing. Experimental results on six real datasets demonstrate that the proposed model outperforms the mainstream models, such as Graph Isomorphism Network (GIN) and GOAT (Graph Ordering Attention Network), in classification accuracy. For instance, compared to the baseline model GAT on the Cora dataset, the proposed model has the classification accuracy improved by 4.03 percentage points, the structural bias reduced from 0.31% to 0.11%, and the classification accuracy of isolated nodes improved by 3.69 percentage points. In conclusion, the proposed model not only achieves significant improvements in classification performance, but also shows superiority in structural fairness and stability.

Key words: node classification, attention mechanism, clustering coefficient, topological information, structural fairness

摘要:

针对图注意力网络(GAT)模型在节点分类任务中存在的结构不公平性和分类不准确问题,提出一种引入聚类系数的图神经网络(GNN)节点分类模型GATcc(GAT with clustering coefficient)。首先,通过引入邻居节点的聚类系数作为结构信息,结合可训练的权重参数,增强拓扑结构在注意力机制中的表达能力;其次,采用特征缩放优化节点嵌入,并加入残差连接以减轻特征过平滑的风险。在6个真实数据集上的实验结果表明,该模型的分类准确率超过图同构网络(GIN)和GOAT(Graph Ordering Attention Network)等主流模型。例如,相较于基线模型GAT,所提模型的分类准确率在Cora数据集上提升了4.03个百分点,结构性偏差从0.31%降低到0.11%,而孤立节点的分类准确率提升了3.69个百分点。综上,所提模型不仅在分类性能上取得了显著提升,还在结构公平性与稳定性方面展现出优越性。

关键词: 节点分类, 注意力机制, 聚类系数, 拓扑信息, 结构公平性

CLC Number: