Journal of Computer Applications ›› 2026, Vol. 46 ›› Issue (5): 1378-1387.DOI: 10.11772/j.issn.1001-9081.2025050566

• Artificial intelligence • Previous Articles    

Graph neural network framework for topology semantic dual-domain collaboration

Kun FU1(), Haoyu WEI1, Weijing LIU2,3, Xing DANG2,3, Zezheng LIU1, Jianwei LI1   

  1. 1.School of Artificial Intelligence,Hebei University of Technology,Tianjin 300401,China
    2.Tianjin Institute of Aerospace Mechanical and Electrical Equipment,Tianjin 300462,China
    3.Tianjin Enterprise Key Laboratory of Aerospace Intelligent Equipment Technology,Tianjin 300462,China
  • Received:2025-05-26 Revised:2025-08-30 Accepted:2025-09-01 Online:2025-09-15 Published:2026-05-10
  • Contact: Kun FU
  • About author:WEI Haoyu, born in 2000, M. S. candidate. His research interests include network representation learning.
    LIU Weijing, born in 1990, M. S., senior engineer. Her research interests include path optimization of AGV scheduling system.
    DANG Xing, born in 1986, M. S., senior engineer. Her research interests include intelligent manufacturing application system.
    LIU Zezheng, born in 2000, M. S. candidate. His research interests include network representation learning.
    LI Jianwei, born in 1974, Ph. D., professor. His research interests include bioinformatics, graph convolutional neural networks.
  • Supported by:
    National Natural Science Foundation of China(62072154);Key Science and Technology Program of Tianjin(22JCYBJC01740)

拓扑语义双域协同的图神经网络框架

富坤1(), 魏昊宇1, 刘伟静2,3, 党兴2,3, 刘泽政1, 李建伟1   

  1. 1.河北工业大学 人工智能与数据科学学院,天津 300401
    2.天津航天机电设备研究所,天津 300462
    3.天津市宇航智能装备技术企业重点实验室,天津 300462
  • 通讯作者: 富坤
  • 作者简介:魏昊宇(2000—),男,河北保定人,硕士研究生,主要研究方向:网络表示学习
    刘伟静(1990—),女,天津人,高级工程师,硕士,主要研究方向:自动导引车调度系统的路径优化
    党兴(1986—),女,天津人,高级工程师,硕士,主要研究方向:智能制造应用系统
    刘泽政(2000—),男,河北邢台人,硕士研究生,主要研究方向:网络表示学习
    李建伟(1974—),男,河北唐山人,教授,博士,主要研究方向:生物信息学、图卷积神经网络。
  • 基金资助:
    国家自然科学基金资助项目(62072154);天津市科技计划项目(22JCYBJC01740)

Abstract:

Graph Neural Network (GNN) commonly faces the Under?Reachability problem (Under?Reach) when processing graph data with sparse and unevenly distributed labeled nodes, which means that distant unlabeled nodes cannot effectively receive supervised signals due to topological constraints, leading to limited model generalization ability. Although existing methods can partially address this issue, they still have limitations, including over?smoothing, high computational complexity, and noise sensitivity. Therefore, a GNN framework for topology semantic dual?domain collaboration, named TriMix, was proposed to address the above challenges through three key improvements. First, a dynamic mixing ratio mechanism was designed to adjust mixing weights between pseudo?labels and ground?truth labels adaptively across training epochs, relying on ground?truth labels for stable convergence in the early stage while incorporating high?confidence pseudo?labels gradually in the latter?stage training for decision boundary expansion. Second, a topology?semantic dual?domain collaborative node?weighted sampling strategy was constructed by integrating node degree, PageRank value, and feature similarity, so as to quantify node importance and optimize information propagation paths, enhancing the reachability of low?centrality nodes. Third, a contrastive learning module was implemented with a triple?level negative sample generation strategy of category?driving, feature?similarity weighting, and pseudo?label guidance, to refine the discriminability between positive and negative samples in the embedding space, thereby enhancing the semantic understanding of unlabeled data. Experimental results on benchmark datasets such as Cora and PubMed showed that TriMix achieved node classification accuracy 2.1% to 4.4% higher than baseline models like Graph Convolutional Network (GCN) and Graph ATtention network (GAT), with improved F1?score and generalization ability. The TriMix framework significantly improves learning efficiency on sparsely labeled graph data through dual?domain collaboration of topological structure and semantic features, providing a new approach to node classification tasks in complex graph structures.

Key words: Graph Neural Network (GNN), Under?Reachability Problem (Under?Reach), dynamic mixing, dual-domain collaboration, node weighted sampling, contrastive learning

摘要:

图神经网络(GNN)在处理标记节点稀疏且分布不均的图数据时,普遍面临欠可达问题(Under-Reach),即远端未标记节点因拓扑限制难以接收有效监督信号,导致模型泛化能力受限。现有方法虽能部分解决这一问题,但仍存在过度平滑、计算复杂度高、噪声敏感等局限性。因此,提出一种拓扑语义双域协同的图神经网络框架TriMix,通过三方面改进应对上述挑战:1)动态混合比例机制。基于训练轮次自适应调整伪标签与真实标签的混合权重,在训练初期依赖真实标签稳定收敛,后期逐步引入高置信伪标签以扩展决策边界。2)拓扑语义双域协同的节点加权采样策略。融合节点度、PageRank值和特征相似性,量化节点重要性并优化信息传播路径,提升低中心性节点的可达性。3)对比学习模块。通过类别驱动、特征相似性加权与伪标签引导的三级负样本生成策略,优化嵌入空间的正负样本区分度,增强模型对未标记数据的语义理解。在Cora、PubMed等经典数据集上的实验结果表明,TriMix的节点分类准确率比图卷积网络(GCN)、图注意力网络(GAT)等基线模型高2.1%~4.4%,F1评分与泛化能力均有所提升。TriMix框架通过拓扑结构与语义特征的双域协同优化显著提升了稀疏标记图数据的学习效率,为复杂图结构中的节点分类任务提供了新思路。

关键词: 图神经网络, 欠可达问题, 动态混合, 双域协同, 节点加权采样, 对比学习

CLC Number: