Journal of Computer Applications ›› 2024, Vol. 44 ›› Issue (3): 663-670.DOI: 10.11772/j.issn.1001-9081.2023030353

Special Issue: Artificial intelligence

• Artificial intelligence •

Feature selection method for graph neural network based on network architecture design

Dapeng XU1, Xinmin HOU1,2

  1. School of Mathematical Sciences, University of Science and Technology of China, Hefei Anhui 230026, China
  2. CAS Key Laboratory of Wu Wen-Tsun Mathematics (University of Science and Technology of China), Hefei Anhui 230026, China
  • Received: 2023-04-03 Revised: 2023-05-08 Accepted: 2023-05-09 Online: 2023-05-30 Published: 2024-03-10
  • Contact: Dapeng XU
  • About author: HOU Xinmin, born in 1972, native of Yuncheng, Shandong, Ph.D., professor. His research interests include graph theory and its applications, complex networks, and graph neural networks.
  • Supported by:
    National Natural Science Foundation of China (12071453); National Key Research and Development Program (2020YFA0713100)



In recent years, researchers have proposed many improved architecture designs for Graph Neural Networks (GNNs), driving performance gains across a range of prediction tasks. However, most GNN variants implicitly assume that all node features are equally important, which is generally not the case. To address this problem, a feature selection method was proposed that improves existing models and selects an important feature subset for a dataset. The method consists of two components: a feature selection layer and a separate label-feature mapping. In the feature selection layer, a Softmax normalizer and a feature “soft selector” were used to select features. Following the idea of separate label-feature mapping, the model structure was designed to select a subset of related features for each label, and the union of these per-label subsets was taken as the important feature subset of the dataset. Graph ATtention network (GAT) and GATv2 were chosen as the benchmark models, and the proposed method was applied to them to obtain new models. Experimental results show that, on node classification tasks over six datasets, the new models improve accuracy by 0.83% to 8.79% compared with the baseline models. The new models also select an important feature subset for each of the six datasets, containing 3.94% to 12.86% of the features in the respective dataset. When the important feature subset is used as the new input of the benchmark model, the model still reaches more than 95% of the accuracy obtained with all features; that is, the model size is reduced while accuracy is largely preserved. These results indicate that the proposed method can improve node classification accuracy and can effectively select an important feature subset for a dataset.
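The two components described in the abstract can be illustrated with a minimal sketch in plain Python. This is not the authors' implementation: the abstract gives no formulas, so the per-label score vectors, the function names (`soft_select`, `selected_indices`, `important_features`), and the selection rule (keep a feature when its Softmax weight exceeds `factor / d`, where `d` is the number of features) are all assumptions made for illustration. In the actual method the scores would presumably be learned jointly with the GAT or GATv2 model; here they are fixed toy values.

```python
import math


def softmax(scores):
    """Numerically stable Softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def soft_select(features, scores):
    """Assumed "soft selector": scale each feature by its Softmax weight."""
    weights = softmax(scores)
    return [x * w for x, w in zip(features, weights)]


def selected_indices(scores, factor=1.0):
    """Hypothetical rule: keep features whose weight exceeds factor / d."""
    weights = softmax(scores)
    threshold = factor / len(weights)
    return {i for i, w in enumerate(weights) if w > threshold}


def important_features(scores_per_label, factor=1.0):
    """Union of the related-feature subsets selected for each label."""
    result = set()
    for scores in scores_per_label.values():
        result |= selected_indices(scores, factor)
    return result


# Toy scores (made up): one score vector per label, five features each.
scores_per_label = {
    "label_a": [2.0, 0.0, 0.0, 0.0, 0.0],
    "label_b": [0.0, 0.0, 0.0, 3.0, 0.0],
}

print(sorted(important_features(scores_per_label)))  # prints [0, 3]
```

In this toy case each label keeps one feature, and the union `{0, 3}` plays the role of the dataset's important feature subset, which could then be used as the reduced input to a benchmark model.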

Key words: Graph Neural Network (GNN), Graph ATtention network (GAT), feature selection, node classification, deep learning


