《计算机应用》唯一官方网站

• •    下一篇

面向GNN模型提取攻击的图数据生成方法

杨莹,郝晓燕,于丹,马垚,陈永乐   

  1. 太原理工大学
  • 收稿日期:2023-08-17 修回日期:2023-11-01 发布日期:2023-12-18 出版日期:2023-12-18
  • 通讯作者: 杨莹
  • 基金资助:
    山西省基础研究计划资助项目;山西省基础研究计划资助项目;山西省自然科学基金面上项目

Graph data generation approach for GNN model extraction attacks

  • Received:2023-08-17 Revised:2023-11-01 Online:2023-12-18 Published:2023-12-18

摘要: 摘 要: 无数据模型提取攻击是基于攻击者对进行攻击时所需的训练数据信息一无所知的情况下提出的一类机器学习安全问题。针对无数据模型提取攻击在图神经网络领域的研究空缺,提出了分别用图神经网络可解释性方法GNNExplainer和图数据增强方法GAUG-M优化图节点特征信息和边信息来生成所需图数据,实现最终的图神经网络模型提取。首先,利用GNNExplainer方法对目标模型的响应结果进行可解释性分析得到重要的图节点特征信息;然后通过对重要的图节点特征进行加权,对非重要图节点特征进行降权,实现对图节点特征信息整体优化;其次,使用图形自动编码器作为边信息预测模块,该模块根据优化后的图节点特征得到节点与节点之间的连接概率;最后根据概率增加或者删减相应边来优化边信息。实验采用五种图数据集训练的三种图神经网络模型架构作为目标模型进行提取攻击,得到的替代模型达到了73%-87%的节点分类任务准确性和76%-89.2%的与目标模型性能的一致性,验证了方法的有效性。

关键词: 无数据模型提取攻击, 图数据生成, 图神经网络, 图神经网络可解释性, 图数据增强

Abstract: Abstract: Data-free model extraction attacks are a class of machine learning security problems based on the fact that the attacker has no knowledge of the training data information required to carry out the attack. Aiming at the research gap of data-free model extraction attacks in the field of graph neural networks, optimizing the graph node feature information and edge information with the graph neural network interpretability method GNNExplainer and graph data enhancement method GAUG-M, respectively was proposed, so as to generate the required graph data and achieve the final graph neural network model extraction. Firstly, the GNNExplainer method was used to obtain the important graph node feature information from the interpretable analysis of the response results of the target model.Secondly, the overall optimization of the graph node feature information was achieved by weighting the important graph node features and downweighting the non-important graph node features.Then, the graph autoencoder was used as the edge information prediction module, which obtained the connection probability information between nodes and nodes according to the optimized graph node features.Finally, the edge information was optimized by adding or deleting the corresponding edges according to the probability. The experiments use three graph neural network model architectures trained on five graph datasets as the target model for extraction attack, and the obtained alternative models achieve 73%-87% accuracy in node classification task and 76%-89.2% consistency with the target model performance, which verifies the effectiveness of the method.

Key words: data-free model extraction attack, graph data generation, graph neural networks, graphical neural network interpretability, graph data enhancement

中图分类号: