计算机应用 ›› 2020, Vol. 40 ›› Issue (5): 1348-1353.DOI: 10.11772/j.issn.1001-9081.2019081392

• 网络空间安全 • 上一篇    下一篇

聚焦图像对抗攻击算法PS-MIFGSM

吴立人1, 刘政浩1, 张浩1, 岑悦亮2, 周维1   

  1. 1.云南大学 软件学院,昆明 650091
    2.昆明理工大学 信息工程与自动化学院,昆明 650500
  • 收稿日期:2019-08-13 修回日期:2019-11-04 出版日期:2020-05-10 发布日期:2020-05-15
  • 通讯作者: 周维(1974—)
  • 作者简介:吴立人(1994—),男,陕西咸阳人,硕士研究生,主要研究方向:深度学习; 刘政浩(1996—),男,湖北黄冈人,硕士研究生,主要研究方向:深度学习; 张浩(1992—),男,云南玉溪人,硕士研究生,主要研究方向:深度学习; 岑悦亮(1996—),女,广西梧州人,硕士研究生,主要研究方向:深度学习; 周维(1974—),男,云南昆明人,教授,博士,主要研究方向:分布式处理、生物信息学。
  • 基金资助:

    国家自然科学基金资助项目(61762089)。

PS-MIFGSM: focus image adversarial attack algorithm

WU Liren1, LIU Zhenghao1, ZHANG Hao1, CEN Yueliang2, ZHOU Wei1   

  1. 1.School of Software, Yunnan University, KunmingYunnan 650091, China
    2.Faculty of Information Engineering and Automation, Kunming University of Science and Technology, KunmingYunnan 650500, China
  • Received:2019-08-13 Revised:2019-11-04 Online:2020-05-10 Published:2020-05-15
  • Contact: ZHOU Wei, born in 1974, Ph. D., professor. His research interests include distributed processing, bioinformatics.
  • About author:WU Liren, born in 1994, M. S. candidate. His research interests include deep learning.LIU Zhenghao, born in 1996, M. S. candidate. His research interests include deep learning.ZHANG Hao, born in 1992, M. S. candidate. His research interests include deep learning.CEN Yueliang, born in 1999, M. S. candidate. Her research interests include deep learning.ZHOU Wei, born in 1974, Ph. D., professor. His research interests include distributed processing, bioinformatics.
  • Supported by:

    This work is partially supported by the National Natural Science Foundation of China (61762089).

摘要:

针对目前主流对抗攻击算法通过扰动全局图像特征导致攻击隐蔽性降低的问题,提出一种聚焦图像的无目标攻击算法——PS-MIFGSM。首先,通过Grad-CAM算法捕获卷积神经网络(CNN)在分类任务中对图像的重点关注区域;然后,使用MI-FGSM攻击分类网络,生成对抗扰动,并且将扰动作用于图像的重点关注区域,而图像的非关注区域保持不变,从而生成新的对抗样本。在实验部分,以三种图像分类模型Inception_v1、Resnet_v1和Vgg_16为基础,对比了PS-MIFGSM和MI-FGSM两种方法分别进行单模型攻击和集合模型攻击的效果。实验结果表明,PS-MIFGSM能够在攻击成功率不变的同时,有效降低对抗样本与真实样本的差异大小。

关键词: 无目标攻击, 卷积神经网络, 图像分类, 对抗样本, 集合模型

Abstract:

Aiming at the problem of the present mainstream adversarial attack algorithm that the attack invisibility is reduced by disturbing the global image features, an untargeted attack algorithm named PS-MIFGSM (Perceptual-Sensitive Momentum Iterative Fast Gradient Sign Method) was proposed. Firstly, the areas of the image focused by Convolutional Neural Network (CNN) in the classification task were captured by using Grad-CAM algorithm. Then, MI-FGSM (Momentum Iterative Fast Gradient Sign Method) was used to attack the classification network to generate the adversarial disturbance, and the disturbance was applied to the focus areas of the image with the non-focus areas of the image unchanged, thereby, a new adversarial sample was generated. In the experiment, based on three image classification models Inception_v1, Resnet_v1 and Vgg_16, the effects of PS-MIFGSM and MI-FGSM on single model attack and set model attack were compared. The results show that PS-MIFGSM can effectively reduce the difference between the real sample and the adversarial sample with the attack success rate unchanged.

Key words: untargeted attack, Convolutional Neural Network (CNN), image classification, adversarial sample, set model

中图分类号: