基于数据增强和标签噪声的快速对抗训练方法

doi:10.11772/j.issn.1001-9081.2023121835

《计算机应用》唯一官方网站

• • 下一篇

基于数据增强和标签噪声的快速对抗训练方法

宋逸飞,柳毅

广东工业大学

收稿日期:2024-01-02 修回日期:2024-03-14 发布日期:2024-03-28 出版日期:2024-03-28
通讯作者: 宋逸飞
基金资助:
广东省重点领域研发计划项目

Fast adversarial training method based on data augmentation and label noise

Received:2024-01-02 Revised:2024-03-14 Online:2024-03-28 Published:2024-03-28

摘要/Abstract

摘要： 摘要: 对抗训练是保护分类模型免受对抗性攻击的有效防御方法。然而，由于在训练过程中生成强对抗样本的高成本，可能需要数量级的额外训练时间。为了克服这一限制，基于单步攻击的快速对抗训练已被探索。以往的工作从样本初始化、损失正则化和训练策略等不同角度对快速对抗训练进行了改进。然而，在处理大扰动预算时遇到了灾难性过拟合。基于数据增强与标签噪声的快速对抗训练方法被提出，以解决此困难。初始阶段，对原始样本执行多种图像转换，并引入随机噪声以实施数据增强；接着，少量标签噪声被注入；然后使用增强的数据生成对抗样本用于模型训练；最后，根据对抗鲁棒性测试结果自适应地调整标签噪声率。在CIFAR-10、CIFAR-100数据集上的全面实验结果表明，相较于FGSM-MEP，所提方法在大扰动预算条件下，在两个数据集上的AA上分别提升了4.63和5.38个百分点。经实验证明，新提出的方案可以有效地处理大的扰动预算下灾难性过拟合问题，并显著增强模型的对抗鲁棒性。

关键词: 关键词: 深度学习, 对抗样本, 对抗防御, 数据增强, 标签噪声

Abstract: Abstract: Adversarial training has been an effective defense mechanism for protecting classification models against adversarial attacks. However, the generation of strong adversarial samples during the training process incurred a high computational cost, potentially requiring significantly more training time. To overcome this limitation, fast adversarial training based on single-step attacks was explored. Previous work improved fast adversarial training from different perspectives, such as sample initialization, loss regularization, and training strategies. However, catastrophic overfitting was encountered when dealing with large perturbation budgets. A fast adversarial training method based on data augmentation with label noise is proposed to solve this difficulty. Initially, multiple image transformations are performed on the original samples and random noise is introduced to implement data enhancement; next, a small amount of label noise is injected; then the enhanced data was used to generate adversarial samples for model training; and finally, the label noise rate was adaptively adjusted according to the results of the adversarial robustness test. The comprehensive experimental results on the CIFAR-10 and CIFAR-100 datasets show that compared to FGSM-MEP(Fast Gradient Sign Method with prior from the Momentum of all Previous Epoch), the proposed method improves 4.63 and 5.38 percentage points on the AA(AutoAttack) on the two datasets under the condition of large perturbation budget, respectively. It is experimentally demonstrated that the newly proposed scheme can effectively handle the catastrophic overfitting problem under large perturbation budgets and significantly enhance the adversarial robustness of the model.

Key words: Keywords: deep learning, adversarial example, adversarial defense, data augmentation, label noise

中图分类号:

TP181

宋逸飞柳毅. 基于数据增强和标签噪声的快速对抗训练方法[J]. 计算机应用, DOI: 10.11772/j.issn.1001-9081.2023121835.

[1]	张瑜, 昌燕, 张仕斌. 基于量子局部内在维度的对抗样本检测算法[J]. 《计算机应用》唯一官方网站, 2024, 44(2): 490-495.
[2]	郭安迪, 贾真, 李天瑞. 基于伪实体数据增强的高精准率医学领域实体关系抽取[J]. 《计算机应用》唯一官方网站, 2024, 44(2): 393-402.
[3]	陈彤, 位纪伟, 何仕远, 宋井宽, 杨阳. 基于自适应攻击强度的对抗训练方法[J]. 《计算机应用》唯一官方网站, 2024, 44(1): 94-100.
[4]	蔡引江, 许光俊, 马喜波. 图结构表示下的药物数据增强方法[J]. 《计算机应用》唯一官方网站, 2023, 43(4): 1136-1141.
[5]	伏博毅, 彭云聪, 蓝鑫, 秦小林. 基于深度学习的标签噪声学习算法综述[J]. 《计算机应用》唯一官方网站, 2023, 43(3): 674-684.
[6]	林呈宇, 王雷, 薛聪. 标签语义增强的弱监督文本分类模型[J]. 《计算机应用》唯一官方网站, 2023, 43(2): 335-342.
[7]	张济慈, 范纯龙, 李彩龙, 郑学东. 基于几何关系的跨模型通用扰动生成方法[J]. 《计算机应用》唯一官方网站, 2023, 43(11): 3428-3435.
[8]	李宇航, 杨玉丽, 马垚, 于丹, 陈永乐. 基于BERT模型的文本对抗样本生成方法[J]. 《计算机应用》唯一官方网站, 2023, 43(10): 3093-3098.
[9]	魏佳璇, 杜世康, 于志轩, 张瑞生. 图像分类中的白盒对抗攻击技术综述[J]. 《计算机应用》唯一官方网站, 2022, 42(9): 2732-2741.
[10]	杨博, 张恒巍, 李哲铭, 徐开勇. 基于图像翻转变换的对抗样本生成方法[J]. 《计算机应用》唯一官方网站, 2022, 42(8): 2319-2325.
[11]	孙邱杰, 梁景贵, 李思. 基于BART噪声器的中文语法纠错模型[J]. 《计算机应用》唯一官方网站, 2022, 42(3): 860-866.
[12]	曹一珉, 蔡磊, 高敬阳. 基于生成对抗网络的基因数据生成方法[J]. 《计算机应用》唯一官方网站, 2022, 42(3): 783-790.
[13]	彭禹, 宋耀莲, 杨俊. 基于数据增强的运动想象脑电分类[J]. 《计算机应用》唯一官方网站, 2022, 42(11): 3625-3632.
[14]	罗萍, 丁玲, 杨雪, 向阳. 基于数据增强和弱监督对抗训练的中文事件检测[J]. 《计算机应用》唯一官方网站, 2022, 42(10): 2990-2995.
[15]	邓爽, 何小海, 卿粼波, 陈洪刚, 滕奇志. 基于改进VGG网络的弱监督细粒度阿尔兹海默症分类方法[J]. 《计算机应用》唯一官方网站, 2022, 42(1): 302-309.

基于数据增强和标签噪声的快速对抗训练方法

Fast adversarial training method based on data augmentation and label noise

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics