Dynamic adversarial training method based on sample robustness disparity

doi:10.11772/j.issn.1001-9081.2026010036

Journal of Computer Applications

Received: 2026-01-19 Revised: 2026-04-05 Online: 2026-06-12 Published: 2026-06-12
Supported by:
National Natural Science Foundation of China (NSFC)

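Authors: WANG Lian (王练)¹, ZHANG Haojie (张豪杰)², MEI Tianfeng (梅天风)¹

1. School of Computer Science, Chongqing University of Posts and Telecommunications
2. Chongqing University of Posts and Telecommunications

Corresponding author: ZHANG Haojie (张豪杰)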

Abstract

Abstract: Traditional adversarial training applies a uniform perturbation setting to all samples and ignores differences in their robustness, which leaves the natural accuracy and robust accuracy of the model poorly balanced. To address this issue, this paper proposes a dynamic adversarial training method based on sample robustness disparity that optimizes the two jointly, providing technical support for defending deep learning models in the artificial intelligence security domain. First, the method quantifies sample robustness by the confidence gap between the top-1 and top-2 predicted categories, and customizes perturbations only for correctly classified clean samples. Second, it combines the confidence gap with the skewness of the multi-category confidence distribution and, together with an early stopping strategy, constructs a differentiated perturbation generation mechanism. Finally, it uses the Kullback-Leibler divergence to measure the distribution difference of samples and designs a dynamically weighted loss function that prioritizes learning from samples with fragile robustness; these two modules together realize sample-level fine-grained training. Comparisons with seven mainstream methods on three benchmark datasets (CIFAR-10, CIFAR-100 and Tiny ImageNet) show that the proposed method significantly improves both clean-sample accuracy and adversarial robust accuracy, with higher training efficiency and good adaptability to different perturbation budgets. Ablation experiments validate the effectiveness of each module. The results demonstrate that the method overcomes the limitations of traditional methods, jointly optimizes natural accuracy and robust accuracy, and generalizes well as a strategy, offering a feasible approach to the fine-grained design of adversarial training and its application in security-critical fields.
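The quantities named in the abstract can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the abstract does not give the exact formulas, so the budget rule in `sample_budget` (budget proportional to the confidence gap, zero for misclassified samples) and the weighting rule in `loss_weights` (weight proportional to 1 plus the KL divergence between clean and adversarial predictions) are assumptions made here for illustration, and all function names are hypothetical. The skewness term and the early stopping strategy are left out.

```python
import math


def softmax(logits):
    """Convert raw logits into class probabilities."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def confidence_gap(probs):
    """Gap between the top-1 and top-2 class probabilities, in [0, 1]."""
    top1, top2 = sorted(probs, reverse=True)[:2]
    return top1 - top2


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))


def sample_budget(probs, label, eps_max):
    """Assumed rule: no perturbation for misclassified clean samples,
    otherwise a budget that grows with the confidence gap."""
    pred = max(range(len(probs)), key=probs.__getitem__)
    if pred != label:
        return 0.0
    return eps_max * confidence_gap(probs)


def loss_weights(clean_probs, adv_probs):
    """Assumed rule: weight each sample by 1 + KL(clean || adversarial),
    then normalize so the weights average to 1 over the batch."""
    raw = [1.0 + kl_divergence(c, a) for c, a in zip(clean_probs, adv_probs)]
    mean = sum(raw) / len(raw)
    return [r / mean for r in raw]


if __name__ == "__main__":
    robust = softmax([5.0, 1.0, 0.0])    # confident prediction, large gap
    fragile = softmax([1.2, 1.0, 0.0])   # near the decision boundary, small gap
    print(sample_budget(robust, 0, 8 / 255), sample_budget(fragile, 0, 8 / 255))
    # Pretend the attack flips the fragile sample but leaves the robust one unchanged.
    print(loss_weights([robust, fragile], [robust, softmax([0.0, 1.0, 1.2])]))
```

Under these assumed rules, a sample near the decision boundary receives a smaller perturbation budget and, if the attack shifts its prediction, a larger loss weight, which matches the abstract's stated intent of prioritizing samples with fragile robustness.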

Key words: Deep Neural Network (DNN), adversarial robustness, adversarial training, adversarial examples, adaptive perturbation

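Cite this article: WANG Lian, ZHANG Haojie, MEI Tianfeng. Dynamic adversarial training method based on sample robustness disparity [J]. Journal of Computer Applications, DOI: 10.11772/j.issn.1001-9081.2026010036.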

