Journal of Computer Applications ›› 2026, Vol. 46 ›› Issue (1): 124-134.DOI: 10.11772/j.issn.1001-9081.2024121776

• Cyber security • Previous Articles     Next Articles

Clean-label multi-backdoor attack method based on feature regulation and color separation

Yingchun TANG1, Rong HUANG1,2(), Shubo ZHOU1,2, Xueqin JIANG1,2   

  1. 1.College of Information Science and Technology,Donghua University,Shanghai 201620,China
    2.Engineering Research Center of Digitized Textile and Apparel Technology,Ministry of Education (Donghua University),Shanghai 201620,China
  • Received:2024-12-17 Revised:2025-03-31 Accepted:2025-04-07 Online:2026-01-10 Published:2026-01-10
  • Contact: Rong HUANG
  • About author:TANG Yingchun, born in 2000, M. S. candidate. His research interests include backdoor attack.
    ZHOU Shubo, born in 1988, Ph. D., lecturer. His research interests include computer vision, computational imaging.
    JIANG Xueqin, born in 1981, Ph. D., professor. His research interests include wireless communication, quantum communication, machine learning.
  • Supported by:
    National Natural Science Foundation of China(62001099);Fundamental Research Funds for the Central Universities(2232023D-30)

基于特征调控与颜色分离的净标签多后门攻击方法

唐迎春1, 黄荣1,2(), 周树波1,2, 蒋学芹1,2   

  1. 1.东华大学 信息科学与技术学院,上海 201620
    2.数字化纺织服装技术教育部工程研究中心(东华大学),上海 201620
  • 通讯作者: 黄荣
  • 作者简介:唐迎春(2000—),男,安徽合肥人,硕士研究生,主要研究方向:后门攻击
    周树波(1988—),男,浙江绍兴人,讲师,博士,主要研究方向:计算机视觉、计算成像
    蒋学芹(1981—),男,江苏苏州人,教授,博士,主要研究方向:无线通信、量子通信、机器学习。
  • 基金资助:
    国家自然科学基金资助项目(62001099);中央高校基本科研业务费专项资金资助项目(2232023D-30)

Abstract:

To solve the problem of lack of stealth and flexibility in traditional backdoor attacks, a clean-label multi-backdoor attack method based on feature regulation and color separation was proposed to train poisoned network to embed triggers based on the information hiding framework. Firstly, image edge was used as trigger, a feature regulation strategy was designed and adversarial perturbation and a surrogate model were combined to assist poisoned network training, and enhance the significance of trigger features. Secondly, by proposing a color separation strategy to color the trigger, the trigger was given distinguishable RGB space colors and set one-hot target confidence corresponding to the color to guide training, thereby ensuring the distinguishability of trigger features. In order to verify the effectiveness of the proposed method, experiments were conducted on 3 datasets (CIFAR-10, ImageNet-10 and GTSRB) and 5 models. The results show that in the single-backdoor scenario, the proposed method achieves the Attack Success Rate (ASR) over 98% on all three datasets, outperforming the second-best method by 7.94, 1.70, and 8.61 percentage points, respectively; in the multi-backdoor scenario, the proposed method achieves the ASR over 90% on the ImageNet-10 dataset, outperforming the second-best method by 36.63 percentage points averagely. The results of ablation experiments verify the rationality of the feature regulation and color separation strategies as well as the contribution of adversarial perturbation and surrogate model. The results of the multi-backdoor experiment demonstrate the flexibility of the proposed attack method.

Key words: backdoor attack, clean-label, feature regulation, color separation, surrogate model

摘要:

针对传统的后门攻击缺乏隐蔽性与灵活性的问题,提出一种基于特征调控与颜色分离的净标签多后门攻击方法,以信息隐藏框架为基础,训练中毒网络嵌入触发器。首先,以图像边缘作为触发器,设计特征调控策略,结合对抗扰动与代理模型辅助训练中毒网络,增强触发器特征的显著性;其次,提出颜色分离策略对触发器进行着色,赋予触发器可区分的RGB空间颜色并设置与颜色相对应的one-hot目标置信度引导训练,从而保证触发器特征的可区分性。为了验证所提方法的有效性,分别在3个数据集(CIFAR-10、ImageNet-10和GTSRB)上以及5种模型上进行实验。结果表明,在单后门场景下,所提方法的攻击成功率(ASR)在3个数据集上均超过98%,分别超过次优方法7.94、1.70和8.61个百分点;在多后门场景下,所提方法在ImageNet-10数据集上的ASR达到90%以上,平均ASR超过次优方法36.63个百分点。而消融实验的结果也验证了特征调控与颜色分离策略的合理性及对抗扰动与代理模型的贡献,多后门实验的结果展示了所提攻击方法的灵活性。

关键词: 后门攻击, 净标签, 特征调控, 颜色分离, 代理模型

CLC Number: