• •    

动态梯度阈值裁剪的差分隐私生成对抗网络算法(NDBC2022+P00105)

陈少权1,蔡剑平1,孙岚2   

  1. 1. 福州大学计算机与大数据学院
    2. 福州大学数学与计算机科学学院
  • 收稿日期:2022-08-01 修回日期:2022-08-10 发布日期:2022-09-23
  • 通讯作者: 陈少权

Differential privacy generative adversarial network algorithm for dynamic gradient threshold clipping

  • Received:2022-08-01 Revised:2022-08-10 Online:2022-09-23

摘要: 摘要:现有的生成对抗网络(GAN)和差分隐私相结合的方法大多采用梯度扰动的方法实现隐私保护,即在优化过程中利用梯度裁剪约束优化器对单个数据的敏感性,并对裁剪后的梯度添加随机噪声达到保护模型的目的;但大多方法在训练时裁剪阈值固定,阈值过大或过小均会影响模型的性能。针对该问题,提出动态梯度阈值裁剪的DGC_DPGAN算法以兼顾隐私保护和模型的性能。该算法结合预训练技术,在优化过程中先求取每批次隐私数据的梯度F-范数均值作为动态梯度裁剪阈值,再对梯度进行扰动。考虑不同的裁剪顺序,提出先裁剪再加噪的CLIP_DGC_DPGAN算法和先加噪再裁剪的DGC_DPGAN算法,并采用Rényi Accountant求取隐私损失。实验结果表明,在相同的隐私预算下,所提出的两种动态梯度裁剪算法与固定梯度阈值裁剪方法相比,在Mnist数据集上,IS评分,SSIM指标,CNN分类准确率分别提升了0.32-3.92,0.03-0.27,0.06-0.28;在Fashion-Mnist数据集上,IS评分,SSIM指标,CNN分类准确率分别提升了0.4-4.32,0.01-0.44,0.11-0.23;同时GAN模型生成图像的可用性更好。

关键词: 生成对抗网络, 差分隐私, 动态梯度阈值裁剪, Rényi Accountant

Abstract: Abstract: Most of the existing methods that combine the generative adversarial network (GAN) and differential privacy used gradient perturbation to achieve privacy protection. In the process of optimization, gradient clipping technology was used to constrain the sensitivity of the optimizer to single data, and random noise was added to the clipped gradient to achieve the purpose of protecting the model. However, most methods took the clipping threshold as a fixed parameter during training. When the threshold was too large or too small, the performance of the model would be affected. To solve this problem, DGC_DPGAN with dynamic gradient threshold clipping was proposed to consider privacy protection and model performance. Combined with the pre-training technology, and in the process of optimization, the gradient F-norm average of each batch of privacy data was obtained as the dynamic gradient clipping threshold, and then the gradient was perturbed. Considering different clipping orders, CLIP_DGC_DPGAN and DGC_DPGAN were proposed, and used Rényi Accountant to calculate the privacy loss. The experimental results show that under the same privacy budget, the two dynamic gradient clipping algorithms are better than the fixed gradient threshold clipping method. On the Mnist dataset, the IS, SSIM, and CNN classification accuracy are improved by 0.32-3.92, 0. 03-0.27, and 0.06-0.28 respectively; On the Fashion-Mnist dataset, the IS, SSIM, and CNN classification accuracy are improved by 0.4-4.32, 0.01-0.44 and 0.11-0.23 respectively; At the same time, the availability of the images generated by GAN model is better.

Key words: Keywords: generative adversarial network, differential privacy, dynamic gradient clipping, Rényi Accountant

中图分类号: