《计算机应用》唯一官方网站

• •    下一篇

基于多尺度注意力的生成式信息隐藏算法

刘丽1,侯海金1,王安红2,张涛1   

  1. 1. 太原科技大学
    2. 山西省太原市太原科技大学
  • 收稿日期:2023-07-11 修回日期:2023-09-13 发布日期:2023-10-26 出版日期:2023-10-26
  • 通讯作者: 刘丽

Generative data hiding algorithm based on multi-scale attention

  • Received:2023-07-11 Revised:2023-09-13 Online:2023-10-26 Published:2023-10-26

摘要: 针对现有生成式信息隐藏算法嵌入容量低且提取的秘密图像视觉质量欠佳的问题,提出了基于多尺度注意力的生成式信息隐藏算法。首先,设计基于多尺度注意力的双编码-单解码生成器,载体图像与秘密图像的特征在编码端分两个支路独立提取,在解码端通过多尺度注意力模块进行融合,并利用跳跃连接为其提供不同尺度的细节特征,从而获得高质量的载密图像。其次,在U-Net结构的提取器中引入自注意力模块,以此弱化载体图像特征、增强秘密图像深层特征,并利用跳跃连接弥补秘密图像细节特征,提高秘密信息提取的准确率。同时,多尺度判决器与生成器的对抗训练,可以有效提升载密图像的视觉质量。实验结果表明,所提算法在嵌入容量为24 bpp的情况下,生成的载密图像峰值信噪比(PSNR)/结构相似性(SSIM)平均可达到40.93 dB/0.9883,且提取的秘密图像PSNR/SSIM平均可达到30.47 dB/0.9543。

Abstract: Aiming to the problems of low embedding capacity and poor visual quality of the extracted secret images in existing generative data hiding algorithms, a generative data hiding algorithm based on multi-scale attention was proposed. First, a generator based on multi-scale attention which had dual encoder-single decoder was designed. At its encoding end, the features of the cover image and secret image were extracted independently in two branches, and fused by a multi-scale attention module at its decoding end. Skip connections were used to provide different scale detail features, thereby ensuring high-quality of stego-images. Second, self-attention modules were introduced into the extractor of the U-Net structure to weaken the deep features of the cover image, and enhance the deep features of the secret image. The skip connections were used to compensate the detail features of the secret image, so as to improve the accuracy of the extracted secret data. At the same time, the adversarial training of the multi-scale discriminator and generator could effectively improve the image quality of the stego-image. Experimental results show that the proposed algorithm can achieve an average Peak Signal-to-Noise Ratio (PSNR)/Structure Similarity Index Measure (SSIM) of 40.93 dB/0.9883 for the generated stego-images, and an average PSNR/SSIM of 30.47 dB/0.9543 for the extracted secret images under the embedding capacity of 24 bpp.