Journal of Computer Applications ›› 2026, Vol. 46 ›› Issue (2): 467-474.DOI: 10.11772/j.issn.1001-9081.2025050577

• Cyber security • Previous Articles    

GAB3D-SEVSN: enhanced video steganography model via invertible neural network

Qianhui XU1, Ke NIU1,2(), Shunzhe ZHU1, Lin SHI1,2, Jun LI1,2   

  1. 1.College of Cryptography Engineering,Engineering University of PAP,Xi’an Shaanxi 710086,China
    2.Key Laboratory of Network and Information Security,Engineering University of PAP,Xi’an Shaanxi 710086,China
  • Received:2025-05-28 Revised:2025-08-12 Accepted:2025-08-20 Online:2025-08-22 Published:2026-02-10
  • Contact: Ke NIU
  • About author:XU Qianhui, born in 2000, M. S. candidate. Her research interests include information hiding, artificial intelligence.
    NIU Ke, born in 1981, Ph. D., professor. His research interests include multimedia security, information hiding. Email:niuke@163.com
    ZHU Shunzhe, born in 2002, M. S. candidate. His research interests include information hiding, artificial intelligence.
    SHI Lin, born in 1987, M. S., lecturer. His research interests include information security, information management systems.
    LI Jun, born in 1987, Ph. D., lecturer. His research interests include multimedia information hiding.
  • Supported by:
    National Natural Science Foundation of China(62272478);Basic Cutting-edge Innovation Project of Engineering University of PAP(WJY202314)

增强型可逆神经网络视频隐写网络GAB3D-SEVSN

徐千惠1, 钮可1,2(), 朱顺哲1, 石林1,2, 李军1,2   

  1. 1.中国人民武装警察部队工程大学 密码工程学院,西安 710086
    2.中国人民武装警察部队工程大学 武警部队信息安全重点实验室,西安 710086
  • 通讯作者: 钮可
  • 作者简介:徐千惠(2000—),女,山东烟台人,硕士研究生,主要研究方向:信息隐藏、人工智能
    钮可(1981—),男,浙江湖州人,教授,博士,主要研究方向:多媒体安全、信息隐藏 Email:niuke@163.com
    朱顺哲(2002—),男,湖北武汉人,硕士研究生,主要研究方向:信息隐藏、人工智能
    石林(1987—),男,江西九江人,讲师,硕士,主要研究方向:信息安全、信息管理系统
    李军(1987—),男,湖南娄底人,讲师,博士,主要研究方向:多媒体信息隐藏。
  • 基金资助:
    国家自然科学基金资助项目(62272478);武警工程大学基础前沿创新项目(WJY202314)

Abstract:

To address the issues of insufficient long-range motion modeling and over-parameterization caused by channel redundancy in video steganography tasks under small-sample conditions, an enhanced video steganographic network — GAB3D-SEVSN was proposed by integrating 3D Global Attention Block (GAB-3D) and Squeeze-and-Excitation (SE) channel attention. In the model, through the optimized GAB-3D module, key motion trajectories in the 3D spatio-temporal domain were focused on adaptively, thereby enhancing the capability of modeling long-range dependencies. Meanwhile, by embedding the SE module into the reversible architecture, the channel-level adaptive calibration was achieved, which suppressed redundant parameters and alleviated over-parameterization effectively. Experimental results on the UCF101 dataset (13K video samples) demonstrate that, compared to the LF-VSN baseline model, the proposed model achieves improvements of 0.5 dB in Peak Signal-to-Noise Ratio (PSNR) and 2.06% in Structural SIMilarity (SSIM). Ablation experimental results verify the effectiveness and synergistic effect of various modules. Test results on high-dynamic scene subsets and videos with different attributes show that the model outperforms baseline models in PSNR and SSIM significantly, demonstrating excellent robustness and generalization ability.

Key words: video steganography, Invertible Neural Network (INN), Squeeze-and-Excitation (SE) mechanism, 3D global attention mechanism, channel attention mechanism

摘要:

针对小样本条件下视频隐写任务中存在的长程运动建模不足和通道冗余导致的过参数化问题,提出一种融合三维全局注意力块(GAB-3D)与压缩激励(SE)通道注意力的增强型视频隐写网络GAB3D-SEVSN。该模型通过优化的GAB-3D模块在三维时空域自适应地聚焦关键运动轨迹,从而增强长程依赖的建模能力;同时,通过在可逆架构中嵌入SE模块实现通道级自适应校准,从而有效抑制冗余参数并缓解过参数化现象。在UCF101数据集(13K视频样本)上的实验结果表明,相较于LF-VSN基线模型,所提模型的峰值信噪比(PSNR)和结构相似度(SSIM)分别提升了0.5 dB和2.06%。消融实验结果验证了各模块的有效性和协同效应。而在高动态场景子集和不同属性视频上的测试结果表明,该模型在PSNR和SSIM上均显著优于基线模型,展现出优异的鲁棒性和泛化能力。

关键词: 视频隐写, 可逆神经网络, 压缩激励机制, 三维全局注意力机制, 通道注意力机制

CLC Number: