Image tampering forensics network based on residual feedback and self-attention

doi:10.11772/j.issn.1001-9081.2022081283

Abstract

Abstract:

The existing multi-tampering type image forgery detection algorithms using noise features often can not effectively detect the feature difference between tampered areas and non-tampered areas， especially for copy-move tampering type. To this end， a dual-stream image tampering forensics network fusing residual feedback and self-attention mechanism was proposed to detect tampering artifacts such as unnatural edges of RGB pixels and local noise inconsistence respectively through two streams. Firstly， in the encoder stage， multiple dual residual units integrating residual feedback were used to extract relevant tampering features to obtain coarse feature maps. Secondly， further feature reinforcement was performed on the coarse feature maps by the improved self-attention mechanism. Thirdly， the mutual corresponding shallow features of encoder and deep features of decoder were fused. Finally， the final features of tempering extracted by the two streams were fused in series， and then the pixel-level localization of the tampered area was realized through a special convolution operation. Experimental results show that the F1 score and Area Under Curve （AUC） value of the proposed network on COVERAGE dataset are better than those of the comparison networks. The F1 score of the proposed network is 9.8 and 7.7 percentage points higher than that of TED-Net （Two-stream Encoder-Decoder Network） on NIST16 and Columbia datasets， and the AUC increases by 1.1 and 6.5 percentage points， respectively. The proposed network achieves good results in copy-move tampering type detection， and is also suitable for other tampering type detection. At the same time， the proposed network can locate the tampered area at pixel level accurately， and its detection performance is superior to the comparison networks.

Key words: image tampering, encoder-decoder, feature reinforcement, residual feedback, self-attention mechanism, noise feature

摘要：

现存的使用噪声特征的多篡改类型图像伪造检测算法，往往不能有效地检测篡改区域和非篡改区域之间的特征差异，特别是对复制-粘贴篡改类型。为此，提出一种融合残差反馈和自注意力机制的双流编-解码器图像篡改取证网络，通过两个流分别检测RGB像素的非自然边缘等篡改伪影和局部噪声不一致性。首先，在编码器阶段使用多个融合残差反馈的双重残差单元提取相关篡改特征，以获得粗特征图；其次，通过改进后的自注意力机制对粗特征图进行进一步特征增强；随后，将互相对应的编码器浅层特征和解码器深层特征进行融合；最后，串联融合两个流最终提取到的篡改特征，再通过一个特殊卷积操作实现对篡改区域的像素级定位。实验结果表明，所提网络在COVERAGE数据集上的F1值和曲线下面积（AUC）优于对比网络。在NIST16、Columbia数据集上，所提网络的F1值相较于TED-Net（Two-stream Encoder-Decoder Network）分别提高了9.8和7.7个百分点，AUC分别提高了1.1和6.5个百分点。所提网络在复制-粘贴篡改类型检测上取得了良好的效果，并且也适用于其他篡改类型检测。同时，该网络能在像素级上对篡改区域准确定位，检测性能优于对比网络。

关键词: 图像篡改, 编-解码器, 特征增强, 残差反馈, 自注意力机制, 噪声特征

CLC Number:

TP309.2

Guolong YUAN, Yujin ZHANG, Yang LIU. Image tampering forensics network based on residual feedback and self-attention[J]. Journal of Computer Applications, 2023, 43(9): 2925-2931.

袁国龙, 张玉金, 刘洋. 基于残差反馈和自注意力的图像篡改取证网络[J]. 《计算机应用》唯一官方网站, 2023, 43(9): 2925-2931.

Figures/Tables 12

Fig. 1 Structure of ResNet

Fig. 2 Network framework of the proposed algorithm

Fig. 3 Processing flows of residual feedback and dual residual unit

Fig. 4 Self-attention mechanism designed in this paper

Tab. 1 Information of different datasets

数据集	训练集样本数	测试集样本数
NIST16	404	160
COVERAGE	75	25
CASIA	5 123	921
Columbia	135	45

Fig. 5 ROC curves of proposed algorithm on different datasets

Tab. 2 Comparison of F1 scores on different datasets

算法	NIST16	COVERAGE	CASIA v1.0	Columbia
ELA	23.6	22.2	21.4	47.0
NOI	28.5	26.9	26.3	57.4
CFA1	17.4	19.0	20.7	46.7
RGB-N	72.2	43.7	40.8	69.7
TED-Net	61.0	—	44.0	85.0
本文算法	70.8	62.5	48.5	92.7

Tab. 3 Comparison of AUC values on different datasets

算法	NIST16	COVERAGE	CASIA v1.0	Columbia
ELA	42.9	58.3	61.3	58.1
NOI	48.7	58.7	61.2	54.6
CFA1	50.1	48.5	52.2	72.0
ManTra-Net	79.5	81.9	81.7	82.4
RGB-N	93.7	81.7	79.5	85.8
TED-Net	96.0	—	83.0	87.0
本文算法	97.1	87.6	83.9	93.5

Tab. 4 Comparison of F1 scores and AUC values of different models on NIST16 and Columbia datasets

模型	NIST16		Columbia
模型	F1	AUC	F1	AUC
Base	59.7	94.1	85.1	86.6
Base-RF	61.3	94.0	86.2	87.1
Base-RF-RP	67.1	95.4	91.3	92.9
Base-RF-RP-Adap	70.5	96.9	92.5	93.5
本文算法	70.8	97.1	92.7	93.5

Fig. 6 Tampered area localization results of proposed algorithm on different datasets

Fig. 7 Comparison of localization effect of proposed algorithm and TED-Net algorithm on NIST16 dataset

Tab. 5 F1 scores of JPEG compression with different quality factors on NIST16 and Columbia datasets

压缩等级	NIST16	Columbia
100	0.708	0.927
90	0.603	0.795
70	0.571	0.742
50	0.568	0.735

References 23

1	KRAWETZ N. A picture’s worth digital image analysis and forensics［C/OL］ // Proceedings of the Black Hat Briefings USA 2007 ［2022-06-22］..
2	MAHDIAN B， SAIC S. Using noise inconsistencies for blind image forensics［J］. Image and Vision Computing， 2009， 27（10）： 1497-1503. 10.1016/j.imavis.2009.02.001
3	FERRARA P， BIANCHI T， DE ROSA A， et al. Image forgery localization via fine-grained analysis of CFA artifacts［J］. IEEE Transactions on Information Forensics and Security， 2012， 7（5）： 1566-1577. 10.1109/tifs.2012.2202227
4	BAYAR B， STAMM M C. A deep learning approach to universal image manipulation detection using a new convolutional layer［C］// Proceedings of the 4th ACM Workshop on Information Hiding and Multimedia Security. New York： ACM， 2016： 5-10. 10.1145/2909827.2930786
5	YANG Q W， PENG F， LI J T， et al. Image tamper detection based on noise estimation and lacunarity texture［J］. Multimedia Tools and Applications， 2016， 75（17）： 10201-10211. 10.1007/s11042-015-3079-2
6	BI X L， WEI Y， XIAO B， et al. RRU-Net： the ringed residual U-Net for image splicing forgery detection［C］// Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops. Piscataway： IEEE， 2019：30-39. 10.1109/cvprw.2019.00010
7	吴鹏，陈北京，郑雨鑫，等. 基于双流Faster R-CNN的像素级图像拼接篡改定位算法［J］. 电子测量与仪器学报， 2021， 35（4）：154-160.
	WU P， CHEN B J， ZHENG Y X， et al. Pixel-level image splicing localization algorithm based on dual-stream Faster R-CNN［J］. Journal of Electronic Measurement and Instrumentation， 2021， 35（4）： 154-160.
8	ZHONG J L， PUN C M. An end-to-end Dense-InceptionNet for image copy-move forgery detection［J］. IEEE Transactions on Information Forensics and Security， 2020 15： 2134-2146. 10.1109/tifs.2019.2957693
9	李应灿，杨建权，丁峰，等. 区分来源和目标区域的图像copy-move伪造检测方法［J］. 信号处理， 2020， 36（9）：1533-1543. 10.16798/j.issn.1003-0530.2020.09.019
	LI Y C， YANG J Q， DING F， et al. Copy-move detection method for distinguishing between source and target regions［J］. Journal of Signal Processing， 2020， 36（9）：1533-1543. 10.16798/j.issn.1003-0530.2020.09.019
10	ZHU X S， QIAN Y J， ZHAO X F， et al. A deep learning approach to patch-based image inpainting forensics［J］. Signal Processing： Image Communication， 2018， 67： 90-99. 10.1016/j.image.2018.05.015
11	WU Y， AbdALMAGEED W， NATARAJAN P. ManTra-Net： manipulation tracing network for detection and localization of image forgeries with anomalous features［C］// Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2019： 9535-9544. 10.1109/cvpr.2019.00977
12	BIACH F Z EL， IALA I， LAANAYA H， et al. Encoder-decoder based convolutional neural networks for image forgery detection［J］. Multimedia Tools and Applications， 2022， 81（16）： 22611-22628. 10.1007/s11042-020-10158-3
13	ZHUO L， TAN S Q， LI B， et al. Self-Adversarial training incorporating forgery attention for image forgery localization［J］. IEEE Transactions on Information Forensics and Security， 2022， 17： 819-834. 10.1109/tifs.2022.3152362
14	BAPPY J H， SIMONS C， NATARAJ L， et al. Hybrid LSTM and encoder-decoder architecture for detection of image forgeries［J］. IEEE Transactions on Image Processing， 2019， 28（7）： 3286-3300. 10.1109/tip.2019.2895466
15	ZHOU P， HAN X T， MORARIU V I， et al. Learning rich features for image manipulation detection［C］// Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2018： 1053-1061. 10.1109/cvpr.2018.00116
16	MAZUMDAR A， BORA P K. Two-stream encoder-decoder network for localizing image forgeries［J］. Journal of Visual Communication and Image Representation， 2022， 82： No.103417. 10.1016/j.jvcir.2021.103417
17	HE K M， ZHANG X Y， REN S Q， et al. Deep residual learning for image recognition［C］// Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2016： 770-778. 10.1109/cvpr.2016.90
18	FU J， LIU J， TIAN H J， et al. Dual attention network for scene segmentation［C］// Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2019： 3141-3149. 10.1109/cvpr.2019.00326
19	ZHU Y， CHEN C F， YAN G， et al. AR-Net： adaptive attention and residual refinement network for copy-move forgery detection［J］. IEEE Transactions on Industrial Informatics， 2020， 16（10）： 6714-6723. 10.1109/TII.2020.2982705
20	nimble NIST 2016 datasets［DS/OL］. ［2022-06-20］..
21	WEN B H， ZHU Y， SUBRAMANIAN R， et al. COVERAGE - a novel database for copy-move forgery detection［C］// Proceedings of the 2016 IEEE International Conference on Image Processing. Piscataway： IEEE， 2016： 161-165. 10.1109/icip.2016.7532339
22	DONG J， WANG W， TAN T N. CASIA image tampering detection evaluation database［C］// Proceedings of the 2013 IEEE China Summit and International Conference on Signal and Information Processing. Piscataway： IEEE， 2013： 422-426. 10.1109/chinasip.2013.6625374
23	NG T T， CHANG S F. A data set of authentic and spliced image blocks： DVENT Technical Report # 203-2004-3［R/OL］. （2004-06-08）［2022-06-23］..

[1]	Zhigang XU, Chuang ZHANG. Multi-level color restoration of mural image based on gated positional encoding [J]. Journal of Computer Applications, 2024, 44(9): 2931-2937.
[2]	Jing QIN, Zhiguang QIN, Fali LI, Yueheng PENG. Diagnosis of major depressive disorder based on probabilistic sparse self-attention neural network [J]. Journal of Computer Applications, 2024, 44(9): 2970-2974.
[3]	Liting LI, Bei HUA, Ruozhou HE, Kuang XU. Multivariate time series prediction model based on decoupled attention mechanism [J]. Journal of Computer Applications, 2024, 44(9): 2732-2738.
[4]	Zexin XU, Lei YANG, Kangshun LI. Shorter long-sequence time series forecasting model [J]. Journal of Computer Applications, 2024, 44(6): 1824-1831.
[5]	Yue LIU, Fang LIU, Aoyun WU, Qiuyue CHAI, Tianxiao WANG. 3D object detection network based on self-attention mechanism and graph convolution [J]. Journal of Computer Applications, 2024, 44(6): 1972-1977.
[6]	Rong HUANG, Junjie SONG, Shubo ZHOU, Hao LIU. Image aesthetic quality evaluation method based on self-supervised vision Transformer [J]. Journal of Computer Applications, 2024, 44(4): 1269-1276.
[7]	Weina DONG, Jia LIU, Xiaozhong PAN, Lifeng CHEN, Wenquan SUN. High-capacity robust image steganography scheme based on encoding-decoding network [J]. Journal of Computer Applications, 2024, 44(3): 772-779.
[8]	Xinran LUO, Tianrui LI, Zhen JIA. Chinese medical named entity recognition based on self-attention mechanism and lexicon enhancement [J]. Journal of Computer Applications, 2024, 44(2): 385-392.
[9]	Ziqi HUANG, Jianpeng HU. Entity category enhanced nested named entity recognition in automotive domain [J]. Journal of Computer Applications, 2024, 44(2): 377-384.
[10]	Liqing QIU, Xiaopan SU. Personalized multi-layer interest extraction click-through rate prediction model [J]. Journal of Computer Applications, 2024, 44(11): 3411-3418.
[11]	Yanbo LI, Qing HE, Shunyi LU. Aspect sentiment triplet extraction integrating semantic and syntactic information [J]. Journal of Computer Applications, 2024, 44(10): 3275-3280.
[12]	Xingyao YANG, Hongtao SHEN, Zulian ZHANG, Jiong YU, Jiaying CHEN, Dongxiao WANG. Sequential recommendation based on hierarchical filter and temporal convolution enhanced self-attention network [J]. Journal of Computer Applications, 2024, 44(10): 3090-3096.
[13]	Jia CHEN, Hong ZHANG. Image text retrieval method based on feature enhancement and semantic correlation matching [J]. Journal of Computer Applications, 2024, 44(1): 16-23.
[14]	Li’an CHEN, Yi GUO. Text sentiment analysis model based on individual bias information [J]. Journal of Computer Applications, 2024, 44(1): 145-151.
[15]	Hanxiao SHI, Leichun WANG. Short-term power load forecasting by graph convolutional network combining LSTM and self-attention mechanism [J]. Journal of Computer Applications, 2024, 44(1): 311-317.