Dual U-Former image deraining network based on non-separable lifting wavelet

doi:10.11772/j.issn.1001-9081.2022091422

Abstract

Abstract:

Aiming at the problem that the deraining methods based on tensor product wavelet cannot capture high-frequency rain streaks in all directions， a Dual U-Former Network （DUFN） based on non-separable lifting wavelet was proposed. Firstly， the isotropic non-separable lifting wavelet was used to capture high-frequency rain streaks in all directions. In this way， compared with tensor product wavelets such as Haar wavelet， which can only capture high-frequency rain streaks in three directions， DUFN was able to obtain more comprehensive rain streak information. Secondly， two U-Nets composed of Transformer Blocks （TBs） were connected in series at various scales， so that the semantic features of the shallow decoder were transferred to the deep stage， and the rain streaks were removed more thoroughly. At the same time， the scale-guide encoder was used to guide the coding stage by using the information of various scales in the shallow layer， and Gated Fusion Module （GFM） based on CBAM （Convolutional Block Attention Module） was used to make the fusion process put more focus on the rain area. Experimental results on Rain200H， Rain200L， Rain1200 and Rain12 synthetic datasets show that the Structure SIMilarity （SSIM） of DUFN is improved by 0.009 7 on average compared to that of the advanced method SPDNet （Structure-Preserving Deraining Network）. And on Rain200H， Rain200L and Rain12 synthetic datasets， the Peak Signal-to-Noise Ratio （PSNR） of DUFN is improved by 0.657 dB averagely. On real-world dataset SPA-Data， PSNR and SSIM of DUFN are improved by 0.976 dB and 0.003 1 respectively compared with those of the advanced method ECNetLL （Embedding Consistency Network+Layered Long short-term memory）. The above verifies that DUFN can improve the rain removal performance by enhancing the ability to capture high-frequency information.

Key words: image deraining, non-separable lifting wavelet, multi-scale, Transformer, scale-guide

摘要：

针对基于张量积小波的去雨方法无法捕获所有方向的高频雨纹的问题，提出基于不可分提升小波的双U-Former网络（DUFN）。首先，利用各向同性的不可分提升小波捕捉各个方向的高频雨纹，相较于哈尔小波等张量积小波只能捕捉3个方向的高频雨纹，DUFN能获得更全面的雨纹信息；其次，在各尺度上串联两个由Transformer Block（TB）构成的U-Net，将浅层解码器的语义特征传递到深层阶段，并更彻底地去除雨纹；同时，使用尺度引导编码器通过浅层各尺度信息引导编码阶段，并利用基于CBAM（Convolutional Block Attention Module）的门控融合模块（GFM）使融合过程更专注于有雨区域。实验结果表明，相较于先进方法SPDNet（Structure-Preserving Deraining Network），在Rain200H、Rain200L、Rain1200和Rain12这4个合成数据集上，DUFN的结构相似度（SSIM）平均提高了0.009 7，在Rain200H、Rain200L和Rain12这3个合成数据集上，DUFN的峰值信噪比（PSNR）平均提高了0.657 dB；在真实世界数据集SPA-Data上，相较于先进方法ECNetLL（Embedding Consistency Network+Layered Long short-term memory），DUFN的PSNR和SSIM分别提高了0.976 dB和0.003 1。验证了DUFN可以通过增强捕捉高频信息的能力提升去雨性能。

关键词: 图像去雨, 不可分提升小波, 多尺度, Transformer, 尺度引导

CLC Number:

TP391.4

Bin LIU, Siyan FANG. Dual U-Former image deraining network based on non-separable lifting wavelet[J]. Journal of Computer Applications, 2023, 43(10): 3251-3259.

刘斌, 方思严. 基于不可分提升小波的双U-Former图像去雨网络[J]. 《计算机应用》唯一官方网站, 2023, 43(10): 3251-3259.

Figures/Tables 18

References 37

1	LIU J， YANG W， YANG S， et al. Erase or fill？ deep joint recurrent rain removal and reconstruction in videos［C］// Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2018： 3233-3242. 10.1109/cvpr.2018.00341
2	YANG W， LIU J， FENG J. Frame-consistent recurrent video deraining with dual-level flow［C］// Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2019： 1661-1670. 10.1109/cvpr.2019.00176
3	KANG L W， LIN C W， FU Y H. Automatic single-image-based rain streaks removal via image decomposition［J］. IEEE Transactions on Image Processing， 2012， 21（4）： 1742-1755. 10.1109/tip.2011.2179057
4	LI Y， TAN R T， GUO X， et al. Rain streak removal using layer priors［C］// Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2016： 2736-2744. 10.1109/cvpr.2016.299
5	CHEN Y L， HSU C T. A generalized low-rank appearance model for spatio-temporally correlated rain streaks［C］// Proceedings of the 2013 IEEE International Conference on Computer Vision. Piscataway： IEEE， 2013： 1968-1975. 10.1109/iccv.2013.247
6	FU X， HUANG J， DING X， et al. Clearing the skies： a deep network architecture for single-image rain removal［J］. IEEE Transactions on Image Processing， 2017， 26（6）： 2944-2956. 10.1109/tip.2017.2691802
7	FU X， HUANG J， ZENG D， et al. Removing rain from single images via a deep detail network［C］// Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2017： 1715-1723. 10.1109/cvpr.2017.186
8	LI Y， MONNO Y， OKUTOMI M. Single image deraining network with rain embedding consistency and layered LSTM［C］// Proceedings of the 2022 IEEE/CVF Winter Conference on Applications of Computer Vision. Piscataway： IEEE， 2022： 3957-3966. 10.1109/wacv51458.2022.00401
9	CHEN H， WANG Y， GUO T， et al. Pre-trained image processing transformer［C］// Proceedings of the 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2021： 12294-12305. 10.1109/cvpr46437.2021.01212
10	LI X， WU J， LIN Z， et al. Recurrent squeeze-and-excitation context aggregation net for single image deraining［C］// Proceedings of the 2018 European Conference on Computer Vision， LNCS 11211. Cham： Springer， 2018： 262-277.
11	JIANG K， WANG Z， YI P， et al. Multi-scale progressive fusion network for single image deraining［C］// Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2020： 8343-8352. 10.1109/cvpr42600.2020.00837
12	WANG C， XING X， WU Y， et al. DCSFN： deep cross-scale fusion network for single image rain removal［C］// Proceedings of the 28th ACM International Conference on Multimedia. New York： ACM， 2020： 1643-1651. 10.1145/3394171.3413820
13	WANG C， ZHU H， FAN W， et al. Single image rain removal using recurrent scale-guide networks［J］. Neurocomputing， 2022， 467： 242-255. 10.1016/j.neucom.2021.10.029
14	YI Q， LI J， DAI Q， et al. Structure-preserving deraining with residue channel prior guidance［C］// Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision. Piscataway： IEEE， 2021： 4218-4227. 10.1109/iccv48922.2021.00420
15	LIU B， LIU W. The lifting factorization of 2D 4-channel nonseparable wavelet transforms［J］. Information Sciences， 2018， 456： 113-130. 10.1016/j.ins.2018.05.012
16	RONNEBERGER O， FISCHER P， BROX T. U-net： convolutional networks for biomedical image segmentation［C］// Proceedings of the 2015 International Conference on Medical Image Computing and Computer-Assisted Intervention， LNCS 9351. Cham： Springer， 2015： 234-241.
17	PARK Y， JEON M， LEE J， et al. MCW-Net： single image deraining with multi-level connections and wide regional non-local blocks［J］. Signal Processing： Image Communication， 2022， 105： No.116701. 10.1016/j.image.2022.116701
18	LIN H， JING C， HUANG Y， et al. A²Net： adjacent aggregation networks for image raindrop removal［J］. IEEE Access， 2020， 8： 60769-60779. 10.1109/access.2020.2983087
19	JHA D， RIEGLER M A， JOHANSEN D， et al. DoubleU-Net： a deep convolutional neural network for medical image segmentation［C］// Proceedings of the IEEE 33rd International Symposium on Computer-Based Medical Systems. Piscataway： IEEE， 2020： 558-564. 10.1109/cbms49503.2020.00111
20	WANG Z， CUN X， BAO J， et al. Uformer： a general U-shaped Transformer for image restoration［C］// Proceedings of the 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2022： 17662-17672. 10.1109/cvpr52688.2022.01716
21	ZAMIR S W， ARORA A， KHAN S， et al. Restormer： efficient Transformer for high-resolution image restoration［C］// Proceedings of the 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2022： 5718-5729. 10.1109/cvpr52688.2022.00564
22	WOO S， PARK J， LEE J Y， et al. CBAM： convolutional block attention module［C］// Proceedings of the 2018 European Conference on Computer Vision， LNCS 11211. Cham： Springer， 2018： 3-19.
23	刘斌，彭嘉雄. 基于四通道不可分加性小波的多光谱图像融合［J］. 计算机学报， 2009， 32（2）： 350-356. 10.3724/sp.j.1016.2009.00350
	LIU B， PENG J X. Fusion method of multi-spectral image and panchromatic image based on four channels non-sperable additive wavelets［J］. Chinese Journal of Computers， 2009， 32（2）： 350-356. 10.3724/sp.j.1016.2009.00350
24	QIN X， WANG Z， BAI Y， et al. FFA-Net： feature fusion attention network for single image dehazing［C］// Proceedings of the 34th AAAI Conference on Artificial Intelligence. Palo Alto， CA： AAAI Press， 2020： 11908-11915. 10.1609/aaai.v34i07.6865
25	WANG Z， BOVIK A C， SHEIKH H R， et al. Image quality assessment： from error visibility to structural similarity［J］. IEEE Transactions on Image Processing， 2004， 13（4）： 600-612. 10.1109/tip.2003.819861
26	YANG W， TAN R T， FENG J， et al. Deep joint rain detection and removal from a single image［C］// Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2017： 1685-1694. 10.1109/cvpr.2017.183
27	ZHANG H， PATEL V M. Density-aware single image de-raining using a multi-stream dense network［C］// Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2018： 695-704. 10.1109/cvpr.2018.00079
28	WANG T， YANG X， XU K， et al. Spatial attentive single-image deraining with a high quality real rain dataset［C］// Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2019： 12262-12271. 10.1109/cvpr.2019.01255
29	WEI W， MENG D， ZHAO Q， et al. Semi-supervised transfer learning for image rain removal［C］// Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2019： 3872-3881. 10.1109/cvpr.2019.00400
30	HUYNH-THU Q， GHANBARI M. Scope of validity of PSNR in image/video quality assessment［J］. Electronics Letters， 2008， 44（13）： 800-801. 10.1049/el:20080522
31	KINGMA D P， BA J L. Adam： a method for stochastic optimization［EB/OL］. （2017-01-30）［2022-05-15］..
32	REN D， ZUO W， HU Q， et al. Progressive image deraining networks： a better and simpler baseline［C］// Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2019： 3932-3941. 10.1109/cvpr.2019.00406
33	REN D， SHANG W， ZHU P， et al. Single image deraining using bilateral recurrent network［J］. IEEE Transactions on Image Processing， 2020， 29： 6852-6863. 10.1109/tip.2020.2994443
34	WANG H， XIE Q， ZHAO Q， et al. A model-driven deep neural network for single image rain removal［C］// Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2020： 3100-3109. 10.1109/cvpr42600.2020.00317
35	GUO Q， SUN J， JUEFEI-XU F， et al. EfficientDeRain： learning pixel-wise dilation filtering for high-efficiency single-image deraining［C］// Proceedings of the 35th AAAI Conference on Artificial Intelligence. Palo Alto， CA： AAAI Press， 2021： 1487-1495. 10.1609/aaai.v35i2.16239
36	CUI X， WANG C， REN D， et al. Semi-supervised image deraining using knowledge distillation［J］. IEEE Transactions on Circuits and Systems for Video Technology， 2022， 32（12）： 8327-8341. 10.1109/tcsvt.2022.3190516
37	Ultralytics. YOLOv5［EB/OL］. ［2022-04-25］.. 10.1117/1.jei.31.3.033033

方法	Rain200H		Rain200L		Rain1200		Rain12		参数量/10⁶
方法	PSNR/dB	SSIM	PSNR/dB	SSIM	PSNR/dB	SSIM	PSNR/dB	SSIM	参数量/10⁶
RESCAN	26.577	0.839 7	36.965	0.978 5	32.131	0.903 7	32.172	0.948 5	0.15
PReNet	27.956	0.892 0	36.563	0.980 4	32.096	0.915 1	35.424	0.966 5	0.17
SPANet	25.009	0.848 0	34.118	0.970 7	27.099	0.808 2	32.400	0.948 9	0.28
BRN	28.843	0.906 8	37.454	0.983 0	31.998	0.914 9	35.194	0.966 2	0.41
DCSFN	28.587	0.903 7	36.952	0.980 3	32.275	0.922 8	35.607	0.968 0	6.45
RCDNet	29.268	0.899 6	38.519	0.984 7	32.516	0.915 9	35.573	0.966 1	2.98
EfDeRain	24.539	0.834 4	30.498	0.945 5	31.098	0.887 5	33.372	0.953 9	27.40
SPDNet	29.867	0.910 3	39.152	0.986 1	32.810	0.916 2	36.814	0.967 4	3.32
SSID-KD	28.707	0.900 5	37.041	0.980 6	32.424	0.920 2	35.473	0.968 2	4.43
DUFN	30.810	0.932 7	39.715	0.988 5	32.618	0.924 8	37.280	0.972 6	5.44

方法	Rain200H		Rain200L		Rain1200		Rain12		参数量/10⁶
方法	PSNR/dB	SSIM	PSNR/dB	SSIM	PSNR/dB	SSIM	PSNR/dB	SSIM	参数量/10⁶
RESCAN	26.577	0.839 7	36.965	0.978 5	32.131	0.903 7	32.172	0.948 5	0.15
PReNet	27.956	0.892 0	36.563	0.980 4	32.096	0.915 1	35.424	0.966 5	0.17
SPANet	25.009	0.848 0	34.118	0.970 7	27.099	0.808 2	32.400	0.948 9	0.28
BRN	28.843	0.906 8	37.454	0.983 0	31.998	0.914 9	35.194	0.966 2	0.41
DCSFN	28.587	0.903 7	36.952	0.980 3	32.275	0.922 8	35.607	0.968 0	6.45
RCDNet	29.268	0.899 6	38.519	0.984 7	32.516	0.915 9	35.573	0.966 1	2.98
EfDeRain	24.539	0.834 4	30.498	0.945 5	31.098	0.887 5	33.372	0.953 9	27.40
SPDNet	29.867	0.910 3	39.152	0.986 1	32.810	0.916 2	36.814	0.967 4	3.32
SSID-KD	28.707	0.900 5	37.041	0.980 6	32.424	0.920 2	35.473	0.968 2	4.43
DUFN	30.810	0.932 7	39.715	0.988 5	32.618	0.924 8	37.280	0.972 6	5.44

方法	PSNR/dB	SSIM	方法	PSNR/dB	SSIM
RESCAN	36.445	0.966 4	SPDNet	41.328	0.982 9
SPANet	38.505	0.978 6	ECNetLL	41.853	0.986 9
RCDNet	39.683	0.979 7	DUFN	42.829	0.990 0
EfDeRain	40.663	0.981 0

方法	PSNR/dB	SSIM	方法	PSNR/dB	SSIM
RESCAN	36.445	0.966 4	SPDNet	41.328	0.982 9
SPANet	38.505	0.978 6	ECNetLL	41.853	0.986 9
RCDNet	39.683	0.979 7	DUFN	42.829	0.990 0
EfDeRain	40.663	0.981 0

实验序号	采样方式			Rain200H
实验序号	NLWT+INLWT	DWT+IDWT	最大池化+双线性插值	PSNR/dB	SSIM
1	√			30.810	0.932 7
2		√		30.664	0.931 5
3			√	29.520	0.925 6