SAR and visible image fusion based on residual Swin Transformer

doi:10.11772/j.issn.1001-9081.2024081166

Journal of Computer Applications

SAR and visible image fusion based on residual Swin Transformer

,Li-qun LIU

Received:2024-08-19 Revised:2024-11-12 Online:2025-02-17 Published:2025-02-17
Contact: Li-qun LIU
Supported by:
Gansu Provincial University Teacher Innovation Fund Project;Science and Technology Program of Gansu Province;Youth Mentor Fund of Gansu Agricultural University

基于残差 Swin Transformer 的 SAR 与可见光图像融合

李进¹,刘立群²

1. 甘肃农业大学理学院
2. 甘肃农业大学信息科学技术学院

通讯作者: 刘立群
基金资助:
甘肃省高校教师创新基金项目;甘肃省科技计划项;甘肃农业大学青年导师基金项目

Abstract

Abstract: Abstract: In the fusion of synthetic aperture radar and visible light images, the existing methods usually face the challenges of large modal differences, information loss and high computational complexity, so a SAR and visible image fusion algorithm based on residual Swin transformer model is proposed. Firstly, Swin transformer was used as the backbone to extract global features, and a full-attention feature coding backbone network was used to model remote dependencies. Secondly, in order to improve the fusion effect, three different fusion strategies were designed: the feature fusion strategy based on the L1 norm of the sequence matrix, the fusion strategy based on the image pyramid and downsampling, and then the final fusion result was obtained by weighting the average of the three results, which effectively adjusted the pixel value and reduced the noise of the SAR image, better retained the clear details and structural information of the visible light image, and fused the feature information of the SAR image and the visible light image at different scales. Finally, a large number of experiments are carried out on the SEN1-2 dataset and the QXS-SARPT dataset, and the results show that compared with other fusion methods, the objective evaluation indexes AG, MI, SD, PSNR, SF and EN of the proposed method are increased by 13.44%, 18.12%, 16.72%, 6.34%, 22.93% and 12.69%, respectively, and the results are better than those of the existing methods. The ablation experiments also proved the effectiveness of the improved fusion strategy, which provided new ideas and methods for further research in the field of fusion of SAR and visible images.

Key words: Keywords: Synthetic Aperture Radar, visible image, image fusion, transformer, deep learning

摘要： 摘要: 在合成孔径雷达与可见光图像的融合研究中，现有方法通常面临着模态间差异大、信息丢失和计算复杂度高等挑战，因此提出了一种基于残差Swin transformer模型的SAR和可见光图像融合算法。首先，采用Swin transformer作为主干来完成提取全局特征，用一个全注意力特征编码主干网络来建模远程依赖关系；其次，为了提高融合效果，设计了三种不同的融合策略：基于序列矩阵的L1范数的特征融合策略、基于图像金字塔的融合策略及下采样，再将三个结果加权平均得到最终的融合结果，有效地调节像素值并减少SAR图像的噪声，更好地保留可见光图像清晰的细节和结构信息，融合了SAR图像和可见光图像不同尺度的地物特征信息。最后，在SEN1-2数据集以及QXS-SAROPT数据集上进行了大量实验验证，结果表明，与其他融合方法相比，所提方法的客观评价指标AG、MI、SD、PSNR、SF及EN分别平均提升了13.44%、18.12%、16.72%、6.34%、22.93%和12.69%，得到了较现有方法更佳的结果，通过消融实验也证明了改进的融合策略具备有效性，为SAR和可见光图像的融合领域的进一步研究提供了新的思路和方法

关键词: 合成孔径雷达, 可见光图像, 图像融合, transformer, 深度学习

CLC Number:

TP391

Li-qun LIU. SAR and visible image fusion based on residual Swin Transformer[J]. Journal of Computer Applications, DOI: 10.11772/j.issn.1001-9081.2024081166.

李进刘立群. 基于残差 Swin Transformer 的 SAR 与可见光图像融合[J]. 《计算机应用》唯一官方网站, DOI: 10.11772/j.issn.1001-9081.2024081166.

[1]	Kai CHEN, Hailiang YE, Feilong CAO. Classification algorithm for point cloud based on local-global interaction and structural Transformer [J]. Journal of Computer Applications, 2025, 45(5): 1671-1676.
[2]	Pengyu CHEN, Xiushan NIE, Nanjun LI, Tuo LI. Semi-supervised video object segmentation method based on spatio-temporal decoupling and regional robustness enhancement [J]. Journal of Computer Applications, 2025, 45(5): 1379-1386.
[3]	Dan WANG, Wenhao ZHANG, Lijuan PENG. Channel estimation of reconfigurable intelligent surface assisted communication system based on deep learning [J]. Journal of Computer Applications, 2025, 45(5): 1613-1618.
[4]	Sijie NIU, Yuliang LIU. Auxiliary diagnostic method for retinopathy based on dual-branch structure with knowledge distillation [J]. Journal of Computer Applications, 2025, 45(5): 1410-1414.
[5]	Pengcheng XU, Lei HE, Chuan LI, Weiqi QIAN, Tun ZHAO. Deep symbolic regression method based on Transformer [J]. Journal of Computer Applications, 2025, 45(5): 1455-1463.
[6]	Hui LI, Bingzhi JIA, Chenxi WANG, Ziyu DONG, Jilong LI, Zhaoman ZHONG, Yanyan CHEN. Generative adversarial network underwater image enhancement model based on Swin Transformer [J]. Journal of Computer Applications, 2025, 45(5): 1439-1446.
[7]	Wenpeng WANG, Yinchang QIN, Wenxuan SHI. Review of unsupervised deep learning methods for industrial defect detection [J]. Journal of Computer Applications, 2025, 45(5): 1658-1670.
[8]	Xueying LI, Kun YANG, Guoqing TU, Shubo LIU. Adversarial sample generation method for time-series data based on local augmentation [J]. Journal of Computer Applications, 2025, 45(5): 1573-1581.
[9]	Yang ZHOU, Hui LI. Remote sensing image building extraction network based on dual promotion of semantic and detailed features [J]. Journal of Computer Applications, 2025, 45(4): 1310-1316.
[10]	Lihu PAN, Shouxin PENG, Rui ZHANG, Zhiyang XUE, Xuzhen MAO. Video anomaly detection for moving foreground regions [J]. Journal of Computer Applications, 2025, 45(4): 1300-1309.
[11]	Yiding WANG, Zehao WANG, Yaoli LI, Shaoqing CAI, Yuan YUAN. Multi-scale 2D-Adaboost microscopic image recognition algorithm of Chinese medicinal materials powder [J]. Journal of Computer Applications, 2025, 45(4): 1325-1332.
[12]	Baohua YUAN, Jialu CHEN, Huan WANG. Medical image segmentation network integrating multi-scale semantics and parallel double-branch [J]. Journal of Computer Applications, 2025, 45(3): 988-995.
[13]	Ruilong CHEN, Tao HU, Youjun BU, Peng YI, Xianjun HU, Wei QIAO. Stacking ensemble adversarial defense method for encrypted malicious traffic detection model [J]. Journal of Computer Applications, 2025, 45(3): 864-871.
[14]	Zhenhua XUE, Qiang LI, Chao HUANG. Vision foundation model-driven pixel-level image anomaly detection method [J]. Journal of Computer Applications, 2025, 45(3): 823-831.
[15]	Zirong HONG, Guangqing BAO. Review of radar automatic target recognition based on ensemble learning [J]. Journal of Computer Applications, 2025, 45(2): 371-382.

SAR and visible image fusion based on residual Swin Transformer

基于残差 Swin Transformer 的 SAR 与可见光图像融合

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics