Journal of Computer Applications ›› 2025, Vol. 45 ›› Issue (12): 3987-3994.DOI: 10.11772/j.issn.1001-9081.2024121742

• Multimedia computing and computer simulation • Previous Articles     Next Articles

High-quality sonar image generation method based on multi-scale feature fusion

Jing HUANG1,2, Xin PENG2, Wenhao LI2, Kai HU2, Teng WANG2, Yamin HUANG3,4, Yuanqiao WEN3,4   

  1. 1.Research and Development Center of Transport Industry of New Generation of Artificial Intelligence Technology,Zhejiang Scientific Research Institute of Transport,Hangzhou Zhejiang 310023,China
    2.College of Computer and Artificial Intelligence,Wuhan University of Technology,Wuhan Hubei 430070,China
    3.Intelligent Transportation Systems Research Center (Wuhan University of Technology),Wuhan Hubei 430070,China
    4.National Engineering Research Center for Water Transport Safety,Wuhan Hubei 430063,China
  • Received:2024-12-10 Revised:2025-03-14 Accepted:2025-03-20 Online:2025-04-08 Published:2025-12-10
  • Contact: Jing HUANG
  • About author:HUANG Jing, born in 1977, Ph. D., associate professor. His research interests include computer vision, artificial intelligence.
    PENG Xin, born in 2000, M. S. candidate. His research interests include computer vision, artificial intelligence.
    LI Wenhao, born in 1997, M. S. His research interests include computer vision, artificial intelligence.
    HU Kai, born in 2001, M. S. candidate. His research interests include computer vision, artificial intelligence.
    WANG Teng, born in 2000, M. S. candidate. His research interests include computer vision, artificial intelligence.
    HUANG Yamin, born in 1990, Ph. D., research fellow. His research interests include intelligent transportation.
    WEN Yuanqiao, born in 1975, Ph. D., professor. His research interests include intelligent transportation.
  • Supported by:
    Project of Research and Development Center of Transport Industry of New Generation of Artificial Intelligence Technology(202302H);Research Program of Zhejiang Provincial Department of Transportation(2024006)

多尺度特征融合的高质量声呐图像生成方法

黄靖1,2, 彭鑫2, 李文豪2, 胡凯2, 王腾2, 黄亚敏3,4, 文元桥3,4   

  1. 1.浙江省交通运输科学研究院 新一代人工智能技术应用交通运输行业研发中心,杭州 310023
    2.武汉理工大学 计算机与人工智能学院,武汉 430070
    3.智能交通系统研究中心(武汉理工大学),武汉 430070
    4.国家水运安全工程技术研究中心,武汉 430063
  • 通讯作者: 黄靖
  • 作者简介:黄靖(1977—),男,湖北潜江人,副教授,博士,主要研究方向:计算机视觉、人工智能
    彭鑫(2000—),男,湖南株洲人,硕士研究生,主要研究方向:计算机视觉、人工智能
    李文豪(1997—),男,河南周口人,硕士,主要研究方向:计算机视觉、人工智能
    胡凯(2001—),男,湖北鄂州人,硕士研究生,主要研究方向:计算机视觉、人工智能
    王腾(2000—),男,湖北黄冈人,硕士研究生,主要研究方向:计算机视觉、人工智能
    黄亚敏(1990—),男,福建福州人,研究员,博士生导师,博士,主要研究方向:智能交通
    文元桥(1975—),男,湖北松滋人,教授,博士生导师,博士,主要研究方向:智能交通。
  • 基金资助:
    新一代人工智能技术应用交通运输行业研发中心开放基金资助项目(202302H);浙江省交通厅科技项目(2024006)

Abstract:

Due to the inherent characteristics of sonar imaging principles and the interference of complex underwater environments, underwater sonar images generally suffer from insufficient resolution and missing target details. To address these issues, a high-quality sonar image generation method based on multi-scale feature fusion was proposed. Firstly, the Residual Dense Blocks (RDBs) were used to extract image features at shallow level, thereby capturing basic texture and contour information, and establishing spatial layout of the image. Secondly, a Multi-Scale Attention feature extraction module (MSA) was designed to focus on key features at different scales adaptively and further enhance the expression of key features while suppressing redundant information expression through the attention mechanism. Finally, a discriminator network was constructed using a pixel-by-pixel discrimination strategy based on spectral normalization, which improved the reconstruction ability of complex object contours and details. Experimental results on an underwater sonar image dataset show that the proposed method achieves relative improvements of 6.7% and 5.4%, respectively, in Peak Signal-to-Noise Ratio (PSNR) and Structural SIMilarity (SSIM) metrics compared to the existing representative method ESRGAN (Enhanced Super-Resolution Generative Adversarial Network). It can be seen that the proposed method improves the generation performance on underwater sonar image dataset effectively.

Key words: deep learning, high-quality image generation, Generative Adversarial Network (GAN), sonar image processing, feature fusion

摘要:

由于声呐图像成像原理本身的特点和复杂水下环境的干扰,水下声呐图像普遍存在分辨率不足和目标细节缺失的问题。针对这些问题,提出一种多尺度特征融合的高质量声呐图像生成方法。首先,通过残差密集连接模块(RDB)在浅层提取图像特征,以捕捉基本纹理与轮廓信息,并建立图像的空间布局;其次,设计多尺度注意力特征提取模块(MSA)自适应地聚焦不同尺度下的关键特征,并通过注意力机制在进一步增强关键特征表达的同时抑制冗余信息表达;最后,以基于谱归一化的逐像素判别策略构建判别器网络,从而提升复杂物体边缘和细节的重建能力。在水下声呐图像数据集上的实验结果表明,所提方法相较于现有代表性方法ESRGAN (Enhanced Super-Resolution Generative Adversarial Network)在峰值信噪比(PSNR)和结构相似性(SSIM)指标上分别取得了6.7%和5.4%的提升。可见,所提方法在水下声呐图像数据集上有效提升了生成效果。

关键词: 深度学习, 高质量图像生成, 生成对抗网络, 声呐图像处理, 特征融合

CLC Number: