《计算机应用》唯一官方网站

• •    下一篇

双编码空频混合的红外小目标检测

边小勇1,1,袁培洋1,胡其仁2   

  1. 1. 武汉科技大学 计算机科学与技术学院,武汉 430065
    2. 武汉科技大学
  • 收稿日期:2025-01-21 修回日期:2025-03-11 发布日期:2025-04-27 出版日期:2025-04-27
  • 通讯作者: 边小勇

Dual-coding space-frequency mixing for infrared small target detection

  • Received:2025-01-21 Revised:2025-03-11 Online:2025-04-27 Published:2025-04-27
  • Contact: BIAN Xiao-yong

摘要: 红外小目标检测(IRSTD)旨在从低信杂比的红外图像中精准找到目标,在多个领域获得了非常广泛的应用。但现有方法因目标特征微弱、背景干扰严重,难以有效提取目标结构性信息,从而导致目标分割不完整、检测精度低等问题,并且模型参数量较大。为了克服以上问题,提出了双编码空频混合的红外小目标检测方法。首先,采用U-Net3+作为基本框架,在编码阶段提出一种多形状上下文感知模块和频域交互注意力模块相结合的双编码结构提取空频混合特征;其次,在解码阶段设计了跨层特征引导模块,用于融合多尺度下的特征图;所提方法分别在NUAA-SIRST和IRSTD-1k数据集上进行了实验验证,参数量为0.86×10^6,交并比(IoU)分别达到了78.11%和69.08%。与注意力多尺度特征融合U型网络(AMFUNet)相比,参数量减少了1.31×106,IoU分别提升了2.25个百分点和1.23个百分点。实验结果表明,所提方法在保留较少参数量的同时具有较高的检测性能。

关键词: 深度学习, 红外小目标检测, 双编码, 空频混合, 跨层引导

Abstract: Infrared Small Target Detection (IRSTD) aims to accurately find targets from infrared images with low signal-to-clutter ratio, and has been widely used in many fields. However, due to the weak target features and severe background interference, existing methods struggle to effectively extract the structural information of the target. This leads to issues such as incomplete target segmentation and low detection accuracy. Moreover, these models usually have a large number of parameters. To overcome above problem, a dual-coding space-frequency mixing IRSTD method was proposed. Firstly, using U-Net3+ as the basic framework, a dual-coding structure combining Multi-Shape Context Aware module and Frequency-Domain Interactive Attention module was proposed to extract space-frequency mixing features in the coding stage. Secondly, in the decoding stage, a Cross-Layer Feature Guide module is designed to fuse multi-scale feature maps. The proposed method is experimentally verified on NUAA-SIRST and IRSTD-1k datasets, the number of parameters is 0.86 M, and the Intersection over Union (IoU) values reach 78.11% and 69.08% respectively. Compared with the Attention Multiscale Feature Fusion U-Net (AMFUNet), the number of parameters is reduced by 1.31 M, and the IoU values are increased by 2.25 percentage points and 1.23 percentage points respectively. The experimental results show that the proposed method has high detection performance while retaining fewer parameters.

Key words: deep learning, infrared small target detection, dual-coding, space-frequency mixing, cross-layer guide

中图分类号: