Journal of Computer Applications ›› 2026, Vol. 46 ›› Issue (1): 252-259.DOI: 10.11772/j.issn.1001-9081.2025010078

• Multimedia computing and computer simulation • Previous Articles     Next Articles

Dual-coding space-frequency mixing method for infrared small target detection

Xiaoyong BIAN1,2,3(), Peiyang YUAN1, Qiren HU1   

  1. 1.School of Computer Science and Technology,Wuhan University of Science and Technology,Wuhan Hubei 430065,China
    2.Institute of Big Data Science and Engineering,Wuhan University of Science and Technology,Wuhan Hubei 430081,China
    3.Key Laboratory of Hubei Province for Intelligent Information Processing and Real-time Industrial System (Wuhan University of Science and Technology),Wuhan Hubei 430081,China
  • Received:2025-01-21 Revised:2025-03-11 Accepted:2025-03-11 Online:2026-01-10 Published:2026-01-10
  • Contact: Xiaoyong BIAN
  • About author:YUAN Peiyang, born in 2000, M. S. candidate. His research interests include small target detection.
    HU Qiren, born in 1995, M. S. candidate. His research interests include small target detection.

双编码空频混合的红外小目标检测方法

边小勇1,2,3(), 袁培洋1, 胡其仁1   

  1. 1.武汉科技大学 计算机科学与技术学院,武汉 430065
    2.武汉科技大学 大数据科学与工程研究院,武汉 430081
    3.智能信息处理与实时工业系统湖北省重点实验室(武汉科技大学),武汉 430081
  • 通讯作者: 边小勇
  • 作者简介:袁培洋(2000—),男,河南汝州人,硕士研究生,主要研究方向:小目标检测
    胡其仁(1995—),男,湖北仙桃人,硕士研究生,主要研究方向:小目标检测。

Abstract:

InfraRed Small Target Detection (IRSTD) aims to find targets from infrared images with low Signal-to-Clutter Ratio (SCR) accurately, and has been used in many fields widely. However, due to the weak target features and severe background interference, the existing methods struggle to extract structural information of the target effectively, which leads to issues such as incomplete target segmentation and low detection accuracy. Moreover, these models usually have a large number of parameters. To solve the above problems, a dual-coding space-frequency mixing IRSTD method was proposed. Firstly, with using U-Net3+ as the basic framework, a dual-coding structure combining Multi-Shape Context Aware (MSCA) module and Frequency-Domain Interactive Attention (FDIA) module was proposed to extract space-frequency mixing features in the coding stage. Secondly, in the decoding stage, a Cross-Layer Feature Guide (CLFG) module was designed to fuse multi-scale feature maps. Finally, the proposed method was experimentally verified on the NUAA-SIRST and IRSTD-1k datasets. The results show that the proposed method has the number of parameters of 0.86×106, and the Intersection over Union (IoU) reached 78.11% and 69.08% respectively. Compared with Attention Multiscale feature Fusion U-Net (AMFUNet) method, the number of parameters is reduced by 1.31×106, and the IoUs are increased by 2.25 percentage points and 1.23 percentage points respectively. It can be seen that the proposed method has high detection performance while retaining fewer parameters.

Key words: deep learning, Infrared Small Target Detection (IRSTD), dual-coding, space-frequency mixing, cross-layer guidance

摘要:

红外小目标检测(IRSTD)旨在从低信杂比(SCR)的红外图像中精准找到目标,在多个领域获得了非常广泛的应用。然而,现有方法因目标特征微弱和背景干扰严重,难以有效提取目标结构性信息,进而导致目标分割不完整和检测精度低等问题,并且模型参数量较大。为了解决以上问题,提出双编码空频混合的IRSTD方法。首先,采用U-Net3+作为基本框架,并在编码阶段提出一种多形状上下文感知(MSCA)模块和频域交互注意力(FDIA)模块相结合的双编码结构来提取空频混合特征;其次,在解码阶段设计跨层特征引导(CLFG)模块来融合多尺度下的特征图;最后,所提方法分别在NUAA-SIRST和IRSTD-1k数据集上进行了实验验证。结果表明,所提方法的参数量为0.86×106,交并比(IoU)分别达到了78.11%和69.08%,与注意力多尺度特征融合U型网络(AMFUNet)方法相比,参数量减少了1.31×106, IoU分别提升了2.25和1.23个百分点。可见,所提方法在保留较少参数量的同时具有较高的检测性能。

关键词: 深度学习, 红外小目标检测, 双编码, 空频混合, 跨层引导

CLC Number: