Dual-coding space-frequency mixing for infrared small target detection

doi:10.11772/j.issn.1001-9081.2025010078

Journal of Computer Applications

BIAN Xiao-yong¹, YUAN Pei-yang¹, HU Qi-ren²

1. School of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan 430065
2. Wuhan University of Science and Technology

Received: 2025-01-21 Revised: 2025-03-11 Online: 2025-04-27 Published: 2025-04-27
Corresponding author: BIAN Xiao-yong

Abstract: Infrared Small Target Detection (IRSTD) aims to accurately locate targets in infrared images with a low signal-to-clutter ratio, and is widely used in many fields. However, because target features are weak and background interference is severe, existing methods struggle to extract the structural information of the target, which leads to incomplete target segmentation and low detection accuracy. These models also tend to have a large number of parameters. To address these problems, a dual-coding space-frequency mixing IRSTD method is proposed. Firstly, with U-Net3+ as the basic framework, a dual-coding structure that combines a Multi-Shape Context Aware module with a Frequency-Domain Interactive Attention module is proposed to extract space-frequency mixing features in the encoding stage. Secondly, in the decoding stage, a Cross-Layer Feature Guide module is designed to fuse feature maps at multiple scales. The proposed method was experimentally verified on the NUAA-SIRST and IRSTD-1k datasets: with 0.86 M (0.86×10^6) parameters, it reaches Intersection over Union (IoU) values of 78.11% and 69.08%, respectively. Compared with the Attention Multiscale Feature Fusion U-Net (AMFUNet), the number of parameters is reduced by 1.31 M (1.31×10^6), and the IoU values are increased by 2.25 and 1.23 percentage points, respectively. The experimental results show that the proposed method achieves high detection performance while keeping the parameter count low.

Key words: deep learning, infrared small target detection, dual-coding, space-frequency mixing, cross-layer guide
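The abstract reports results as Intersection over Union (IoU). As a rough illustration of what that metric measures, the following minimal sketch computes IoU between a predicted and a ground-truth binary mask. The function name `iou`, the nested-list mask representation, and the empty-mask convention are assumptions made for this example; the abstract does not state how the authors aggregate IoU over a dataset, so this should not be read as their evaluation code.

```python
def iou(pred, target):
    """Intersection over Union of two binary masks given as nested lists of 0/1.

    Hypothetical helper for illustration only; not the paper's evaluation code.
    """
    inter = 0  # pixels marked as target in both masks
    union = 0  # pixels marked as target in at least one mask
    for p_row, t_row in zip(pred, target):
        for p, t in zip(p_row, t_row):
            p, t = bool(p), bool(t)
            inter += p and t
            union += p or t
    # Assumed convention: two empty masks count as a perfect match.
    return inter / union if union else 1.0


pred = [[0, 1, 1],
        [0, 1, 0],
        [0, 0, 0]]
target = [[0, 1, 0],
          [0, 1, 1],
          [0, 0, 0]]
print(iou(pred, target))  # 2 shared pixels / 4 pixels in the union = 0.5
```

Because infrared small targets often cover only a handful of pixels, a single missed or extra pixel changes the ratio noticeably, which is one reason incomplete target segmentation shows up directly in IoU.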

CLC Number: TP391.4

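BIAN Xiao-yong, YUAN Pei-yang, HU Qi-ren. Dual-coding space-frequency mixing for infrared small target detection[J]. Journal of Computer Applications, DOI: 10.11772/j.issn.1001-9081.2025010078.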

