Journal of Computer Applications ›› 2025, Vol. 45 ›› Issue (11): 3707-3712.DOI: 10.11772/j.issn.1001-9081.2024101554

• Multimedia computing and computer simulation • Previous Articles    

Multi-attention contrastive learning for infrared small target detection

Xiaoyong BIAN1,2,3(), Qiren HU1   

  1. 1.School of Computer Science and Technology,Wuhan University of Science and Technology,Wuhan Hubei 430065,China
    2.Institute of Big Data Science and Engineering,Wuhan University of Science and Technology,Wuhan Hubei 430065,China
    3.Key Laboratory of Hubei Province for Intelligent Information Processing and Real-time Industrial System (Wuhan University of Science and Technology),Wuhan Hubei 430065,China
  • Received:2024-11-01 Revised:2025-01-09 Accepted:2025-01-09 Online:2025-01-13 Published:2025-11-10
  • Contact: Xiaoyong BIAN
  • About author:HU Qiren, born in 1995, M. S. candidate. His research interests include small target detection.
  • Supported by:
    This work is partially supported by National Natural Science Foundation of China(62372343)

多注意力对比学习的红外小目标检测

边小勇1,2,3(), 胡其仁1   

  1. 1.武汉科技大学 计算机科学与技术学院,武汉 430065
    2.武汉科技大学 大数据科学与工程研究院,武汉 430065
    3.智能信息处理与实时工业系统湖北省重点实验室(武汉科技大学),武汉 430065
  • 通讯作者: 边小勇
  • 作者简介:胡其仁(1995—),男,湖北仙桃人,硕士研究生,主要研究方向:小目标检测。
  • 基金资助:
    国家自然科学基金资助项目(62372343)

Abstract:

InfRared Small Target Detection (IRSTD) is a hotspot and suffers from difficulties in the field of target detection. IRSTD is difficult to learn accurate feature representation from limited and distorted information of small targets due to its characteristics of small pixels, low contrast and lacking texture, thus IRSTD methods still face many challenges. To address the above issue, a multi-attention contrastive learning based IRSTD method was proposed. Firstly, with U-Net adopted as the fundamental framework, a Context Mixer Block (CMB) that integrates Frequency Attention (FA) and spatial attention was proposed during the encoding phase to produce a preliminary attention feature map. Then, in the decoding phase, a Multi-Kernel Central Difference Convolution (MKCDC) was designed to extract the core information of small targets, which remained stable with different scales. Finally, by combining binary cross-entropy loss and contrastive loss functions, the small target detection network was trained, so that the feature representation ability of small targets was enhanced and a discriminative small target detection model was obtained. Experimental results show that the Probability of detection (Pd) of the proposed method on IRSTD-1k and NUAA-SIRST datasets reaches 96.63% and 100.00% respectively, which is improved by 4.71 and 1.90 percentage points, respectively, compared with Dense Nested Attention Network (DNA-Net). It can be seen that the proposed method improves the performance of IRSTD effectively.

Key words: deep learning, small target detection, attention U-Net, Central Difference Convolution (CDC), contrastive learning

摘要:

红外小目标检测(IRSTD)是目标检测领域中的研究热点和难点,具有像素小、对比度低和无纹理的特性,难以从小目标有限和扭曲的信息中学习正确的特征表示,因此IRSTD方法依然面临挑战。针对以上问题,提出多注意力对比学习的IRSTD方法。首先,采用U-Net为基本框架,在编码阶段提出一种融合频率注意力(FA)和空间注意力的上下文混合块(CMB),产生初级注意力特征图;其次,在解码阶段设计多核中心差分卷积(MKCDC),用于提取小目标在不同尺度下都稳定表征的核心信息;最后,联合二元交叉熵损失和对比损失函数训练小目标检测网络,提高小目标特征表示能力,得到富于判别的小目标检测模型。实验结果表明,在IRSTD-1k和NUAA-SIRST数据集上,所提方法的检测率(Pd)分别达到96.63%和100.00%,与密集嵌套的注意力网络(DNA-Net)相比,分别提高了4.71和1.90个百分点。可见,所提方法有效提高了IRSTD性能。

关键词: 深度学习, 小目标检测, 注意力U-Net, 中心差分卷积, 对比学习

CLC Number: