Journal of Computer Applications ›› 2024, Vol. 44 ›› Issue (3): 745-751.DOI: 10.11772/j.issn.1001-9081.2023030315

• Artificial intelligence • Previous Articles     Next Articles

Few-shot object detection combining feature fusion and enhanced attention

Xinye LI1,2, Yening HOU1(), Yinghui KONG1,2, Zhiqi YAN1   

  1. 1.Department of Electronic & Communication Engineering,North China Electric Power University,Baoding Hebei 071003,China
    2.Hebei Key Laboratory of Power Internet of Things Technology (North China Electric Power University),Baoding Hebei 071003,China
  • Received:2023-03-29 Revised:2023-04-27 Accepted:2023-05-05 Online:2023-05-24 Published:2024-03-10
  • Contact: Yening HOU
  • About author:LI Xinye, born in 1969, Ph. D., associate professor. Her research interests include computer vision, deep learning, few-shot learning.
    KONG Yinghui, born in 1964, Ph. D., professor. Her research interests include computer vision, object detection.
    YAN Zhiqi, born in 1998, M. S. candidate. His research interests include few-shot object detection.
  • Supported by:
    Science and Technology Program of Hebei Province(SZX2020034)

结合特征融合与增强注意力的少样本目标检测

李新叶1,2, 侯晔凝1(), 孔英会1,2, 燕志旗1   

  1. 1.华北电力大学 电子与通信工程系,河北 保定 071003
    2.河北省电力物联网技术重点实验室(华北电力大学),河北 保定 071003
  • 通讯作者: 侯晔凝
  • 作者简介:李新叶(1969—),女,河北平山人,副教授,博士,主要研究方向:计算机视觉、深度学习、少样本学习
    孔英会(1964—),女,河北衡水人,教授,博士,主要研究方向:计算机视觉、目标检测
    燕志旗(1998—),男,山西吕梁人,硕士研究生,主要研究方向:少样本目标检测。
  • 基金资助:
    河北省科技计划项目(SZX2020034)

Abstract:

In order to fully utilize the key information in support features and query features, a few-shot object detection method based on feature fusion and enhanced attention was proposed, namely FFA-FSOD (Feature Fusion and enhanced Attention Few-Shot Object Detection). Firstly, the iterative Attention Feature Fusion (iAFF) module was introduced to effectively fuse the key features of the support image and the query image. Secondly, the feature enhancement operation was added after the iAFF module, which made full use of the support feature information to enhance the object features in the query image. To avoid the loss of part of the details of the query image after the above two operations, the Multi-Scale Channel Attention Module (MS-CAM) was improved in the iAFF module to capture more context information. Experimental results on MS COCO dataset under 2-way 10-shot condition show that compared with FSOD (Few-Shot Object Detection) method, after adding the iAFF module, feature enhancement operation and improving MS-CAM, FFA-FSOD has mean Average Precision (mAP) increased by 8.0%. Experimental results show that the proposed feature fusion enhancement method pays full attention to the details of features, thus achieving better detection effect of few-shot objects.

Key words: object detection, few-shot, feature fusion, feature enhancement, Faster R-CNN

摘要:

为了更充分地利用支持特征和查询特征中的关键信息,提出一种基于特征融合和增强注意力的少样本目标检测方法FFA-FSOD(Feature Fusion and enhanced Attention Few-Shot Object Detection)。首先引入迭代注意力特征融合(iAFF)模块,以有效融合支持图像和查询图像的关键特征;其次在iAFF模块后添加特征增强操作,充分利用支持特征信息对查询图像中的目标特征进行增强。为避免上述两次处理可能导致的查询图像特征部分细节信息的丢失,对iAFF模块中的多尺度通道注意力模块(MS-CAM)进行改进,以捕获更多的上下文信息。在MS COCO数据集上的实验结果表明,在2-way 10-shot条件下,与小样本目标检测(FSOD)方法相比,加入iAFF模块、特征增强操作并改进MS-CAM后,FFA-FSOD的平均精度均值(mAP)提升了8.0%。实验结果验证了所提特征融合增强方法充分关注到了特征中的细节信息,从而实现了更好的少样本目标检测效果。

关键词: 目标检测, 少样本, 特征融合, 特征增强, Faster R-CNN

CLC Number: