《计算机应用》唯一官方网站 ›› 2024, Vol. 44 ›› Issue (12): 3790-3797.DOI: 10.11772/j.issn.1001-9081.2023121731

• 人工智能 • 上一篇    下一篇

基于改进区域提议网络和特征聚合小样本目标检测方法

付可意1, 王高才1, 邬满1,2,3()   

  1. 1.广西大学 计算机与电子信息学院,南宁 530004
    2.广西近海海洋环境科学重点实验室(广西科学院),南宁 530007
    3.广西壮族自治区北部湾碳汇与低碳工程研究中心(广西科学院),南宁 530007
  • 收稿日期:2023-12-18 修回日期:2024-02-14 接受日期:2024-02-28 发布日期:2024-03-21 出版日期:2024-12-10
  • 通讯作者: 邬满
  • 作者简介:付可意(2000—),女,湖南衡阳人,硕士研究生,主要研究方向:小样本目标检测、小样本学习
    王高才(1976—),男,广西桂林人,教授,博士,CCF会员,主要研究方向:计算机网络、系统性能评价、随机算法;
  • 基金资助:
    国家重点研发计划重点专项(2022YFD2401200);广西科技重大专项(桂科AA22068072)

Few-shot object detection method based on improved region proposal network and feature aggregation

Keyi FU1, Gaocai WANG1, Man WU1,2,3()   

  1. 1.School of Computer,Electronic and Information,Guangxi University,Nanning Guangxi 530004,China
    2.Guangxi Key Laboratory of Marine Environmental Science (Guangxi Academy of Sciences),Nanning Guangxi 530007,China
    3.Research Center for Carbon Sink and Low?Carbon Engineering in the Beibu Gulf of Guangxi (Guangxi Academy of Sciences),Nanning Guangxi 530007,China
  • Received:2023-12-18 Revised:2024-02-14 Accepted:2024-02-28 Online:2024-03-21 Published:2024-12-10
  • Contact: Man WU
  • About author:FU Keyi, born in 2000, M. S. candidate. Her research interests include few-shot object detection, few-shot learning.
    WANG Gaocai, born in 1976, Ph. D., processor. His research interests include computer network, system performance evaluation, randomized algorithm.
  • Supported by:
    Key Project of National Key Research and Development Program of China(2022YFD2401200);Guangxi Science and Technology Major Project(Guike AA22068072)

摘要:

在现有的小样本目标检测中,区域提议网络(RPN)通常是在基类数据上训练以生成新类候选框;然而新类数据相较于基类更稀缺,在引入时可能产生与目标物不同的复杂背景,导致RPN将背景误认为前景,遗漏高交并比(IoU)值候选框。针对上述问题,提出一种基于改进RPN和特征聚合小样本目标检测方法(IFA-FSOD)。首先,基于RPN进行改进,即通过在RPN中设计一个基于度量的非线性分类器,计算骨干网络提取的特征和新类特征之间的相似度,以提高对新类候选框的召回率,从而筛选高IoU候选框;其次,在感兴趣区域对齐(RoI Align)中引入基于注意力机制的特征聚合模块(FAM),并通过设计不同尺度的网格,获取更全面的信息和特征表示,从而缓解因尺度不同引起的特征信息缺失。实验结果表明,相较于QA-FewDet(Query Adaptive Few-shot object Detection)方法,IFA-FSOD方法在PASCAL VOC数据集的新类上的Novel Set 3中的10-shot下的新类别平均精度(50% IoU)(nAP50)提升了4.5个百分点;相较于FsDetView(Few-shot object Detection and Viewpoint estimation)方法,在10-shot和30-shot设置下,IFA-FSOD方法在COCO数据集的新类上的平均精度均值(mAP)分别提升了0.2和0.8个百分点。可见改进RPN和特征聚合(IFA)能有效提高在小样本情况下对目标类别的检测性能,并解决高IoU值候选框遗漏和特征信息捕捉不全的问题。

关键词: 小样本目标检测, 基于度量, 区域提议网络, 非线性分类器, 特征聚合

Abstract:

In the existing few-shot object detection, Region Proposal Network (RPN) is usually trained on base class data to generate new class anchor boxes. However, new class data are more sparse compared to the base class. Introducing new class data may lead to the presence of complex backgrounds different to the objects, causing RPN to misclassify the background as foreground, resulting in the omission of high Intersection over Union (IoU) value anchor boxes. To address the above issues, a Few-Shot Object Detection method based on Improved RPN and Feature Aggregation (IFA-FSOD) was proposed. Firstly, an improvement was made on the basis of RPN by incorporating a metric-based non-linear classifier within RPN. This classifier was designed to compute the similarity between features extracted by the backbone network and the features representing the new class, so as to increase the recall for anchor boxes of the new class, thereby filtering out high IoU value anchor boxes. Then, a Feature Aggregation Module (FAM) based on attention mechanism was introduced in Region of Interest Alignment (RoI Align). And by designing grids of different scales, more comprehensive information and feature representation were obtained, which alleviated the lack of feature information caused by different scales. Experimental results show that compared with QA-FewDet (Query Adaptive Few-shot object Detection) method, IFA-FSOD method improves nAP50(Novel Average Precision at 50% IoU) by 4.5 percentage points under Novel Set 3's 10-shot on the new class of PASCAL VOC dataset; compared with FsDetView (Few-shot object Detection and Viewpoint estimation) method, under the settings of 10-shot and 30-shot, IFA-FSOD method has mean Average Precision (mAP) increased by 0.2 and 0.8 percentage points, respectively, on the new class of COCO dataset. It can be seen that Improved RPN and Feature Aggregation (IFA) can improve the detection performance of object classes in the case of few-shot effectively, and solve the problem of missing high IoU value anchor boxes and incomplete feature information capture.

Key words: Few-Shot Object Detection (FSOD), metric-based, Region Proposal Network (RPN), non-linear classifier, feature aggregation

中图分类号: