《计算机应用》唯一官方网站

• •    下一篇

基于改进区域生成网络和特征聚合 小样本目标检测方法

付可意1,王高才1,邬满2   

  1. 1. 广西大学
    2. 广西科学院
  • 收稿日期:2023-12-13 修回日期:2024-02-14 发布日期:2024-03-21 出版日期:2024-03-21
  • 通讯作者: 邬满
  • 基金资助:
    国家重点研发计划重点专项(北部湾陆海接力智慧渔场养殖装备与新模式,2022YFD2401200);广西科技重大专项(空天地一体协同重大灾害应急智慧服务平台研发与应用示范,桂科AA22068072);国家自然科学基金(62062007)

Few-shot object detection method based on improved region proposal network and feature aggregation

  • Received:2023-12-13 Revised:2024-02-14 Online:2024-03-21 Published:2024-03-21
  • Supported by:
    The National Key Research and Development Program (Beibu Gulf Land-Sea Relay Smart Fish Farm Breeding Equipment and New Model, 2022YFD2401200);The Guangxi Science and Technology Major Project (Aerospace-Ground Integrated Collaborative Major Disaster Emergency Smart Service Platform R&D and Application Demonstration, Guike AA22068072);The National Natural Science Foundation of China

摘要: 在现有的小样本目标检测中,区域生成网络(RPN)通常是在基类数据上进行训练以生成新类候选框,但新类数据相对基类而言较为稀缺,在引入时可能产生与目标物不同的复杂背景,会导致RPN将背景误认为前景,遗漏高交并比(IoU)值候选框。针对上述问题,提出一种基于改进RPN和特征聚合小样本目标检测(IFA-FSOD)方法。首先,基于RPN进行改进,通过在RPN中设计一个基于度量的非线性分类器,计算骨干网络提取的特征和新类之间的相似度,提高对新类候选框的召回率,从而筛选出高IoU候选框。其次,在感兴趣区域对齐(ROI Align)中引入基于注意力机制的特征聚合模块,通过设计不同尺度的网格,获取更全面的信息和特征表示,缓解因尺度不同引起的特征信息缺失。实验结果表明,相比QA-FewDet方法,IFA-FSOD在PASCAL-VOC数据集的新类上,Novel Set 3 10shot下nAP50提升了4.5个百分点;相比FsDetView方法,在10shot和30shot设置下,IFA-FSOD在COCO数据集的新类上mAP分别提升了0.2和0.8个百分点。IFA能有效提高在小样本情况下对目标类别的检测性能,解决高IoU值候选框遗漏和特征信息捕捉不全的问题。

关键词: 小样本目标检测, 基于度量, 区域生成网络, 非线性分类器, 特征聚合

Abstract: Abstract: In existing few-shot object detection, the Region Proposal Network (RPN) is usually trained on base class data to generate new class candidate boxes. However, new class data is relatively scarce compared to the base class. Introducing new classes may lead to the presence of different complex backgrounds, causing the RPN to misclassify background as foreground, resulting in the omission of high Intersection over Union (IoU) value candidate boxes. To address the above issues, a few-shot object detection method that based on improved region proposal network and feature aggregation was proposed, namely IFA-FSOD (Few-Shot Object Detection Based on Improved RPN and Feature Aggregation) . Firstly, an improvement was made based on the RPN by incorporating a metric-based non-linear classifier. This classifier was designed within the RPN to compute the similarity between features extracted by the backbone network and the features representing the new class. This enhancement aims to increase the recall rate for candidate boxes of the new class, ultimately filtering out candidate boxes with IoU. Then, a feature aggregation module based on the attention mechanism was introduced in the region of interest alignment (ROI Align). By designing grids of different scales, more comprehensive information and feature representation were obtained, and alleviate the lack of feature information caused by different scales. Experimental results show that compared with the QA-FewDet method, IFA-FSOD improves nAP50 by 4.5percentage points under Novel Set 3 10shot on the new class of PASCAL-VOC dataset; compared with the FsDetView method, under the settings of 10 shot and 30 shot, IFA-FSOD's mAP increased by 0.2 and 0.8 percentage points respectively on the new class of the COCO dataset. IFA can effectively improve the detection performance of target categories in the case of few-shot, and solve the problem of missing candidate boxes with high IoU values and incomplete feature information capture.

Key words: few-shot object detection, metric-based, Region Proposal Network(RPN), nonlinear classifier, feature aggregation

中图分类号: