Journal of Computer Applications ›› 2026, Vol. 46 ›› Issue (1): 233-241.DOI: 10.11772/j.issn.1001-9081.2025020142

• Multimedia computing and computer simulation • Previous Articles     Next Articles

Agent prototype distillation algorithm for few-shot object detection

Binhong XIE, Rui WANG(), Rui ZHANG, Yingjun ZHANG   

  1. College of Computer Science and Technology,Taiyuan University of Science and Technology,Taiyuan Shanxi 030024,China
  • Received:2025-02-14 Revised:2025-03-18 Accepted:2025-03-24 Online:2026-01-10 Published:2026-01-10
  • Contact: Rui WANG
  • About author:XIE Binhong, born in 1971, M. S., professor. His research interests include intelligent software engineering, machine learning.
    ZHANG Rui, born in 1987, Ph. D., professor. His research interests include intelligent information processing.
    ZHANG Yingjun, born in 1969, M. S., professor-level senior engineer. His research interests include intelligent perception and decision-making.
  • Supported by:
    Fundamental Research Program of Shanxi Province(20210302123216);Key Research and Development Program for the Introduction of High-Level Scientific and Technological Talents in Lvliang City(2022RC08);Shanxi Province Industry-Education Integration Postgraduate Joint Training Demonstration Base Project(2022JD11)

代理原型蒸馏的小样本目标检测算法

谢斌红, 王瑞(), 张睿, 张英俊   

  1. 太原科技大学 计算机科学与技术学院,太原 030024
  • 通讯作者: 王瑞
  • 作者简介:谢斌红(1971—),男,山西太原人,教授,硕士,CCF会员,主要研究方向:智能化软件工程、机器学习
    张睿(1987—),男,山西太原人,教授,博士,主要研究方向:智能信息处理
    张英俊(1969—),男,山西太原人,教授级高级工程师,硕士,主要研究方向:智能感知决策。
  • 基金资助:
    山西省基础研究计划项目(20210302123216);吕梁市引进高层次科技人才重点研发项目(2022RC08);山西省产教融合研究生联合培养示范基地项目(2022JD11)

Abstract:

The existing Few-Shot Object Detection (FSOD) algorithms are constrained by insufficient class-level prototype generation accuracy and loss of detail information, which limit the feature representation capability in target regions. To address this issue, an Agent Prototype Aggregation (APA) based FSOD algorithm named APA-FSOD was proposed. In the algorithm, the support features were distilled into detail-rich prototypes through agent attention, and the prototype vectors were assigned to the query feature map accurately based on their correlations, thereby enhancing the feature representation capability in target instance regions significantly. Additionally, a Wavelet Convolution Enhancement Module (WCEM) and an Adaptive Multi-Relation Fusion (AMRF) module were designed to optimize global feature extraction and advanced feature fusion of the algorithm, respectively. Experimental results demonstrate that on three novel class splits of the PASCAL VOC dataset, the nAP50 of APA-FSOD is improved by 0.5 to 1.1 percentage points compared to that of the baseline method VFA (Variational Feature Aggregation); under the 30-shot setting of the MS COCO dataset, nAP of APA-FSOD is increased by 1.0 percentage point compared to that of the meta-learning method SMPCCNet (Support-query Mutual Promotion and Classification Correction Network). It can be seen that the proposed algorithm achieves significant accuracy improvement in FSOD.

Key words: Few-Shot Object Detection (FSOD), meta-learning, agent prototype distillation, wavelet convolution, multi-relation fusion

摘要:

针对现有小样本目标检测(FSOD)算法中类级原型生成精度不足以及细节信息缺失导致的目标区域特征表达能力受限的问题,提出一种基于代理原型聚合(APA)的FSOD算法APA-FSOD。该算法通过代理注意力将支持特征蒸馏为细节丰富的原型,并基于原型向量的相关性实现原型向量在查询特征图上的精准分配,从而显著强化目标实例区域的特征表达能力。此外,设计小波卷积增强模块(WCEM)和自适应多关系融合模块(AMRF),并分别用于优化算法的全局特征提取和高级特征融合。实验结果表明,在PASCAL VOC数据集的3种新类划分下, APA-FSOD的nAP50相较于基线方法VFA (Variational Feature Aggregation)提升了0.5~1.1个百分点;而在MS COCO数据集的30-shot设置下,与元学习方法SMPCCNet (Support-query Mutual Promotion and Classification Correction Network)相比, nAP提升了1.0个百分点。可见,所提算法显著提高了FSOD的精度。

关键词: 小样本目标检测, 元学习, 代理原型蒸馏, 小波卷积, 多关系融合

CLC Number: