Few-shot object detection algorithm based on new class feature enhancement and metric mechanism

doi:10.11772/j.issn.1001-9081.2024081146

Journal of Computer Applications

Received:2024-08-14 Revised:2024-09-28 Online:2024-11-07 Published:2024-11-07
Supported by:
National Natural Science Foundation of China

结合新类特征增强与度量机制的小样本目标检测算法

张嘉祥¹,李晓明²,张佳慧²

1. 太原科技大学
2. 山西省太原市万柏林区太原科技大学计算机科学与技术学院

通讯作者: 李晓明
基金资助:
国家自然科学基金项目

Abstract

Abstract: Abstract: To address the issue of low sensitivity of existing few-shot object detection models to the feature parameters of new categories, which leads to unclear feature boundaries and category confusion, we propose a few-shot object detection algorithm based on new category feature enhancement and metric mechanisms. Firstly, a Cross-Domain Parameter Perception Module (CDPM) is proposed to improve the neck network, reconstructing the re-weighting operation of channel and spatial features. It combines dilated convolution with cross-stage information transfer and feature fusion to provide rich gradient information guidance and enhance the sensitivity of new category parameters. Meanwhile, a Integrated Correlated Multi-Feature Module (ICMF) is constructed before ROI Pooling to establish the correlation between features and dynamically optimize the fusion method of relevant features, enhancing significant features. The introduction of CDPM and ICMF effectively enhances the feature representation of new categories to alleviate feature boundary ambiguity. Additionally, to further reduce category confusion, an orthogonal loss function based on metric mechanisms (CohSep Loss) is proposed in the detection head to achieve intra-class feature aggregation and inter-class feature separation by measuring feature vector similarity. On the PASCAL VOC dataset, the proposed algorithm improves the average nAP50 by 5.3% compared to the baseline TFA. On the COCO dataset, the nAP for 10-shot and 30-shot settings is improved by 3.6% and 6.2%, respectively. Experimental results demonstrate that the proposed model achieves higher accuracy in few-shot object detection.

Key words: Keywords: transfer learning, few-shot object detection, feature enhancement, category confusion, orthogonal loss

摘要： 摘要: 针对现有小样本目标检测算法中模型对新类别的特征参数敏感度较低，难以准确区分类别相关和类别无关参数，导致特征边界模糊，造成类别混淆的问题，提出了一种结合新类特征增强与度量机制的小样本目标检测算法。首先，提出跨域参数感知模块(CDPM)改进颈部网络，重构通道和空间的特征重加权操作，结合空洞卷积采用跨阶段的信息传递与特征融合方式，提供丰富的梯度信息导向并提升新类别参数敏感性。同时，在ROI Pooling前构造多元相关特征融合模块(ICMF)，建立特征之间的相关性并动态优化相关特征的融合方式，增强显著特征。CDPM与ICMF的引入有效增强了新类别的特征表示以减轻特征边界模糊。此外，为进一步减轻类别混淆，在检测头部分提出基于度量机制的正交损失函数(CohSep Loss)，通过度量特征向量相似度，实现类内特征聚合，类间特征分离。在PASCAL VOC数据集上，本算法相比基线TFA在nAP50的平均值上提升了5.3%；在COCO数据集中，10shot和30shot对应的nAP分别提升了3.6%和6.2%。实验结果表明，所提出的模型实现了更高精度的小样本目标检测。

关键词: 关键词: 迁移学习, 小样本目标检测, 特征增强, 类别混淆, 正交损失

CLC Number:

TP391.4

张嘉祥李晓明张佳慧. 结合新类特征增强与度量机制的小样本目标检测算法[J]. 《计算机应用》唯一官方网站, DOI: 10.11772/j.issn.1001-9081.2024081146.

[1]	Weichao DANG, Yinghao FAN, Gaimei GAO, Chunxia LIU. Weakly supervised action localization based on temporal and global contextual feature enhancement [J]. Journal of Computer Applications, 2025, 45(3): 963-971.
[2]	Benchen YANG, Haoran LI, Haibo JIN. Multi-focus image fusion network with cascade fusion and enhanced reconstruction [J]. Journal of Computer Applications, 2025, 45(2): 594-600.
[3]	Binhong XIE, Wanyin GAO, Wangdong LU, Yingjun ZHANG, Rui ZHANG. Dense object counting network with few-shot similarity matching feature enhancement [J]. Journal of Computer Applications, 2025, 45(2): 403-410.
[4]	Hongtian LI, Xinhao SHI, Weiguo PAN, Cheng XU, Bingxin XU, Jiazheng YUAN. Few-shot object detection via fusing multi-scale and attention mechanism [J]. Journal of Computer Applications, 2024, 44(5): 1437-1444.
[5]	Jie WANG, Hua MENG. Image classification algorithm based on overall topological structure of point cloud [J]. Journal of Computer Applications, 2024, 44(4): 1107-1113.
[6]	Xinye LI, Yening HOU, Yinghui KONG, Zhiqi YAN. Few-shot object detection combining feature fusion and enhanced attention [J]. Journal of Computer Applications, 2024, 44(3): 745-751.
[7]	Keyi FU, Gaocai WANG, Man WU. Few-shot object detection method based on improved region proposal network and feature aggregation [J]. Journal of Computer Applications, 2024, 44(12): 3790-3797.
[8]	Jia CHEN, Hong ZHANG. Image text retrieval method based on feature enhancement and semantic correlation matching [J]. Journal of Computer Applications, 2024, 44(1): 16-23.
[9]	Guolong YUAN, Yujin ZHANG, Yang LIU. Image tampering forensics network based on residual feedback and self-attention [J]. Journal of Computer Applications, 2023, 43(9): 2925-2931.
[10]	Bin LU, Jielin LIU. Semantic segmentation for 3D point clouds based on feature enhancement [J]. Journal of Computer Applications, 2023, 43(6): 1818-1825.
[11]	Fang LUO, Yang LIU, G. T. S HO. Multi-scale ship detection based on adaptive feature fusion in complex scenes [J]. Journal of Computer Applications, 2023, 43(11): 3587-3593.
[12]	Xiangyue TAN, Xiao HU, Jiaxin YANG, Junjiang XIANG. Camouflaged object detection based on progressive feature enhancement aggregation [J]. Journal of Computer Applications, 2022, 42(7): 2192-2200.
[13]	Runchao LIN, Rong HUANG, Aihua DONG. Few-shot object detection based on attention mechanism and secondary reweighting of meta-features [J]. Journal of Computer Applications, 2022, 42(10): 3025-3032.
[14]	Jing WEN, Qiang LI. Object tracking algorithm based on spatio-temporal context information enhancement [J]. Journal of Computer Applications, 2021, 41(12): 3565-3570.