基于改进DETR算法的小目标检测方法

doi:10.11772/j.issn.1001-9081.2025030277

《计算机应用》唯一官方网站

• • 下一篇

基于改进DETR算法的小目标检测方法

吴俊,赵川

成都理工大学计算机与网络安全学院

收稿日期:2025-03-18 修回日期:2025-05-08 发布日期:2025-05-16 出版日期:2025-05-16
通讯作者: 赵川
作者简介:吴俊(2000—)，男，四川资阳人，硕士研究生，主要研究方向：计算机视觉；赵川(1967—)，女，四川成都人，副教授，博士，主要研究方向：计算机视觉、自然语言处理。
基金资助:
四川省科技创新项目（24PYXM1008）。

Small object detection method based on improved DETR algorithm

WU Jun, ZHAO Chuan

College of Computer and Network Security, Chengdu University of Technology

Received:2025-03-18 Revised:2025-05-08 Online:2025-05-16 Published:2025-05-16
About author:WU Jun, born in 2000, M. S. candidate. His research interests include computer vision. ZHAO Chuan, born in 1967, Ph. D., associate professor. Her research interests include computer vison, natural language processing.
Supported by:
Sichuan Provincial Science and Technology Innovation Project (24PYXM1008)

摘要/Abstract

摘要： 针对DETR（DEtection Transformer）在小目标检测方面精度较低的问题，提出了一种基于改进DETR算法的小目标检测方法。首先，针对骨干网络ResNet-50在小目标特征提取方面提取能力弱、效率低、易丢失细节等问题，提出了一种结合多尺度注意力机制的改进MetaFormer作为DETR的骨干网络，增强模型对小目标的表征能力。其次，针对Transformer注意力模块在处理图像特征映射时存在收敛慢、特征空间分辨率受限等问题，引入了可变形注意力解码器，使模型能够聚焦于参考点周围的关键采样区域，从而加快模型收敛并提升小目标检测精度。最后，针对GIoU损失函数无法衡量预测框质量的问题，引入了WIoU(Wise-IoU) v3损失函数，为不同质量的预测框赋予差异化的梯度增益，引导模型收敛到更高的精度。在COCO2017目标检测数据集上的实验结果表明，相较于DETR，所提方法对小目标的平均检测精度提升了7.6个百分点，整体的平均检测精度提升了4.7个百分点，表明所提方法具有更高的检测精度。

关键词: DETR, 小目标, 可变形注意力, 多尺度注意力, WIoU v3

Abstract: To address the problem of low accuracy of DETR(DEtection Transformer) in small object detection, an improved DETR for small object detection was proposed. Firstly, an improved MetaFormer combined with a multi-scale attention mechanism was adopted as the backbone network, aiming to solve the problems of weak extraction ability, low efficiency, and detail loss in small object feature extraction of ResNet-50, thereby enhancing the representation capability for small objects. Secondly, a deformable attention decoder was introduced to address the problems of slow convergence and limited feature space resolution in the Transformer attention module when processing image feature maps. This enabled the model to focus on key sampling regions around reference points, accelerating convergence and improving detection accuracy for small objects. Finally, the Wise-IoU (WIoU) v3 loss function was incorporated to overcome the limitation of the GIoU loss function in evaluating prediction box quality. Differentiated gradient gains were assigned to prediction boxes of varying quality, guiding the model to converge towards higher accuracy. Experimental results on the COCO2017 object detection dataset showed that, compared with DETR, the proposed method improved the average precision for small objects by 7.6 percentage points and the overall average precision by 4.7 percentage points, demonstrating superior detection performance.

Key words: DEtection Transformer (DETR), small object, deformable attention, multi-scale attention, WIoU (Wise-IoU) v3

中图分类号:

TP391

吴俊赵川. 基于改进DETR算法的小目标检测方法[J]. 计算机应用, DOI: 10.11772/j.issn.1001-9081.2025030277.

WU Jun, ZHAO Chuan. Small object detection method based on improved DETR algorithm[J]. Journal of Computer Applications, DOI: 10.11772/j.issn.1001-9081.2025030277.

[1]	刘皓宇, 孔鹏伟, 王耀力, 常青. 基于多视角信息的行人检测算法[J]. 《计算机应用》唯一官方网站, 2025, 45(7): 2325-2332.
[2]	范博淦, 王淑青, 陈开元. 基于改进YOLOv8的航拍无人机小目标检测模型[J]. 《计算机应用》唯一官方网站, 2025, 45(7): 2342-2350.
[3]	陈亮, 王璇, 雷坤. 复杂场景下跨层多尺度特征融合的安全帽佩戴检测算法[J]. 《计算机应用》唯一官方网站, 2025, 45(7): 2333-2341.
[4]	周得辉, 赵军, 程进峰. 基于RT-DETR的轴承表面微小缺陷检测算法[J]. 《计算机应用》唯一官方网站, 2025, 45(6): 1987-1997.
[5]	余松森, 林智凡, 薛国鹏, 徐建宇. 基于改进YOLOv8的轻量级大幅面瓷砖缺陷检测算法[J]. 《计算机应用》唯一官方网站, 2025, 45(2): 647-654.
[6]	何秋润, 胡节, 彭博, 李天源. 基于上下文信息的多尺度特征融合织物疵点检测算法[J]. 《计算机应用》唯一官方网站, 2025, 45(2): 640-646.
[7]	边小勇, 胡其仁. 多注意力对比学习的红外小目标检测[J]. 《计算机应用》唯一官方网站, 2025, 45(11): 3707-3712.
[8]	杨博然, 蔺素珍, 李大威, 禄晓飞, 崔晨辉. 基于信息补偿的红外弱小目标检测方法[J]. 《计算机应用》唯一官方网站, 2025, 45(1): 284-291.
[9]	刘赏, 周煜炜, 代娆, 董林芳, 刘猛. 融合注意力和上下文信息的遥感图像小目标检测算法[J]. 《计算机应用》唯一官方网站, 2025, 45(1): 292-300.
[10]	潘烨新, 杨哲. 基于多级特征双向融合的小目标检测优化模型[J]. 《计算机应用》唯一官方网站, 2024, 44(9): 2871-2877.
[11]	李烨恒, 罗光圣, 苏前敏. 基于改进YOLOv5的Logo检测算法[J]. 《计算机应用》唯一官方网站, 2024, 44(8): 2580-2587.
[12]	龙伍丹, 彭博, 胡节, 申颖, 丁丹妮. 基于加强特征提取的道路病害检测算法[J]. 《计算机应用》唯一官方网站, 2024, 44(7): 2264-2270.
[13]	姬张建, 杜娜. 基于改进VariFocalNet的微小目标检测[J]. 《计算机应用》唯一官方网站, 2024, 44(7): 2200-2207.
[14]	刘越, 刘芳, 武奥运, 柴秋月, 王天笑. 基于自注意力机制与图卷积的3D目标检测网络[J]. 《计算机应用》唯一官方网站, 2024, 44(6): 1972-1977.
[15]	崔晨辉, 蔺素珍, 李大威, 禄晓飞, 武杰. 基于孪生网络和Transformer的红外弱小目标跟踪方法[J]. 《计算机应用》唯一官方网站, 2024, 44(2): 563-571.

基于改进DETR算法的小目标检测方法

Small object detection method based on improved DETR algorithm

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics