《计算机应用》唯一官方网站 ›› 2023, Vol. 43 ›› Issue (5): 1557-1564.DOI: 10.11772/j.issn.1001-9081.2022040554

• 多媒体计算与计算机仿真 • 上一篇    

基于注意力机制和上下文信息的目标检测算法

刘辉1,2, 张琳玉1,2(), 王复港1,2, 何如瑾1,2   

  1. 1.重庆邮电大学 通信与信息工程学院,重庆 400065
    2.重庆邮电大学 数智化通信新技术应用研究中心,重庆 400065
  • 收稿日期:2022-04-19 修回日期:2022-06-20 接受日期:2022-06-22 发布日期:2022-07-11 出版日期:2023-05-10
  • 通讯作者: 张琳玉
  • 作者简介:刘辉(1966—),男,四川仪陇人,高级工程师,硕士,主要研究方向:计算机视觉、通信网络新技术、电信系统业务
    张琳玉(1997—),女,河北石家庄人,硕士研究生,主要研究方向:目标检测 1075634172@qq.com
    王复港(1997—),男,山东泰安人,硕士研究生,主要研究方向:目标检测
    何如瑾(1998—),女,湖南邵阳人,硕士研究生,主要研究方向:异常行为识别。

Object detection algorithm based on attention mechanism and context information

Hui LIU1,2, Linyu ZHANG1,2(), Fugang WANG1,2, Rujin HE1,2   

  1. 1.School of Communication and Information Engineering,Chongqing University of Posts and Telecommunications,Chongqing 400065,China
    2.Digital Intelligence Communication New Technology Application Research Center,Chongqing University of Posts and Telecommunications,Chongqing 400065,China
  • Received:2022-04-19 Revised:2022-06-20 Accepted:2022-06-22 Online:2022-07-11 Published:2023-05-10
  • Contact: Linyu ZHANG
  • About author:LIU Hui, born in 1966, M. S., senior engineer. His research interests include computer vision, new technology of communication network, telecommunication system service.
    ZHANG Linyu, born in 1997, M. S. candidate. Her research interests include object detection.
    WANG Fugang, born in 1997, M. S. candidate. His research interests include object detection.
    HE Rujin, born in 1998, M. S. candidate. Her research interests include abnormal behavior detection.

摘要:

针对目标检测过程中存在的小目标漏检问题,提出一种基于注意力机制和多尺度上下文信息的改进YOLOv5目标检测算法。首先,在特征提取结构中加入多尺度空洞可分离卷积模块(MDSCM)以提取多尺度特征信息,在增大感受野的同时避免小目标信息的丢失;其次,在主干网络中添加注意力机制,并在通道信息中嵌入位置感知信息,进一步增强算法的特征表达能力;最后,使用Soft-NMS(Soft-Non-Maximum Suppression)代替YOLOv5使用的非极大值抑制(NMS),降低检测算法的漏检率。实验结果表明,改进算法在PASCAL VOC数据集、DOTA航拍数据集和DIOR光学遥感数据集上的检测精度分别达到了82.80%、71.74%和77.11%,相较于YOLOv5,分别提高了3.70、1.49和2.48个百分点;而且它对图像中小目标的检测效果更好。因此,改进的YOLOv5可以更好地应用到小目标检测场景中。

关键词: 目标检测, 深度可分离卷积, 空洞卷积, 注意力机制, 非极大值抑制

Abstract:

Aiming at the problem of small object miss detection in object detection process, an improved YOLOv5 (You Only Look Once) object detection algorithm based on attention mechanism and multi-scale context information was proposed. Firstly, Multiscale Dilated Separable Convolutional Module (MDSCM) was added to the feature extraction structure to extract multi-scale feature information, increasing the receptive field while avoiding the loss of small object information. Secondly, the attention mechanism was added to the backbone network, and the location awareness information was embedded in the channel information, so as to further enhance the feature expression ability of the algorithm. Finally, Soft-NMS (Soft-Non-Maximum Suppression) was used instead of the NMS (Non-Maximum Suppression) used by YOLOv5 to reduce the missed detection rate of the algorithm. Experimental results show that the improved algorithm achieves detection precisions of 82.80%, 71.74% and 77.11% respectively on PASCAL VOC dataset, DOTA aerial image dataset and DIOR optical remote sensing dataset, which are 3.70, 1.49 and 2.48 percentage points higer than those of YOLOv5, and it has better detection effect on small objects. Therefore, the improved YOLOv5 can be better applied to small object detection scenarios in practice.

Key words: object detection, depthwise separable convolution, dilated convolution, attention mechanism, Non-Maximum Suppression (NMS)

中图分类号: