Lightweight infrared road scene detection model based on multiscale and weighted coordinate attention

doi:10.11772/j.issn.1001-9081.2023060775

Journal of Computer Applications (official website)

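Cheng Xiaohui (程小辉)¹, Huang Yuntian (黄云天)², Zhang Ruifang (张瑞芳)²

1. College of Information Science and Engineering, Guilin University of Technology, Guilin, Guangxi 541004
2. Guilin University of Technology

Received: 2023-06-19 Revised: 2023-09-12 Online: 2023-09-27 Published: 2023-09-27
Corresponding author: Huang Yuntian (黄云天)
Funding:
National Natural Science Foundation of China project; Guangxi Innovation-Driven Development Special Fund project; Guangxi Science and Technology Program Key Research and Development project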

Abstract

Abstract: To address the false and missed detections caused by occlusion and the lack of texture detail in infrared targets in road scenes, a lightweight infrared road scene detection model based on multi-scale and weighted coordinate attention, named MSC-YOLO, was proposed. YOLOv7-tiny (You Only Look Once) was taken as the baseline model. Firstly, the Pyramid Split Attention (PSA) module was introduced into different intermediate feature layers of the MobileNetV3 network to design a lightweight multi-scale feature extraction backbone (Multi-scale Mobile Network, MSM-Net), which solves the feature pollution caused by fixed-size convolution kernels and improves fine-grained feature extraction for targets of different scales. Secondly, Weighted Coordinate Attention (WCA) was integrated into the feature fusion network: the target position information obtained along the vertical and horizontal spatial directions of the intermediate feature maps was superimposed to enhance the fusion of target features across dimensions. Finally, the localization loss function was replaced with Efficient Intersection over Union (EIoU), which computes separate width and height terms for the predicted box and the ground-truth box and thereby accelerates convergence. In validation experiments on the FLIR dataset, compared with the YOLOv7-tiny model, the number of parameters was reduced by 67.3%, the number of floating-point operations by 54.6%, and the model size by 60.5%, while mean Average Precision at IoU=0.5 (mAP(IoU=0.5)) dropped by only 0.7 percentage points. The Frames Per Second (FPS) reached 101 on an RTX 2080 Ti, so the model balances detection performance against model size and meets the real-time detection requirements of infrared road scenes.
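The abstract names EIoU as the replacement localization loss but does not give its formula. As a rough illustration, the sketch below follows the commonly published EIoU definition (1 - IoU, plus a centre-distance term, plus separate width and height terms normalized by the smallest enclosing box). The function name `eiou_loss`, the `(x1, y1, x2, y2)` box format, and the `eps` stabilizer are assumptions made for this example; the authors' actual implementation may differ.

```python
def eiou_loss(pred, target, eps=1e-9):
    """EIoU loss for two axis-aligned boxes given as (x1, y1, x2, y2).

    Illustrative sketch only; box format and eps are assumptions.
    """
    px1, py1, px2, py2 = pred
    tx1, ty1, tx2, ty2 = target

    # Widths and heights of the predicted and ground-truth boxes.
    pw, ph = px2 - px1, py2 - py1
    tw, th = tx2 - tx1, ty2 - ty1

    # Intersection over union.
    iw = max(0.0, min(px2, tx2) - max(px1, tx1))
    ih = max(0.0, min(py2, ty2) - max(py1, ty1))
    inter = iw * ih
    union = pw * ph + tw * th - inter
    iou = inter / (union + eps)

    # Width and height of the smallest box enclosing both boxes.
    cw = max(px2, tx2) - min(px1, tx1)
    ch = max(py2, ty2) - min(py1, ty1)

    # Squared centre distance, normalized by the enclosing-box diagonal.
    dx = (px1 + px2) / 2 - (tx1 + tx2) / 2
    dy = (py1 + py2) / 2 - (ty1 + ty2) / 2
    center_term = (dx * dx + dy * dy) / (cw * cw + ch * ch + eps)

    # Width and height are penalized separately, which is what
    # distinguishes EIoU from aspect-ratio-based penalties.
    width_term = (pw - tw) ** 2 / (cw * cw + eps)
    height_term = (ph - th) ** 2 / (ch * ch + eps)

    return 1.0 - iou + center_term + width_term + height_term
```

Because the width and height gaps each contribute their own term, a box that matches the ground truth in height but not in width is still penalized directly on the mismatched side.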

Keywords: infrared road scene detection, multi-scale, weighted coordinate attention (WCA), lightweight, localization loss function
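The abstract does not specify how WCA weights the vertical and horizontal position information, so the authors' module cannot be reproduced from this page. The sketch below only illustrates the directional pooling idea behind ordinary coordinate attention on a single-channel map: average along each spatial axis, turn the two descriptors into gates, and reweight every position by both. The function names are hypothetical, and the learned convolutions of a real attention block are replaced here by an identity mapping.

```python
import math


def sigmoid(value):
    """Logistic function used to squash descriptors into (0, 1) gates."""
    return 1.0 / (1.0 + math.exp(-value))


def directional_gate(feature_map):
    """Reweight an H x W map using gates pooled along each spatial axis.

    Illustrative sketch only; this is not the paper's WCA module.
    """
    height = len(feature_map)
    width = len(feature_map[0])

    # Pool along the width: one descriptor per row (vertical position).
    row_desc = [sum(row) / width for row in feature_map]

    # Pool along the height: one descriptor per column (horizontal position).
    col_desc = [
        sum(feature_map[i][j] for i in range(height)) / height
        for j in range(width)
    ]

    # Assumption: learned transforms are omitted, so each descriptor
    # is passed straight to the sigmoid.
    row_gate = [sigmoid(v) for v in row_desc]
    col_gate = [sigmoid(v) for v in col_desc]

    # Each position is scaled by the gate of its row and of its column.
    return [
        [feature_map[i][j] * row_gate[i] * col_gate[j] for j in range(width)]
        for i in range(height)
    ]
```

Keeping the two pooled descriptors separate is what lets the gates retain position along one axis while summarizing the other.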

CLC number: TP391.41

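Cheng Xiaohui, Huang Yuntian, Zhang Ruifang. Lightweight infrared road scene detection model based on multiscale and weighted coordinate attention[J]. Journal of Computer Applications, DOI: 10.11772/j.issn.1001-9081.2023060775.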
