Journal of Computer Applications ›› 2025, Vol. 45 ›› Issue (7): 2333-2341. DOI: 10.11772/j.issn.1001-9081.2024070999

• Multimedia Computing and Computer Simulation •


Helmet wearing detection algorithm for complex scenarios based on cross-layer multi-scale feature fusion

Liang CHEN1,2, Xuan WANG1, Kun LEI1

  1. School of Computer Science, Xi’an Polytechnic University, Xi’an Shaanxi 710600, China
    2. Shaanxi Key Laboratory of Clothing Intelligence (Xi’an Polytechnic University), Xi’an Shaanxi 710600, China
  • Received:2024-07-17 Revised:2024-09-26 Accepted:2024-10-09 Online:2025-07-10 Published:2025-07-10
  • Contact: Liang CHEN
  • About author: CHEN Liang, born in 1977 in Huaihua, Hunan, Ph. D., professor, CCF member. His research interests include artificial intelligence, cloud computing and big data, data analysis and visualization. E-mail: chenliang@xpu.edu.cn
    WANG Xuan, born in 2000 in Weinan, Shaanxi, M. S. candidate. Her research interests include object detection and image processing.
    LEI Kun, born in 2001 in Xi’an, Shaanxi, M. S. candidate. His research interests include artificial intelligence, data analysis and visualization.
  • Supported by:
    Key Scientific Research Program of Education Department of Shaanxi Province (22JS021)


Abstract:

To address missed and false detections of small objects in helmet wearing detection under construction scenarios, which are caused by crowded personnel, occlusion, and complex backgrounds, a cross-layer multi-scale helmet wearing detection algorithm with a double attention mechanism based on YOLOv8n was proposed. Firstly, a small object detection head was designed to enhance the model’s ability to detect small objects. Secondly, a double attention mechanism was embedded into the feature extraction network to focus more on capturing object features in complex scenarios. Thirdly, the feature fusion network was replaced with S-GFPN (Selective layer Generalized Feature Pyramid Network), a cross-layer multi-scale feature fusion structure improved from the Re-parameterized Generalized Feature Pyramid Network (RepGFPN), so as to fuse the small object feature layer with the other feature layers at multiple scales and establish long-range dependencies, thereby suppressing interference from background information. Finally, the MPDIoU (Intersection over Union with Minimum Point Distance) loss function was employed to address the insensitivity to scale variation. Experimental results on the public dataset GDUT-HWD show that, compared with YOLOv8n, the improved model increases mAP@0.5 by 3.4 percentage points and improves the detection precision for blue, yellow, white, and red helmets by 2.0, 1.1, 4.6, and 9.1 percentage points, respectively. Its visualized detection results in five types of complex scenarios, namely dense crowds, occlusion, small objects, reflection, and darkness, are also better than those of YOLOv8n, providing an effective method for helmet wearing detection in real-world construction scenarios.
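
As a concrete reference for the last step, a minimal sketch of an MPDIoU-style bounding-box regression loss is given below in PyTorch. It assumes boxes in (x1, y1, x2, y2) format and explicit image width/height arguments; the function name mpdiou_loss and its signature are illustrative assumptions, not the authors’ implementation.

```python
import torch

def mpdiou_loss(pred, target, img_w, img_h, eps=1e-7):
    """Sketch of an MPDIoU-style loss.

    pred, target: tensors of shape (N, 4) in (x1, y1, x2, y2) format.
    img_w, img_h: width and height of the input image, used to normalize
    the corner-distance penalties.
    """
    # Intersection area
    inter_x1 = torch.max(pred[:, 0], target[:, 0])
    inter_y1 = torch.max(pred[:, 1], target[:, 1])
    inter_x2 = torch.min(pred[:, 2], target[:, 2])
    inter_y2 = torch.min(pred[:, 3], target[:, 3])
    inter = (inter_x2 - inter_x1).clamp(0) * (inter_y2 - inter_y1).clamp(0)

    # Union area and plain IoU
    area_pred = (pred[:, 2] - pred[:, 0]) * (pred[:, 3] - pred[:, 1])
    area_target = (target[:, 2] - target[:, 0]) * (target[:, 3] - target[:, 1])
    union = area_pred + area_target - inter + eps
    iou = inter / union

    # Squared distances between the top-left and bottom-right corner pairs
    d1 = (pred[:, 0] - target[:, 0]) ** 2 + (pred[:, 1] - target[:, 1]) ** 2
    d2 = (pred[:, 2] - target[:, 2]) ** 2 + (pred[:, 3] - target[:, 3]) ** 2

    # MPDIoU penalizes corner misalignment relative to the image diagonal
    norm = img_w ** 2 + img_h ** 2
    mpdiou = iou - d1 / norm - d2 / norm
    return (1.0 - mpdiou).mean()
```

In this formulation the two corner-distance penalties vanish as the predicted and ground-truth corners align, so the loss reduces to the usual 1 − IoU term for well-aligned boxes while still providing a scale-aware gradient for misaligned ones.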

Key words: complex scenario, object detection, small object, multi-scale feature fusion, YOLOv8 (You Only Look Once v8)

CLC number: