Tunnel foreign object detection algorithm based on improved YOLOv8n

doi:10.11772/j.issn.1001-9081.2024020225

Journal of Computer Applications ›› 2025, Vol. 45 ›› Issue (2): 655-661.DOI: 10.11772/j.issn.1001-9081.2024020225

• Multimedia computing and computer simulation • Previous Articles

Tunnel foreign object detection algorithm based on improved YOLOv8n

Jiayang GUI¹, Shunji WANG¹, Zhengkang ZHOU², Jiashan TANG¹()

^1.College of Science，Nanjing University of Posts and Telecommunications，Nanjing Jiangsu 210023，China
^2.Nanjing Urban Construction Tunnel and Bridge Intelligent Management Company Limited，Nanjing Jiangsu 211800，China

Received:2024-03-04 Revised:2024-04-09 Accepted:2024-04-15 Online:2024-06-04 Published:2025-02-10
Contact: Jiashan TANG
About author:GUI Jiayang， born in 1998， M. S. candidate. Her research interests include computer vision， object detection.
WANG Shunji， born in 1996， Ph. D. candidate. His research interests include pattern recognition， structural equation model.
ZHOU Zhengkang， born in 1968， M. S.， professor. His research interests include smart city management， data science.
Supported by:
Horizontal Research Project of Nanjing University of Posts and Telecommunications(2023W221)

基于改进YOLOv8n的隧道内异物检测算法

桂佳扬¹, 王顺吉¹, 周正康², 唐加山¹()

^1.南京邮电大学理学院，南京 210023
^2.南京城建隧桥智慧管理有限公司，南京 211800

通讯作者: 唐加山
作者简介:桂佳扬（1998—），女，河南平顶山人，硕士研究生，主要研究方向：计算机视觉、目标检测
王顺吉（1996—），男，江苏宿迁人，博士研究生，主要研究方向：模式识别、结构方程模型
周正康（1968—），男，安徽池州人，教授，硕士，主要研究方向：智慧城市管理、数据科学；
基金资助:
南京邮电大学横向科研项目（2023外221）

Abstract

Abstract:

In order to address the problems of high labor costs and low efficiency in manual inspection for tunnel foreign object detection， a tunnel foreign object detection algorithm based on improved YOLOv8n was proposed. Firstly， C2f_CA module was proposed with the incorporation of Coordinate Attention （CA） mechanism. In the module， by embedding positional information into channel attention， the network’s focus on the spatial distribution of features in the image was enhanced， thereby improving feature extraction capability of the network. Secondly， inspired by the concept of high-resolution network， a new feature fusion module HRNet_Fusion （High Resolution Net） was proposed to take extracted feature maps with different resolutions as four parallel branches and input them into the network， and multiple up-sampling， down-sampling， and fusion operations were performed to obtain comprehensive and accurate feature information. The above enhanced performance in small target detection and feature fusion significantly. Finally， the WIoU （Wise-IoU） loss function was introduced to reduce the harmful gradient effects of low-quality samples on the network， further improving model detection accuracy. Experimental results on a tunnel foreign object detection dataset indicate that the improved algorithm achieves mean Average Precision （mAP@0.5） of 79.9%， with a model size of 6.0 MB. Compared to YOLOv8n， the proposed algorithm has the mAP@0.5 enhanced by 6 percentage points， while the model size decreased by 0.2 MB， and the model parameters reduced by 0.379×10⁶.

Key words: object detection, foreign object detection, YOLOv8n, Coordinate Attention (CA) mechanism, high resolution net, WIoU (Wise-IoU) loss function

摘要：

针对当前隧道内异物检测存在人工巡检成本高、效率低等问题，提出一种基于改进YOLOv8n的隧道内异物检测算法。首先，提出融入坐标注意力（CA）机制的C2f_CA模块，通过将位置信息嵌入通道注意力，增强网络对图像在空间上的特征分布的关注，从而增强网络的特征提取能力；其次，借鉴高分辨率网络的思想，提出新的特征融合模块HRNet_Fusion（High Resolution Net）将提取的不同分辨率特征图作为4个并行分支输入网络，并经过多次上、下采样和融合操作得到全面且准确的特征信息，从而显著提升在小目标检测和特征信息融合方面的性能；最后，引入WIoU（Wise-IoU）损失函数降低低质量样本对网络的不良梯度影响，进一步提高模型的检测精度。实验结果表明，在隧道异物数据集上，改进算法的平均精度均值（mAP@0.5）为79.9%，模型大小为6.0 MB，与YOLOv8n算法相比，mAP@0.5提升了6个百分点，模型大小减少了0.2 MB，模型参数量减少了0.379×10⁶。

关键词: 目标检测, 异物检测, YOLOv8n, 坐标注意力机制, 高分辨率网络, WIoU损失函数

CLC Number:

TP391.4

Jiayang GUI, Shunji WANG, Zhengkang ZHOU, Jiashan TANG. Tunnel foreign object detection algorithm based on improved YOLOv8n[J]. Journal of Computer Applications, 2025, 45(2): 655-661.

桂佳扬, 王顺吉, 周正康, 唐加山. 基于改进YOLOv8n的隧道内异物检测算法[J]. 《计算机应用》唯一官方网站, 2025, 45(2): 655-661.

Figures/Tables 14

References 21

1	肖添文，徐永能，徐欣怡. 城市轨道交通隧道异物侵入检测与控制方法［J］. 电气技术， 2019， 20（S1）： 48-52， 56.
	XIAO T W， XU Y N， XU X Y. Methods for detecting and controlling foreign body invasion in urban rail transit tunnels［J］. Electrical Engineering， 2019， 20（S1）： 48-52， 56.
2	宋晓凤. 基于结构光测量技术的铁路隧道口异物检测方法研究［D］. 北京：北京交通大学， 2020.
	SONG X F. Study on the railway tunnel entrance obstacle detection method based on structured light measurement technology［D］. Beijing： Beijing Jiaotong University， 2020.
3	陈锴迪. 隧道线路异物检测系统研究［D］. 北京：北京交通大学， 2020.
	CHEN K D. Research on foreign body detection system in tunnel line［D］. Beijing： Beijing Jiaotong University， 2020.
4	GIRSHICK R， DONAHUE J， DARRELL T， et al. Rich feature hierarchies for accurate object detection and semantic segmentation［C］// Proceedings of the 2014 IEEE Conference on Computer Vision. Piscataway： IEEE， 2014： 580-587.
5	LIU W， ANGUELOV D， ERHAN D， et al. SSD： single shot multibox detector［C］// Proceedings of the 2016 European Conference on Computer Vision， LNCS 9905. Cham： Springer， 2016： 21-37.
6	REDMON J， DIVVALA S， GIRSHICK R， et al. You only look once： unified， real-time object detection［C］// Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2016： 779-788.
7	REDMON J， FARHADI A. YOLOv3： an incremental improvement［EB/OL］. ［2023-11-29］..
8	BOCHKOVSKIY A， WANG C Y， LIAO H Y M. YOLOv4： optimal speed and accuracy of object detection［EB/OL］. ［2023-12-09］..
9	LI C Y， LI L， JIANG H L， et al. YOLOv6： a single-stage object detection framework for industrial applications［EB/OL］. ［2023-12-09］..
10	WANG C Y， BOCHKOVSKIY A， LIAO H Y M. YOLOv7： trainable bag-of-freebies sets new state-of-the-art for real-time object detectors［C］// Proceedings of the 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2023： 7464-7475.
11	HOU Q， ZHOU D， FENG J. Coordinate attention for efficient mobile network design［C］// Proceedings of the 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2021： 13708-13717.
12	HE K， ZHANG X， REN S， et al. Deep residual learning for image recognition［C］// Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2016： 770-778.
13	TONG Z， CHEN Y， XU Z， et al. Wise-IoU： bounding box regression loss with dynamic focusing mechanism［EB/OL］. ［2023-09-10］..
14	HE K， ZHANG X， REN S， et al. Spatial pyramid pooling in deep convolutional networks for visual recognition［J］. IEEE Transactions on Pattern Analysis and Machine Intelligence， 2015， 37（9）： 1904-1916.
15	LIN T Y， DOLLÁR P， GIRSHICK R， et al. Feature pyramid networks for object detection［C］// Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2017： 936-944.
16	LIU S， QI L， QIN H， et al. Path aggregation network for instance segmentation［C］// Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2018： 8759-8768.
17	李文举，张干，崔柳，等. 基于坐标注意力的轻量级交通标志识别模型［J］. 计算机应用， 2023， 43（2）： 608-614.
	LI W J， ZHANG G， CUI L， et al. Lightweight traffic sign recognition model based on coordinate attention［J］. Journal of Computer Applications， 2023， 43（2）： 608-614.
18	KOBYLINSKI P， WIERZBOWSKI M， PIOTROWSKI K. High-resolution net load forecasting for micro-neighbourhoods with high penetration of renewable energy sources［J］. International Journal of Electrical Power and Energy Systems， 2020， 117： No.105635.
19	ZHENG Z， WANG P， REN D， et al. Enhancing geometric factors in model learning and inference for object detection and instance segmentation［J］. IEEE Transactions on Cybernetics， 2022， 52（8）： 8574-8586.
20	WOO S， PARK J， LEE J Y， et al. CBAM： convolutional block attention module［C］// Proceedings of the 2018 European Conference on Computer Vision， LNCS 11211. Cham： Springer， 2018：3-19.
21	HU J， SHEN L， SUN G. Squeeze-and-excitation networks［C］// Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2018： 7132-7141.

类别	数据描述
动物类	隧道内出现的动物，主要包括猫和狗等
抛洒垃圾类	隧道内散落的垃圾，如袋状和包裹状物等
道路安全设施类	隧道内歪倒的路面安全设施，例如锥桶、防撞桶和轮胎等

类别	数据描述
动物类	隧道内出现的动物，主要包括猫和狗等
抛洒垃圾类	隧道内散落的垃圾，如袋状和包裹状物等
道路安全设施类	隧道内歪倒的路面安全设施，例如锥桶、防撞桶和轮胎等

类别	训练集	测试集	共计
共计	2 710	725	3 435
抛洒垃圾类	888	281	1 122
动物类	902	220	1 169
道路安全设施类	920	224	1 144

类别	训练集	测试集	共计
共计	2 710	725	3 435
抛洒垃圾类	888	281	1 122
动物类	902	220	1 169
道路安全设施类	920	224	1 144

注意力机制	P/%	R/%	mAP@0.5/%	参数量/10⁶	模型大小/MB
基线模型	82.1	67.5	73.9	3.006	6.2
+C2f_CBAM	82.5	64.5	72.5	3.034	6.3
+C2f_SE	84.4	68.0	70.9	3.009	6.3
+C2f_CA	82.8	72.5	76.8	3.015	6.3

Tunnel foreign object detection algorithm based on improved YOLOv8n

基于改进YOLOv8n的隧道内异物检测算法

RichHTML

PDF

Knowledge

Abstract

Cite this article

share this article

Figures/Tables 14

References 21

Related Articles 15

Recommended Articles

Metrics

算法	AP/%			mAP@0.5/%	参数量/10⁶	模型大小/MB	FPS
算法	动物类	抛洒垃圾类	道路安全设施类	mAP@0.5/%	参数量/10⁶	模型大小/MB	FPS
Faster-RCNN	93.9	43.6	84.2	73.9	41.358	315.9	89
Cascade-RCNN	89.8	44.7	82.8	72.4	69.158	528.0	54
YOLOv3-tiny	77.1	48.0	79.3	68.1	12.129	24.4	124
YOLOv5n	79.5	49.0	82.4	70.3	2.504	5.3	227
YOLOv6n	82.3	46.7	83.3	70.8	4.234	8.7	150
YOLOv7n	75.5	64.1	81.4	73.7	37.205	74.8	101
YOLOv8n	81.2	56.2	84.4	73.9	3.006	6.2	169
本文算法	88.4	60.5	90.7	79.9	2.627	6.0	138

[1]	Sheng YANG, Yan LI. Contrastive knowledge distillation method for object detection [J]. Journal of Computer Applications, 2025, 45(2): 354-361.
[2]	Shijia WEN, Shijun JING. Dynamic visual SLAM algorithm incorporating object detection and feature point association [J]. Journal of Computer Applications, 2025, 45(2): 610-615.
[3]	Zhongwei ZHANG, Jun WANG, Shudong LIU, Zhiheng WANG. Object detection in remote sensing image based on multi-scale feature fusion and weighted boxes fusion [J]. Journal of Computer Applications, 2025, 45(2): 633-639.
[4]	Songsen YU, Zhifan LIN, Guopeng XUE, Jianyu XU. Lightweight large-format tile defect detection algorithm based on improved YOLOv8 [J]. Journal of Computer Applications, 2025, 45(2): 647-654.
[5]	Yexin PAN, Zhe YANG. Optimization model for small object detection based on multi-level feature bidirectional fusion [J]. Journal of Computer Applications, 2024, 44(9): 2871-2877.
[6]	Yeheng LI, Guangsheng LUO, Qianmin SU. Logo detection algorithm based on improved YOLOv5 [J]. Journal of Computer Applications, 2024, 44(8): 2580-2587.
[7]	Yingjun ZHANG, Niuniu LI, Binhong XIE, Rui ZHANG, Wangdong LU. Semi-supervised object detection framework guided by curriculum learning [J]. Journal of Computer Applications, 2024, 44(8): 2326-2333.
[8]	Song XU, Wenbo ZHANG, Yifan WANG. Lightweight video salient object detection network based on spatiotemporal information [J]. Journal of Computer Applications, 2024, 44(7): 2192-2199.
[9]	Xun SUN, Ruifeng FENG, Yanru CHEN. Monocular 3D object detection method integrating depth and instance segmentation [J]. Journal of Computer Applications, 2024, 44(7): 2208-2215.
[10]	Yue LIU, Fang LIU, Aoyun WU, Qiuyue CHAI, Tianxiao WANG. 3D object detection network based on self-attention mechanism and graph convolution [J]. Journal of Computer Applications, 2024, 44(6): 1972-1977.
[11]	Yaping DENG, Yingjiang LI. Review of YOLO algorithm and its applications to object detection in autonomous driving scenes [J]. Journal of Computer Applications, 2024, 44(6): 1949-1958.
[12]	Huantong GENG, Zhenyu LIU, Jun JIANG, Zichen FAN, Jiaxing LI. Embedded road crack detection algorithm based on improved YOLOv8 [J]. Journal of Computer Applications, 2024, 44(5): 1613-1618.
[13]	Hongtian LI, Xinhao SHI, Weiguo PAN, Cheng XU, Bingxin XU, Jiazheng YUAN. Few-shot object detection via fusing multi-scale and attention mechanism [J]. Journal of Computer Applications, 2024, 44(5): 1437-1444.
[14]	Xiaogang SONG, Dongdong ZHANG, Pengfei ZHANG, Li LIANG, Xinhong HEI. Real-time object detection algorithm for complex construction environments [J]. Journal of Computer Applications, 2024, 44(5): 1605-1612.
[15]	Wei WANG, Chunhui ZHAO, Xinyao TANG, Liugang XI. 3D vehicle detection with adaptive horizon line constraints [J]. Journal of Computer Applications, 2024, 44(3): 909-915.

C2f_CA	HRNet_Fusion	C2	WIoU	mAP@0.5/%	参数量/10⁶	模型大小/MB
				73.9	3.006	6.2
√				76.8	3.015	6.3
		√		75.0	3.019	6.3
	√			76.6	2.597	5.6
	√	√		77.0	2.618	5.9
			√	74.5	3.006	6.2
√	√	√		78.6	2.627	6.0
	√	√	√	78.8	2.627	6.0
√	√	√	√	79.9	2.627	6.0

C2f_CA	HRNet_Fusion	C2	WIoU	mAP@0.5/%	参数量/10⁶	模型大小/MB
				73.9	3.006	6.2
√				76.8	3.015	6.3
		√		75.0	3.019	6.3
	√			76.6	2.597	5.6
	√	√		77.0	2.618	5.9
			√	74.5	3.006	6.2
√	√	√		78.6	2.627	6.0
	√	√	√	78.8	2.627	6.0
√	√	√	√	79.9	2.627	6.0