Real-time vehicle detection algorithm based on YOLOv10

doi:10.11772/j.issn.1001-9081.2025040487

Journal of Computer Applications ›› 2026, Vol. 46 ›› Issue (3): 950-958.DOI: 10.11772/j.issn.1001-9081.2025040487

• Multimedia computing and computer simulation • Previous Articles Next Articles

Real-time vehicle detection algorithm based on YOLOv10

Yinshan YU(), Xu TANG, Mingjian DING, Wenkai HUANG, Jiawen BI, Guochen TAN

Faculty of Electronic Information Engineering，Huaiyin Institute of Technology，Huai’an Jiangsu 223003，China

Received:2025-04-30 Revised:2025-07-26 Accepted:2025-07-29 Online:2025-08-01 Published:2026-03-10
Contact: Yinshan YU
About author:TANG Xu， born in 2000， M. S. candidate. Her research interests include object detection， deep learning.
DING Mingjian， born in 2001， M. S. candidate. His research interests include object detection， deep learning.
HUANG Wenkai， born in 2001， M. S. candidate. His research interests include computer vision， object detection.
BI Jiawen， born in 2000， M. S. candidate. Her research interests include computer vision， object detection.
TAN Guochen， born in 1999， M. S. candidate. Her research interests include computer vision， object detection.
Supported by:
National Natural Science Foundation of China(61801188);Basic Science Research Project for Colleges and Universities in Jiangsu Province(22KJD510001);Postgraduate Scientific Research and Practice Innovation Program of Jiangsu Province(SJCX25_2193);Huai’an Science and Technology Program(HAB2025007)

基于YOLOv10的实时车辆检测算法

于银山(), 唐旭, 丁明鉴, 黄文凯, 毕嘉文, 谭国辰

淮阴工学院电子信息工程学院，江苏淮安 223003

通讯作者: 于银山
作者简介:唐旭（2000—），女，江苏盐城人，硕士研究生，主要研究方向：目标检测、深度学习
丁明鉴（2001—），男，江苏淮安人，硕士研究生，主要研究方向：目标检测、深度学习
黄文凯（2001—），男，江苏盐城人，硕士研究生，主要研究方向：计算机视觉、目标检测
毕嘉文（2000—），女，山东菏泽人，硕士研究生，主要研究方向：计算机视觉、目标检测
谭国辰（1999—），女，江苏徐州人，硕士研究生，主要研究方向：计算机视觉、目标检测。
基金资助:
国家自然科学基金资助项目(61801188);江苏省高等学校基础科学研究面上项目(22KJD510001);江苏省研究生科研与实践创新计划项目(SJCX25_2193);淮安市科技计划项目(HAB2025007)

Abstract

Abstract:

With the advancement of autonomous driving technology， real-time vehicle detection has become crucial for ensuring system safety and reliability. Therefore， a lightweight detection model based on YOLOv10， named YOLOv10-LITE was designed by introducing four structural improvement modules to reduce model complexity and inference latency while maintaining detection accuracy， for real-time detection tasks in resource-constrained environments. Specifically， the Dynamic Upsampling （DySample） module was applied to enhance the resolution of feature maps while reducing computational cost； the Fast Multi-Scale Network （FastMSNet） module was used to improve multi-scale feature extraction and enhance detection performance for objects of different sizes； the Spatial Pyramid Pooling-Local Selective Kernel Attention （SPPF_LSKA） module was introduced to capture long-range dependencies effectively by combining local feature selection and global contextual modeling； the Adaptive Granular Fine-grained Channel Attention （AGFCA） module was incorporated to improve critical information perception ability through collaboration between spatial and channel attention. Experimental results on the KITTI dataset show that YOLOv10-LITE achieves a mean Average Precision （mAP） of 77.1%， which is 2.4% higher than that of YOLOv10， with the parameter count reduced by 8.7 percentage points. The above results verify the proposed model’s practicality in autonomous driving scenarios with both computational constraints and real-time demands.

Key words: autonomous driving, object detection, real-time detection, YOLOv10, lightweight model

摘要：

随着自动驾驶技术的发展，实时车辆检测在确保系统安全性和可靠性方面至关重要。因此，设计一种基于YOLOv10的轻量化检测模型——YOLOv10-LITE。所提模型通过引入4个结构改进模块，在保持检测精度的同时，有效降低模型的复杂度和推理延迟，适用于资源受限环境下的实时检测任务。具体而言，使用动态上采样（DySample）模块在降低计算开销的同时提升特征图的分辨率；使用快速多尺度网络（FastMSNet）模块增强多尺度特征提取能力，提高对不同尺寸目标的检测效果；使用空间金字塔池化-局部选择性大核注意力（SPPF_LSKA）模块结合局部特征选择与全局上下文建模，从而有效捕获长程依赖；使用自适应细粒度通道注意力（AGFCA）模块通过通道与空间注意力的协同作用，提升关键特征信息的感知能力。在KITTI数据集上的实验结果表明，YOLOv10-LITE的平均精度均值（mAP）达到了77.1%，相较于YOLOv10提升了2.4个百分点；同时，参数量减少了8.7%。以上结果验证了所提模型在计算受限且需满足实时性的自动驾驶场景中的实用性。

关键词: 自动驾驶, 目标检测, 实时检测, YOLOv10, 轻量化模型

CLC Number:

TP391.4

Yinshan YU, Xu TANG, Mingjian DING, Wenkai HUANG, Jiawen BI, Guochen TAN. Real-time vehicle detection algorithm based on YOLOv10[J]. Journal of Computer Applications, 2026, 46(3): 950-958.

于银山, 唐旭, 丁明鉴, 黄文凯, 毕嘉文, 谭国辰. 基于YOLOv10的实时车辆检测算法[J]. 《计算机应用》唯一官方网站, 2026, 46(3): 950-958.

Figures/Tables 15

References 32

[1]	DAI X. HybridNet： a fast vehicle detection system for autonomous driving ［J］. Signal Processing： Image Communication， 2019， 70： 79-88.
[2]	CHOUDHURY S， CHATTOPADHYAY S P， HAZRA T K. Vehicle detection and counting using Haar feature-based classifier ［C］// Proceedings of the 8th Annual Industrial Automation and Electromechanical Engineering Conference. Piscataway： IEEE， 2017： 106-109.
[3]	WEI Y， TIAN Q， GUO J， et al. Multi-vehicle detection algorithm through combining Harr and HOG features ［J］. Mathematics and Computers in Simulation， 2019， 155： 130-145.
[4]	张依林，王学颖. 基于CNN-SVM的车辆检测与类型分类［J］. 电子技术与软件工程， 2022（7）： 190-193.
	ZHANG Y L， WANG X Y. Vehicle detection and type classification based on CNN-SVM［J］. Electronic Technology and Software Engineering， 2022（7）： 190-193.
[5]	LI Z， LIU F， YANG W， et al. A survey of convolutional neural networks： analysis， applications， and prospects ［J］. IEEE Transactions on Neural Networks and Learning Systems， 2021， 33（12）： 6999-7019.
[6]	LIANG F， ZHOU Y， CHEN X， et al. Review of target detection technology based on deep learning ［C］// Proceedings of the 5th International Conference on Control Engineering and Artificial Intelligence. New York： ACM， 2021： 132-135.
[7]	KANG L， LU Z， MENG L， et al. YOLO-FA： type-1 fuzzy attention based YOLO detector for vehicle detection ［J］. Expert Systems with Applications， 2024， 237（Pt B）： No.121209.
[8]	LI Y， ZHANG M， ZHANG C， et al. YOLO-CCS： vehicle detection algorithm based on coordinate attention mechanism ［J］. Digital Signal Processing， 2024， 153： No.104632.
[9]	CHENG X， QIU G， YU J， et al. An improved small object detection method based on Yolo V3 ［J］. Pattern Analysis and Applications， 2021， 24（3）： 1347-1355.
[10]	CAI Y， LUAN T， GAO H， et al. YOLOv4-5D： an effective and efficient object detector for autonomous driving ［J］. IEEE Transactions on Instrumentation and Measurement， 2021， 70： No.4503613.
[11]	GUO L， ZHOU X， ZHAO Y， et al. Improved YOLOv7 algorithm incorporating InceptionNeXt and attention mechanism for vehicle detection under adverse lighting conditions ［J］. Signal， Image and Video Processing， 2025， 19（4）： No.299.
[12]	LIU L， LU S， ZHONG R， et al. Computing systems for autonomous driving： state of the art and challenges ［J］. IEEE Internet of Things Journal， 2021， 8（8）： 6469-6486.
[13]	LIU W， LU H， FU H， et al. Learning to upsample by learning to sample［C］// Proceedings of the 2023 IEEE/CVF International Conference on Computer Vision. Piscataway： IEEE， 2023： 6004-6014.
[14]	DU D， XIE Y. Vehicle and pedestrian detection algorithm in an autonomous driving scene based on improved YOLOv8 ［J］. Journal of Transportation Engineering， Part A： Systems， 2025， 151（1）： No.8446.
[15]	LAU K W， PO L M， REHMAN Y A. Large separable kernel attention： rethinking the large kernel attention design in CNN ［J］. Expert Systems with Applications， 2024， 236： No.121352.
[16]	GEIGER A， LENZ P， STILLER C， et al. Vision meets robotics： the KITTI dataset ［J］. The International Journal of Robotics Research， 2013， 32（11）： 1231-1237.
[17]	赖勤波，马正华，朱蓉. 基于注意力机制和空洞卷积的无人机图像目标检测［J］. 计算机应用与软件， 2025， 42（2）： 227-235.
	LAI Q B， MA Z H， ZHU R. UAV image target detection based on attention mechanism and dilated convolution ［J］. Computer Applications and Software， 2025， 42（2）： 227-235.
[18]	JEONG J， PARK H， KWAK N. Enhancement of SSD by concatenating feature maps for object detection ［C］// Proceedings of the 2017 British Machine Vision Conference. Durham： BMVA Press， 2017： No.76.
[19]	LU X， SONG W. Improved YOLOv5s model for vehicle detection and recognition ［C］// Proceedings of the 2022 International Conference on Intelligent Computing， LNCS 13395. Cham： Springer， 2022： 423-434.
[20]	LI C， LI L， JIANG H， et al. YOLOv6： a single-stage object detection framework for industrial applications ［EB/OL］. ［2025-07-22］..
[21]	BAKIRCI M， BAYRAKTAR I. YOLOv9-enabled vehicle detection for urban security and forensics applications［C］// Proceedings of the 12th International Symposium on Digital Forensics and Security. Piscataway： IEEE， 2024： 1-6.
[22]	BAKIRCI M. Enhancing vehicle detection in intelligent transportation systems via autonomous UAV platform and YOLOv8 integration［J］. Applied Soft Computing， 2024， 164： No.112015.
[23]	ZHANG H. Real-time pedestrian detection algorithm of YOLOv10 under complex lighting and occlusion conditions ［C］// Proceedings of the 2024 International Conference on Advances in Electrical Engineering and Computer Applications. Piscataway： IEEE， 2024： 645-650.
[24]	LI Y， YANG W， WANG L， et al. HawkEye Conv-driven YOLOv10 with advanced feature pyramid networks for small object detection in UAV imagery ［J］. Drones， 2024， 8（12）： No.713.
[25]	VELYCHKO D， SINGH S， MARKOPOULOS P P， et al. Image preprocessing and YOLO architectures for enhanced small and slow-moving object detection［C］// Proceedings of the 2024 IEEE Western New York Image and Signal Processing Workshop. Piscataway： IEEE， 2024： 1-4.
[26]	LI B， QIN Z， MU Y， et al. Small-target-driven YOLOV10-WD： Wasserstein distance optimization and adaptive scale fusion［EB/OL］. ［2025-07-08］. .
[27]	高旭，闫奥函，范文博，等. 基于ResNet网络的烟叶成熟度图像识别［J］. 农业灾害研究， 2025， 15（2）： 52-54.
	GAO X， YAN A H， FAN W B， et al. Image recognition of tobacco leaf maturity based on ResNet network ［J］. Journal of Agricultural Catastrophology， 2025， 15（2）： 52-54.
[28]	RUBAN S， MOOSA JABEER M， SHENOY BASTI R. Improvising breast cancer detection using CNN， VGG and SSD algorithms ［C］// Proceedings of the 3rd International Conference on Intelligent Computing Instrumentation and Control Technologies. Piscataway： IEEE， 2022： 1772-1776.
[29]	MIAO T， ZENG H， YANG W， et al. An improved lightweight RetinaNet for ship detection in SAR images ［J］. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing， 2022， 15： 4667-4679.
[30]	HUANG S， HE Y， CHEN X. M-YOLO： a nighttime vehicle detection method combining MobileNet v2 and YOLO v3 ［J］. Journal of Physics： Conference Series， 2021， 1883： No.012094.
[31]	MURTHY C B， HASHMI M F， KESKAR A G. EfficientLiteDet： a real-time pedestrian and vehicle detection algorithm ［J］. Machine Vision and Applications， 2022， 33（3）： No.47.
[32]	DONG Z， WU Y， PEI M， et al. Vehicle type classification using a semisupervised convolutional neural network ［J］. IEEE Transactions on Intelligent Transportation Systems， 2015， 16（4）： 2247-2256.

项目	参数
实验平台	AutoDL 算力云
操作系统	Ubuntu 18.04
显卡型号	NVIDIA GeForce RTX 4090（24 GB）
CPU型号	16 vCPU Intel Xeon Gold 6430
内存	120 GB
Python版本	Python 3.8
模型框架	PyTorch2.0.0+cu118+torchvision 0.15.1

项目	参数
实验平台	AutoDL 算力云
操作系统	Ubuntu 18.04
显卡型号	NVIDIA GeForce RTX 4090（24 GB）
CPU型号	16 vCPU Intel Xeon Gold 6430
内存	120 GB
Python版本	Python 3.8
模型框架	PyTorch2.0.0+cu118+torchvision 0.15.1

名称	配置	名称	配置
Epochs	300	Learn rate	0.01
Batch_size	32	workers	8
Momentum	0.937	Optimizer	SGD
Weight decay	0.000 5	imgsz	640

名称	配置	名称	配置
Epochs	300	Learn rate	0.01
Batch_size	32	workers	8
Momentum	0.937	Optimizer	SGD
Weight decay	0.000 5	imgsz	640

方法	DySample	FastMSNet	SPPF_LSKA	AGFCA	mAP	召回率	精度	浮点计算量/GFLOPs	参数量/10⁶
1					0.747	0.681	0.762	6.5	2.3
2	√				0.748	0.717	0.730	6.0	2.2
3		√			0.721	0.664	0.729	5.0	1.7
4			√		0.753	0.707	0.738	5.3	2.5
5				√	0.747	0.682	0.753	6.5	2.3
6	√	√			0.742	0.687	0.745	5.1	1.8
7	√	√	√		0.752	0.702	0.724	5.3	2.1
8	√	√	√	√	0.771	0.706	0.779	5.4	2.1

Real-time vehicle detection algorithm based on YOLOv10

基于YOLOv10的实时车辆检测算法

RichHTML

PDF

Knowledge

Abstract

Cite this article

share this article

Figures/Tables 15

References 32

Related Articles 15

Recommended Articles

Metrics

算法	输入尺寸	参数量/10⁶	计算量/GFLOPs	mAP	召回率	精度
Faster-RCNN	720×720	61.90	170.40	0.603	0.567	0.622
SSD	512×512	26.28	62.75	0.689	0.641	0.715
RetinaNet^［29］	112×112	31.20	63.20	0.595	0.554	0.612
MobileNet^［30］	256×256	4.30	12.60	0.706	0.663	0.728
Tiny-YOLOv4^［31］	768×384	5.60	20.40	0.697	0.647	0.738
YOLOv5s	640×640	4.50	16.20	0.729	0.684	0.745
YOLOv6	640×640	4.20	11.80	0.712	0.665	0.745
YOLOv9	640×640	2.60	10.70	0.738	0.681	0.770
YOLOv8n	640×640	3.20	8.40	0.741	0.678	0.765
YOLOv10	640×640	2.30	6.50	0.747	0.681	0.762
YOLOv10X	640×640	3.20	10.40	0.698	0.643	0.712
YOLOv10-masks	640×640	2.40	6.80	0.752	0.715	0.766
YOLOv10-WD	640×640	4.10	11.60	0.714	0.673	0.756
YOLOv10-LITE	640×640	2.10	5.40	0.771	0.706	0.779

数据集	模型	参数量/10⁶	计算量/GFLOPs	mAP	召回率	精度
KITTI	YOLOv10	2.3	6.5	0.747	0.681	0.762
KITTI	YOLOv10- LITE	2.1	5.4	0.771	0.706	0.779
BIT Vehicle	YOLOv10	2.2	6.5	0.961	0.918	0.918
BIT Vehicle	YOLOv10- LITE	2.2	5.7	0.980	0.939	0.956

[1]	Quanjie LIU, Zhaoyi GU, Chunyuan WANG. Unsafe driving behavior detection under complex lighting conditions [J]. Journal of Computer Applications, 2026, 46(2): 613-619.
[2]	Yuebo FAN, Mingxuan CHEN, Xian TANG, Yongbin GAO, Wenchao LI. Multi-dimensional frequency domain feature fusion for human-object interaction detection [J]. Journal of Computer Applications, 2026, 46(2): 580-586.
[3]	Mingguang LI, Chongben TAO. Hierarchical cross-modal fusion method for 3D object detection based on Mamba model [J]. Journal of Computer Applications, 2026, 46(2): 572-579.
[4]	Shiwei LI, Yufeng ZHOU, Pengfei SUN, Weisong LIU, Zhuxuan MENG, Haojie LIAN. Point cloud data augmentation method based on scattering and absorption effects of coal dust on LiDAR electromagnetic waves [J]. Journal of Computer Applications, 2026, 46(1): 331-340.
[5]	Shuwen HUANG, Keyu GUO, Xiangyu SONG, Feng HAN, Shijie SUN, Huansheng SONG. Multi-target 3D visual grounding method based on monocular images [J]. Journal of Computer Applications, 2026, 46(1): 207-215.
[6]	Yu SANG, Tong GONG, Chen ZHAO, Bowen YU, Siman LI. Domain-adaptive nighttime object detection method with photometric alignment [J]. Journal of Computer Applications, 2026, 46(1): 242-251.
[7]	Binhong XIE, Rui WANG, Rui ZHANG, Yingjun ZHANG. Agent prototype distillation algorithm for few-shot object detection [J]. Journal of Computer Applications, 2026, 46(1): 233-241.
[8]	Jiaxiang ZHANG, Xiaoming LI, Jiahui ZHANG. Few-shot object detection algorithm based on new category feature enhancement and metric mechanism [J]. Journal of Computer Applications, 2025, 45(9): 2984-2992.
[9]	Lili WEI, Lirong YAN, Xiaofen TANG. Contextual semantic representation and pixel relationship correction for few-shot object detection [J]. Journal of Computer Applications, 2025, 45(9): 2993-3002.
[10]	Binhong XIE, Yingkun LA, Yingjun ZHANG, Rui ZHANG. Semi-supervised object detection framework guided by self-paced learning [J]. Journal of Computer Applications, 2025, 45(8): 2546-2554.
[11]	Haiyang PENG, Weixing JI, Fawang LIU. Blockchain-based data notarization model for autonomous driving simulation testing [J]. Journal of Computer Applications, 2025, 45(8): 2421-2427.
[12]	Chengzhi YAN, Ying CHEN, Kai ZHONG, Han GAO. 3D object detection algorithm based on multi-scale network and axial attention [J]. Journal of Computer Applications, 2025, 45(8): 2537-2545.
[13]	Liang CHEN, Xuan WANG, Kun LEI. Helmet wearing detection algorithm for complex scenarios based on cross-layer multi-scale feature fusion [J]. Journal of Computer Applications, 2025, 45(7): 2333-2341.
[14]	Zimo ZHANG, Xuezhuan ZHAO. Multi-scale sparse graph guided vision graph neural networks [J]. Journal of Computer Applications, 2025, 45(7): 2188-2194.
[15]	Pingping YU, Yuting YAN, Xinliang TANG, He SU, Jianchao WANG. Multi-object tracking algorithm for construction machinery in transmission line scenarios [J]. Journal of Computer Applications, 2025, 45(7): 2351-2360.