基于YOLOv11的低秩自适应参数高效微调算法

doi:10.11772/j.issn.1001-9081.2025060751

《计算机应用》唯一官方网站 ›› 2026, Vol. 46 ›› Issue (6): 1738-1745.DOI: 10.11772/j.issn.1001-9081.2025060751

基于YOLOv11的低秩自适应参数高效微调算法

杜艺, 续明进, 孔佳仪(), 王力瑶, 赵晨

北京印刷学院机电工程学院，北京 102600

收稿日期:2025-07-09 修回日期:2025-09-25 接受日期:2025-09-29 发布日期:2025-10-16 出版日期:2026-06-10
通讯作者: 孔佳仪
作者简介:杜艺（2003—），男，湖北咸宁人，硕士研究生，主要研究方向：计算机视觉、目标检测
续明进（1970—），男，山东临沂人，副教授，硕士，主要研究方向：计算机视觉、模式识别
王力瑶（2000—），男，河北秦皇岛人，硕士研究生，主要研究方向：机器学习、电力电子
赵晨（2002—），男，甘肃兰州人，硕士研究生，主要研究方向：智能控制。
第一联系人：孔佳仪（1995—），女，山东泰安人，讲师，博士，主要研究方向：自然语言处理
基金资助:
国家重点研发计划项目(2024YFF0617202);北京市重点实验室建设项目(KYCPT202508)

Low-rank adaptive parameter-efficient fine-tuning algorithm based on YOLOv11

Yi DU, Mingjin XU, Jiayi KONG(), Liyao WANG, Chen ZHAO

School of Mechanical and Electrical Engineering，Beijing Institute of Graphic Communication，Beijing 102600，China

Received:2025-07-09 Revised:2025-09-25 Accepted:2025-09-29 Online:2025-10-16 Published:2026-06-10
Contact: Jiayi KONG
About author:DU Yi， born in 2003， M. S. candidate. His research interests include computer vision， object detection.
XU Mingjin， born in 1970， M. S.， associate professor. His research interests include computer vision， pattern recognition.
WANG Liyao， born in 2000， M. S. candidate. His research interests include machine learning， power electronics.
ZHAO Chen， born in 2002， M. S. candidate. His research interests include intelligent control.
First author contact:KONG Jiayi， born in 1995， Ph. D.， lecturer. Her research interests include natural language processing.
Supported by:
National Key Research and Development Program of China(2024YFF0617202);Beijing Key Laboratory Construction Project(KYCPT202508)

摘要/Abstract

摘要：

针对复杂场景下目标检测任务中深度学习算法的泛化性和鲁棒性受限以及全参数微调（FPFT）计算成本高的问题，提出一种基于YOLOv11 （You Only Look Once version 11）的低秩自适应参数高效微调（PEFT）算法。首先，在YOLOv11的骨干和颈部网络中嵌入低秩自适应（LoRA）模块；其次，结合LoRA、权重分解低秩自适应（DoRA）和主奇异值与奇异向量自适应（PiSSA）这3种低秩分解算法，通过权重分解与动态调整机制实现参数的高效更新；最后，在训练过程中，将YOLOv11网络的绝大部分预训练权重保持冻结状态，仅对LoRA模块中由3种低秩分解算法生成的低秩矩阵进行训练，将可训练参数规模缩减至原算法的1.56%。COCO （Common Objects in COntext）数据集上的实验结果表明，所提算法相较于基线算法YOLOv11在精确度、召回率和交并比阈值为0.5时的平均精度均值（mAP）上分别提升了4.18、7.11和7.85个百分点。可见，所提算法为资源受限场景下的大型检测算法轻量化与高效微调提供了有效技术路径。

关键词: YOLOv11, 参数高效微调, 目标检测, 低秩自适应, 深度学习

Abstract:

In view of the limitations of deep learning algorithms’ generalization and robustness， as well as the high computational cost of Full Parameter Fine-Tuning （FPFT） in object detection tasks in complex scenarios， a low-rank adaptive Parameter-Efficient Fine-Tuning （PEFT） algorithm based on YOLOv11 （You Only Look Once version 11） was proposed. Firstly， a Low-Rank Adaptation （LoRA） module was embedded into the backbone and neck networks of YOLOv11. Secondly， three low-rank decomposition algorithms， including LoRA， weight-Decomposed low-Rank Adaptation （DoRA） and Principal Singular values and Singular vectors Adaptation （PiSSA） were combined， and efficient parameter updates were achieved through weight decomposition and dynamic adjustment mechanisms. Finally， during the training process， most of the pre-trained weights of the YOLOv11 network were kept frozen， and only the low-rank matrices generated by the three low-rank decomposition algorithms in the LoRA module were trained， thereby reducing the trainable parameter size to 1.56% of the original algorithm. Experimental results on the COCO （Common Objects in COntext） dataset demonstrate that the proposed algorithm improves the precision， recall and mean Average Precision （mAP） at IoU （Intersection over Union） threshold of 0.5 by 4.18， 7.11 and 7.85 percentage points， respectively， compared with the baseline algorithm YOLOv11. It can be seen that the proposed algorithm provides an effective technical path for lightweight and efficient fine-tuning of large-scale detection algorithms in resource-constrained scenarios.

Key words: YOLOv11 (You Only Look Once version 11), Parameter?Efficient Fine?Tuning (PEFT), object detection, Low-Rank Adaptation (LoRA), deep learning

中图分类号:

TP391.4

杜艺, 续明进, 孔佳仪, 王力瑶, 赵晨. 基于YOLOv11的低秩自适应参数高效微调算法[J]. 计算机应用, 2026, 46(6): 1738-1745.

Yi DU, Mingjin XU, Jiayi KONG, Liyao WANG, Chen ZHAO. Low-rank adaptive parameter-efficient fine-tuning algorithm based on YOLOv11[J]. Journal of Computer Applications, 2026, 46(6): 1738-1745.

图/表 11

参考文献 36

[1]	GIRSHICK R， DONAHUE J， DARRELL T， et al. Rich feature hierarchies for accurate object detection and semantic segmentation［C］// Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2014： 580-587.
[2]	GIRSHICK R. Fast R-CNN［C］// Proceedings of the 2015 IEEE International Conference on Computer Vision. Piscataway： IEEE， 2015： 1440-1448.
[3]	REN S， HE K， GIRSHICK R， et al. Faster R-CNN： towards real-time object detection with region proposal networks［J］. IEEE Transactions on Pattern Analysis and Machine Intelligence， 2017， 39（6）： 1137-1149.
[4]	李鸿天，史鑫昊，潘卫国，等. 融合多尺度和注意力机制的小样本目标检测［J］. 计算机应用， 2024， 44（5）： 1437-1444.
	LI H T， SHI X H， PAN W G， et al. Few-shot object detection via fusing multi-scale and attention mechanism［J］. Journal of Computer Applications， 2024， 44（5）： 1437-1444.
[5]	REDMON J， DIVVALA S， GIRSHICK R， et al. You only look once： unified， real-time object detection［C］// Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2016： 779-788.
[6]	LIU W， ANGUELOV D， ERHAN D， et al. SSD： single shot multiBox detector［C］// Proceedings of the 2016 European Conference on Computer Vision， LNCS 9905. Cham： Springer， 2016： 21-37.
[7]	ZHAO Y， LV W， XU S， et al. DETRS beat YOLOs on real-time object detection［C］// Proceedings of the 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2024： 16965-16974.
[8]	郑华伟，王飞，高建邦. DES-YOLO：一种更精确的目标检测方法［J］. 光电工程， 2024， 51（11）： No.240212.
	ZHENG H W， WANG F， GAO J B. DES-YOLO： a more accurate object detection method［J］. Opto-Electronic Engineering， 2024， 51（11）： No.240212.
[9]	刘皓皎，刘力双，张明淳. 基于YOLOv5改进的红外目标检测算法［J］. 激光技术， 2024， 48（4）： 534-541.
	LIU H J， LIU L S， ZHANG M C. An improved infrared object detection algorithm based on YOLOv5［J］. Laser Technology， 2024， 48（4）： 534-541.
[10]	肖振久，张杰浩，林渤翰. 特征协同与细粒度感知的遥感图像小目标检测［J］. 光电工程， 2024， 51（6）： No.240066.
	XIAO Z J， ZHANG J H， LIN B H. Feature coordination and fine-grained perception of small targets in remote sensing images［J］. Opto-Electronic Engineering， 2024， 51（6）： No.240066.
[11]	PU G， JAIN A， YIN J， et al. Empirical analysis of the strengths and weaknesses of PEFT techniques for LLMs［EB/OL］. ［2025-01-08］..
[12]	KAUR R， SINGH S. A comprehensive review of object detection with deep learning［J］. Digital Signal Processing， 2023， 132： No.103812.
[13]	HAN X， PU N， FENG Z， et al. Benchmarking multi-scene fire and smoke detection［C］// Proceedings of the 2024 Chinese Conference on Pattern Recognition and Computer Vision， LNCS 15041. Singapore： Springer， 2025： 203-218.
[14]	LI P， GUO T， WANG B， et al. Grid-Attention： enhancing computational efficiency of large vision models without fine-tuning［C］// Proceedings of the 2024 European Conference on Computer Vision， LNCS 15108. Cham： Springer， 2025： 54-70.
[15]	YE T， ZHENG Z， LI X， et al. An efficient few-shot object detection method for railway intrusion via fine-tune approach and contrastive learning［J］. IEEE Transactions on Instrumentation and Measurement， 2023， 72： No.5018713.
[16]	CHENG T， SONG L， GE Y， et al. YOLO-World： real-time open-vocabulary object detection［C］// Proceedings of the 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2024： 16901-16911.
[17]	MA B， XU W. Efficient fine tuning for fashion object detection［J］. Sensors， 2023， 23（13）： No.6083.
[18]	HUANG X， HE B， TONG M， et al. Few-shot object detection on remote sensing images via shared attention module and balanced fine-tuning strategy［J］. Remote Sensing， 2021， 13（19）： No.3816.
[19]	LI M， WU J， WANG X， et al. AlignDet： aligning pre-training and fine-tuning in object detection［C］// Proceedings of the 2023 IEEE/CVF International Conference on Computer Vision. Piscataway： IEEE， 2023： 6843-6853.
[20]	KHANAM R， HUSSAIN M. YOLOv11： an overview of the key architectural enhancements［EB/OL］. ［2025-01-08］..
[21]	HU E J， SHEN Y， WALLIS P， et al. LoRA： low-rank adaptation of large language models［EB/OL］. ［2025-04-02］..
[22]	LIU S Y， WANG C Y， YIN H， et al. DoRA： weight-decomposed low-rank adaptation［C］// Proceedings of the 41st International Conference on Machine Learning. New York： JMLR.org， 2024： 32100-32121.
[23]	MENG F， WANG Z， ZHANG M. PiSSA： principal singular values and singular vectors adaptation of large language models［C］// Proceedings of the 38th International Conference on Neural Information Processing Systems. Red Hook： Curran Associates Inc.， 2024： 121038-121072.
[24]	张溢文，蔡满春，陈咏豪，等. 融合多种参数高效微调技术的深度伪造检测方法［J］. 计算机科学与探索， 2024， 18（12）： 3335-3347.
	ZHANG Y W， CAI M C， CHEN Y H， et al. Deepfake detection method integrating multiple parameter-efficient fine-tuning techniques［J］. Journal of Frontiers of Computer Science and Technology， 2024， 18（12）： 3335-3347.
[25]	SREENIVASAN K， SOHN J， YANG L， et al. Rare gems： finding lottery tickets at initialization［C］// Proceedings of the 36th International Conference on Neural Information Processing Systems. Red Hook： Curran Associates Inc.， 2022： 14529-14540.
[26]	ZHONG Z， TANG Z， HE T， et al. Convolution meets LoRA： parameter efficient finetuning for segment anything model［EB/OL］. ［2025-04-18］..
[27]	朱琦，周德强，盛卫锋，等. 基于DSCS-YOLO的苹果表面缺陷检测方法［J］. 南京农业大学学报， 2024， 47（3）： 592-601.
	ZHU Q， ZHOU D Q， SHENG W F， et al. Apple surface defect detection method based on DSCS-YOLO［J］. Journal of Nanjing Agricultural University， 2024， 47（3）： 592-601.
[28]	WANG C Y， MARK LIAO H Y， WU Y H， et al. CSPNet： a new backbone that can enhance learning capability of CNN［C］// Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops. Piscataway： IEEE， 2020： 1571-1580.
[29]	JOOSHIN H K， NANGIR M， SEYEDARABI H. Inception-YOLO： computational cost and accuracy improvement of the YOLOv5 model based on employing modified CSP， SPPF， and inception modules［J］. IET Image Processing， 2024， 18（8）： 1985-1999.
[30]	LIU S， QI L， QIN H， et al. Path aggregation network for instance segmentation［C］// Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2018： 8759-8768.
[31]	HOSNA A， MERRY E， GYALMO J， et al. Transfer learning： a friendly introduction［J］. Journal of Big Data， 2022， 9： No.102.
[32]	BROWN T B， MANN B， RYDER N， et al. Language models are few-shot learners［C］// Proceedings of the 34th International Conference on Neural Information Processing Systems. Red Hook： Curran Associates Inc.， 2020： 1877-1901.
[33]	KAPLAN J， McCANDLISH S， HENIGHAN T， et al. Scaling laws for neural language models［EB/OL］. ［2025-02-22］..
[34]	HAN Z， GAO C， LIU J， et al. Parameter-efficient fine-tuning for large models： a comprehensive survey［EB/OL］. ［2025-02-22］..
[35]	LIN Z， MADOTTO A， FUNG P. Exploring versatile generative language model via parameter-efficient transfer learning［C］// Findings of the Association for Computational Linguistics： EMNLP 2020. Stroudsburg： ACL， 2020： 441-459.
[36]	LIN T Y， MAIRE M， BELONGIE S， et al. Microsoft COCO： common objects in context［C］// Proceedings of the 2014 European Conference on Computer Vision， LNCS 8693. Cham： Springer， 2014： 740-755.

算法	P/%	R/%	mAP_0.5/%	mAP_0.5：0.95/%	S/MB
RT-DETR	69.30	58.06	61.83	44.19	31.44
YOLOv3	52.34	38.81	40.37	27.61	31.20
YOLOv5m	66.11	52.80	57.58	41.46	23.90
YOLOv8m	66.77	55.24	59.46	43.46	24.70
YOLOv10m	65.03	54.00	58.01	42.14	15.80
YOLOv11m	67.17	54.51	58.84	42.87	19.20
YOLOv12m	67.03	54.68	58.78	43.08	19.30
YOLOv13l	69.55	55.83	60.39	44.38	26.34
本文算法	71.35	61.62	66.69	49.90	19.50

算法	P/%	R/%	mAP_0.5/%	mAP_0.5：0.95/%	S/MB
RT-DETR	69.30	58.06	61.83	44.19	31.44
YOLOv3	52.34	38.81	40.37	27.61	31.20
YOLOv5m	66.11	52.80	57.58	41.46	23.90
YOLOv8m	66.77	55.24	59.46	43.46	24.70
YOLOv10m	65.03	54.00	58.01	42.14	15.80
YOLOv11m	67.17	54.51	58.84	42.87	19.20
YOLOv12m	67.03	54.68	58.78	43.08	19.30
YOLOv13l	69.55	55.83	60.39	44.38	26.34
本文算法	71.35	61.62	66.69	49.90	19.50

C3K2	SPPF	C2PSA	P/%	R/%	mAP_0.5/%	mAP_0.5：0.95/%	S/MB
			67.17	54.51	58.84	42.87	19.20
√			70.67	61.16	66.61	49.92	19.45
√	√		70.68	61.16	66.61	49.92	19.46
√	√	√	71.35	61.62	66.69	49.90	19.50

C3K2	SPPF	C2PSA	P/%	R/%	mAP_0.5/%	mAP_0.5：0.95/%	S/MB
			67.17	54.51	58.84	42.87	19.20
√			70.67	61.16	66.61	49.92	19.45
√	√		70.68	61.16	66.61	49.92	19.46
√	√	√	71.35	61.62	66.69	49.90	19.50

算法	训练参数量/MB	P/%	R/%	mAP_0.5/%	mAP_0.5：0.95/%
FPFT	19.18	67.17	54.51	58.84	42.87
LoRA	0.65	71.51	60.94	66.34	49.86
DoRA	0.73	68.79	58.61	63.02	46.56
PiSSA	0.30	71.35	61.62	66.69	49.90

基于YOLOv11的低秩自适应参数高效微调算法

Low-rank adaptive parameter-efficient fine-tuning algorithm based on YOLOv11

RichHTML

PDF

可视化

摘要/Abstract

引用本文

使用本文

图/表 11

参考文献 36

相关文章 15

编辑推荐

Metrics

[1]	张纾豪, 何坤金, 徐佳晨, 沙河山, 陈正鸣. 融合透视校正与轻量注意力机制的轮毂缺陷检测方法[J]. 《计算机应用》唯一官方网站, 2026, 46(6): 2007-2015.
[2]	尹秋燕, 丁婧, 聂志刚. 无人机航拍视角下的人体姿态估计算法YOLO-AirPose[J]. 《计算机应用》唯一官方网站, 2026, 46(6): 1989-1997.
[3]	熊珍凯, 徐梦军, 孙胤胤, 王鑫. 基于改进YOLOv8的复杂天气环境下海面船舶检测算法[J]. 《计算机应用》唯一官方网站, 2026, 46(6): 1998-2006.
[4]	何丽丽, 曹勐, 张磊, 潘洪军, 刘义, 孙成心. 基于Kolmogorov-Arnold网络与扩散Transformer的手语生成模型[J]. 《计算机应用》唯一官方网站, 2026, 46(6): 1801-1810.
[5]	盛兴, 翁孙贤, 陈扩松, 王忠平, 任芮锋, 刘勇. 基于深度学习的电网企业专利价值评估[J]. 《计算机应用》唯一官方网站, 2026, 46(5): 1468-1474.
[6]	吴闵奇, 杨元华, 李航, 胡雅琴, 汤智豪, 梅腾. 基于图Transformer和RT-DETR的轻量化水下小目标检测[J]. 《计算机应用》唯一官方网站, 2026, 46(5): 1586-1595.
[7]	张红瑞, 冯威铭, 杨潞霞, 马永杰. 基于YOLO11改进的水下小目标检测算法CSAF-YOLO[J]. 《计算机应用》唯一官方网站, 2026, 46(5): 1578-1585.
[8]	刘馨瑶, 梁军, 龙嘉濠, 颜仁梁. 基于特征融合和通道信息补偿的中草药细粒度图像分类[J]. 《计算机应用》唯一官方网站, 2026, 46(5): 1677-1683.
[9]	郭慧洁, 窦天凤, 张振琳, 亓开元, 吴栋, 曲志坚, 李钊, 任崇广. 基于时间依赖建模的动态贝叶斯网络交通预测[J]. 《计算机应用》唯一官方网站, 2026, 46(5): 1507-1517.
[10]	张永兵, 闫丽蓉, 唐晓芬. 渐进式双阶段模态交互的单域泛化目标检测[J]. 《计算机应用》唯一官方网站, 2026, 46(4): 1264-1274.
[11]	陈浩轩, 叶培昌, 刘磊, 刘承明, 胡文华. 自动代码编辑推荐综述[J]. 《计算机应用》唯一官方网站, 2026, 46(4): 1227-1237.
[12]	陈鹏, 李旭, 余肖生. RGB-D双流镜像伪装目标检测网络[J]. 《计算机应用》唯一官方网站, 2026, 46(4): 1253-1263.
[13]	郭阳, 王海亮, 高需, 王海涛, 王翌博. BEV三维目标检测算法体系综述[J]. 《计算机应用》唯一官方网站, 2026, 46(4): 1238-1252.
[14]	何帅, 邓春华. 基于YOLO-World的少样本学习目标检测算法[J]. 《计算机应用》唯一官方网站, 2026, 46(4): 1275-1282.
[15]	陈敏, 秦小林, 李绍涵, 杨昊, 李韬弘. 深度学习应用于强对流天气预测的综述[J]. 《计算机应用》唯一官方网站, 2026, 46(3): 980-992.