《计算机应用》唯一官方网站 ›› 2026, Vol. 46 ›› Issue (6): 1738-1745.DOI: 10.11772/j.issn.1001-9081.2025060751
收稿日期:2025-07-09
修回日期:2025-09-25
接受日期:2025-09-29
发布日期:2025-10-16
出版日期:2026-06-10
通讯作者:
孔佳仪
作者简介:杜艺(2003—),男,湖北咸宁人,硕士研究生,主要研究方向:计算机视觉、目标检测基金资助:
Yi DU, Mingjin XU, Jiayi KONG(
), Liyao WANG, Chen ZHAO
Received:2025-07-09
Revised:2025-09-25
Accepted:2025-09-29
Online:2025-10-16
Published:2026-06-10
Contact:
Jiayi KONG
About author:DU Yi, born in 2003, M. S. candidate. His research interests include computer vision, object detection.Supported by:摘要:
针对复杂场景下目标检测任务中深度学习算法的泛化性和鲁棒性受限以及全参数微调(FPFT)计算成本高的问题,提出一种基于YOLOv11 (You Only Look Once version 11)的低秩自适应参数高效微调(PEFT)算法。首先,在YOLOv11的骨干和颈部网络中嵌入低秩自适应(LoRA)模块;其次,结合LoRA、权重分解低秩自适应(DoRA)和主奇异值与奇异向量自适应(PiSSA)这3种低秩分解算法,通过权重分解与动态调整机制实现参数的高效更新;最后,在训练过程中,将YOLOv11网络的绝大部分预训练权重保持冻结状态,仅对LoRA模块中由3种低秩分解算法生成的低秩矩阵进行训练,将可训练参数规模缩减至原算法的1.56%。COCO (Common Objects in COntext)数据集上的实验结果表明,所提算法相较于基线算法YOLOv11在精确度、召回率和交并比阈值为0.5时的平均精度均值(mAP)上分别提升了4.18、7.11和7.85个百分点。可见,所提算法为资源受限场景下的大型检测算法轻量化与高效微调提供了有效技术路径。
中图分类号:
杜艺, 续明进, 孔佳仪, 王力瑶, 赵晨. 基于YOLOv11的低秩自适应参数高效微调算法[J]. 计算机应用, 2026, 46(6): 1738-1745.
Yi DU, Mingjin XU, Jiayi KONG, Liyao WANG, Chen ZHAO. Low-rank adaptive parameter-efficient fine-tuning algorithm based on YOLOv11[J]. Journal of Computer Applications, 2026, 46(6): 1738-1745.
| 算法 | P/% | R/% | mAP0.5/% | mAP0.5:0.95/% | S/MB |
|---|---|---|---|---|---|
| RT-DETR | 69.30 | 58.06 | 61.83 | 44.19 | 31.44 |
| YOLOv3 | 52.34 | 38.81 | 40.37 | 27.61 | 31.20 |
| YOLOv5m | 66.11 | 52.80 | 57.58 | 41.46 | 23.90 |
| YOLOv8m | 66.77 | 55.24 | 59.46 | 43.46 | 24.70 |
| YOLOv10m | 65.03 | 54.00 | 58.01 | 42.14 | 15.80 |
| YOLOv11m | 67.17 | 54.51 | 58.84 | 42.87 | 19.20 |
| YOLOv12m | 67.03 | 54.68 | 58.78 | 43.08 | 19.30 |
| YOLOv13l | 69.55 | 55.83 | 60.39 | 44.38 | 26.34 |
| 本文算法 | 71.35 | 61.62 | 66.69 | 49.90 | 19.50 |
表1 在COCO数据集上不同算法的实验结果对比
Tab. 1 Comparison of experimental results of different algorithms on COCO dataset
| 算法 | P/% | R/% | mAP0.5/% | mAP0.5:0.95/% | S/MB |
|---|---|---|---|---|---|
| RT-DETR | 69.30 | 58.06 | 61.83 | 44.19 | 31.44 |
| YOLOv3 | 52.34 | 38.81 | 40.37 | 27.61 | 31.20 |
| YOLOv5m | 66.11 | 52.80 | 57.58 | 41.46 | 23.90 |
| YOLOv8m | 66.77 | 55.24 | 59.46 | 43.46 | 24.70 |
| YOLOv10m | 65.03 | 54.00 | 58.01 | 42.14 | 15.80 |
| YOLOv11m | 67.17 | 54.51 | 58.84 | 42.87 | 19.20 |
| YOLOv12m | 67.03 | 54.68 | 58.78 | 43.08 | 19.30 |
| YOLOv13l | 69.55 | 55.83 | 60.39 | 44.38 | 26.34 |
| 本文算法 | 71.35 | 61.62 | 66.69 | 49.90 | 19.50 |
| C3K2 | SPPF | C2PSA | P/% | R/% | mAP0.5/% | mAP0.5:0.95/% | S/MB |
|---|---|---|---|---|---|---|---|
| 67.17 | 54.51 | 58.84 | 42.87 | 19.20 | |||
| √ | 70.67 | 61.16 | 66.61 | 49.92 | 19.45 | ||
| √ | √ | 70.68 | 61.16 | 66.61 | 49.92 | 19.46 | |
| √ | √ | √ | 71.35 | 61.62 | 66.69 | 49.90 | 19.50 |
表2 消融实验
Tab. 2 Ablation experiments
| C3K2 | SPPF | C2PSA | P/% | R/% | mAP0.5/% | mAP0.5:0.95/% | S/MB |
|---|---|---|---|---|---|---|---|
| 67.17 | 54.51 | 58.84 | 42.87 | 19.20 | |||
| √ | 70.67 | 61.16 | 66.61 | 49.92 | 19.45 | ||
| √ | √ | 70.68 | 61.16 | 66.61 | 49.92 | 19.46 | |
| √ | √ | √ | 71.35 | 61.62 | 66.69 | 49.90 | 19.50 |
| 算法 | 训练参数量/MB | P/% | R/% | mAP0.5/% | mAP0.5:0.95/% |
|---|---|---|---|---|---|
| FPFT | 19.18 | 67.17 | 54.51 | 58.84 | 42.87 |
| LoRA | 0.65 | 71.51 | 60.94 | 66.34 | 49.86 |
| DoRA | 0.73 | 68.79 | 58.61 | 63.02 | 46.56 |
| PiSSA | 0.30 | 71.35 | 61.62 | 66.69 | 49.90 |
表3 本文算法在不同PEFT算法下的实验结果对比
Tab. 3 Comparison of experimental results of proposed algorithm under different PEFT algorithms
| 算法 | 训练参数量/MB | P/% | R/% | mAP0.5/% | mAP0.5:0.95/% |
|---|---|---|---|---|---|
| FPFT | 19.18 | 67.17 | 54.51 | 58.84 | 42.87 |
| LoRA | 0.65 | 71.51 | 60.94 | 66.34 | 49.86 |
| DoRA | 0.73 | 68.79 | 58.61 | 63.02 | 46.56 |
| PiSSA | 0.30 | 71.35 | 61.62 | 66.69 | 49.90 |
| 秩 | PEFT | P/% | R/% | mAP0.5/% | mAP0.5:0.95/% |
|---|---|---|---|---|---|
| 4 | LoRA | 71.51 | 60.94 | 66.34 | 49.86 |
| DoRA | 68.79 | 58.61 | 63.02 | 46.56 | |
| PiSSA | 71.35 | 61.62 | 66.69 | 49.90 | |
| 8 | LoRA | 70.38 | 60.71 | 66.31 | 49.96 |
| DoRA | 68.41 | 58.30 | 62.42 | 46.03 | |
| PiSSA | 70.38 | 61.26 | 66.47 | 49.94 | |
| 16 | LoRA | 70.58 | 61.20 | 66.38 | 50.01 |
| DoRA | 68.38 | 57.68 | 61.56 | 45.13 | |
| PiSSA | 69.77 | 60.70 | 65.61 | 49.27 |
表4 本文算法在不同秩下PEFT算法的实验结果对比
Tab. 4 Comparison of experimental results of proposed algorithm under PEFT algorithms at different ranks
| 秩 | PEFT | P/% | R/% | mAP0.5/% | mAP0.5:0.95/% |
|---|---|---|---|---|---|
| 4 | LoRA | 71.51 | 60.94 | 66.34 | 49.86 |
| DoRA | 68.79 | 58.61 | 63.02 | 46.56 | |
| PiSSA | 71.35 | 61.62 | 66.69 | 49.90 | |
| 8 | LoRA | 70.38 | 60.71 | 66.31 | 49.96 |
| DoRA | 68.41 | 58.30 | 62.42 | 46.03 | |
| PiSSA | 70.38 | 61.26 | 66.47 | 49.94 | |
| 16 | LoRA | 70.58 | 61.20 | 66.38 | 50.01 |
| DoRA | 68.38 | 57.68 | 61.56 | 45.13 | |
| PiSSA | 69.77 | 60.70 | 65.61 | 49.27 |
| [1] | GIRSHICK R, DONAHUE J, DARRELL T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation[C]// Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2014: 580-587. |
| [2] | GIRSHICK R. Fast R-CNN[C]// Proceedings of the 2015 IEEE International Conference on Computer Vision. Piscataway: IEEE, 2015: 1440-1448. |
| [3] | REN S, HE K, GIRSHICK R, et al. Faster R-CNN: towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(6): 1137-1149. |
| [4] | 李鸿天,史鑫昊,潘卫国,等. 融合多尺度和注意力机制的小样本目标检测[J]. 计算机应用, 2024, 44(5): 1437-1444. |
| LI H T, SHI X H, PAN W G, et al. Few-shot object detection via fusing multi-scale and attention mechanism[J]. Journal of Computer Applications, 2024, 44(5): 1437-1444. | |
| [5] | REDMON J, DIVVALA S, GIRSHICK R, et al. You only look once: unified, real-time object detection[C]// Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2016: 779-788. |
| [6] | LIU W, ANGUELOV D, ERHAN D, et al. SSD: single shot multiBox detector[C]// Proceedings of the 2016 European Conference on Computer Vision, LNCS 9905. Cham: Springer, 2016: 21-37. |
| [7] | ZHAO Y, LV W, XU S, et al. DETRS beat YOLOs on real-time object detection[C]// Proceedings of the 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2024: 16965-16974. |
| [8] | 郑华伟,王飞,高建邦. DES-YOLO:一种更精确的目标检测方法[J]. 光电工程, 2024, 51(11): No.240212. |
| ZHENG H W, WANG F, GAO J B. DES-YOLO: a more accurate object detection method[J]. Opto-Electronic Engineering, 2024, 51(11): No.240212. | |
| [9] | 刘皓皎,刘力双,张明淳. 基于YOLOv5改进的红外目标检测算法[J]. 激光技术, 2024, 48(4): 534-541. |
| LIU H J, LIU L S, ZHANG M C. An improved infrared object detection algorithm based on YOLOv5[J]. Laser Technology, 2024, 48(4): 534-541. | |
| [10] | 肖振久,张杰浩,林渤翰. 特征协同与细粒度感知的遥感图像小目标检测[J]. 光电工程, 2024, 51(6): No.240066. |
| XIAO Z J, ZHANG J H, LIN B H. Feature coordination and fine-grained perception of small targets in remote sensing images[J]. Opto-Electronic Engineering, 2024, 51(6): No.240066. | |
| [11] | PU G, JAIN A, YIN J, et al. Empirical analysis of the strengths and weaknesses of PEFT techniques for LLMs[EB/OL]. [2025-01-08].. |
| [12] | KAUR R, SINGH S. A comprehensive review of object detection with deep learning[J]. Digital Signal Processing, 2023, 132: No.103812. |
| [13] | HAN X, PU N, FENG Z, et al. Benchmarking multi-scene fire and smoke detection[C]// Proceedings of the 2024 Chinese Conference on Pattern Recognition and Computer Vision, LNCS 15041. Singapore: Springer, 2025: 203-218. |
| [14] | LI P, GUO T, WANG B, et al. Grid-Attention: enhancing computational efficiency of large vision models without fine-tuning[C]// Proceedings of the 2024 European Conference on Computer Vision, LNCS 15108. Cham: Springer, 2025: 54-70. |
| [15] | YE T, ZHENG Z, LI X, et al. An efficient few-shot object detection method for railway intrusion via fine-tune approach and contrastive learning[J]. IEEE Transactions on Instrumentation and Measurement, 2023, 72: No.5018713. |
| [16] | CHENG T, SONG L, GE Y, et al. YOLO-World: real-time open-vocabulary object detection[C]// Proceedings of the 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2024: 16901-16911. |
| [17] | MA B, XU W. Efficient fine tuning for fashion object detection[J]. Sensors, 2023, 23(13): No.6083. |
| [18] | HUANG X, HE B, TONG M, et al. Few-shot object detection on remote sensing images via shared attention module and balanced fine-tuning strategy[J]. Remote Sensing, 2021, 13(19): No.3816. |
| [19] | LI M, WU J, WANG X, et al. AlignDet: aligning pre-training and fine-tuning in object detection[C]// Proceedings of the 2023 IEEE/CVF International Conference on Computer Vision. Piscataway: IEEE, 2023: 6843-6853. |
| [20] | KHANAM R, HUSSAIN M. YOLOv11: an overview of the key architectural enhancements[EB/OL]. [2025-01-08].. |
| [21] | HU E J, SHEN Y, WALLIS P, et al. LoRA: low-rank adaptation of large language models[EB/OL]. [2025-04-02].. |
| [22] | LIU S Y, WANG C Y, YIN H, et al. DoRA: weight-decomposed low-rank adaptation[C]// Proceedings of the 41st International Conference on Machine Learning. New York: JMLR.org, 2024: 32100-32121. |
| [23] | MENG F, WANG Z, ZHANG M. PiSSA: principal singular values and singular vectors adaptation of large language models[C]// Proceedings of the 38th International Conference on Neural Information Processing Systems. Red Hook: Curran Associates Inc., 2024: 121038-121072. |
| [24] | 张溢文,蔡满春,陈咏豪,等. 融合多种参数高效微调技术的深度伪造检测方法[J]. 计算机科学与探索, 2024, 18(12): 3335-3347. |
| ZHANG Y W, CAI M C, CHEN Y H, et al. Deepfake detection method integrating multiple parameter-efficient fine-tuning techniques[J]. Journal of Frontiers of Computer Science and Technology, 2024, 18(12): 3335-3347. | |
| [25] | SREENIVASAN K, SOHN J, YANG L, et al. Rare gems: finding lottery tickets at initialization[C]// Proceedings of the 36th International Conference on Neural Information Processing Systems. Red Hook: Curran Associates Inc., 2022: 14529-14540. |
| [26] | ZHONG Z, TANG Z, HE T, et al. Convolution meets LoRA: parameter efficient finetuning for segment anything model[EB/OL]. [2025-04-18].. |
| [27] | 朱琦,周德强,盛卫锋,等. 基于DSCS-YOLO的苹果表面缺陷检测方法[J]. 南京农业大学学报, 2024, 47(3): 592-601. |
| ZHU Q, ZHOU D Q, SHENG W F, et al. Apple surface defect detection method based on DSCS-YOLO[J]. Journal of Nanjing Agricultural University, 2024, 47(3): 592-601. | |
| [28] | WANG C Y, MARK LIAO H Y, WU Y H, et al. CSPNet: a new backbone that can enhance learning capability of CNN[C]// Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops. Piscataway: IEEE, 2020: 1571-1580. |
| [29] | JOOSHIN H K, NANGIR M, SEYEDARABI H. Inception-YOLO: computational cost and accuracy improvement of the YOLOv5 model based on employing modified CSP, SPPF, and inception modules[J]. IET Image Processing, 2024, 18(8): 1985-1999. |
| [30] | LIU S, QI L, QIN H, et al. Path aggregation network for instance segmentation[C]// Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2018: 8759-8768. |
| [31] | HOSNA A, MERRY E, GYALMO J, et al. Transfer learning: a friendly introduction[J]. Journal of Big Data, 2022, 9: No.102. |
| [32] | BROWN T B, MANN B, RYDER N, et al. Language models are few-shot learners[C]// Proceedings of the 34th International Conference on Neural Information Processing Systems. Red Hook: Curran Associates Inc., 2020: 1877-1901. |
| [33] | KAPLAN J, McCANDLISH S, HENIGHAN T, et al. Scaling laws for neural language models[EB/OL]. [2025-02-22].. |
| [34] | HAN Z, GAO C, LIU J, et al. Parameter-efficient fine-tuning for large models: a comprehensive survey[EB/OL]. [2025-02-22].. |
| [35] | LIN Z, MADOTTO A, FUNG P. Exploring versatile generative language model via parameter-efficient transfer learning[C]// Findings of the Association for Computational Linguistics: EMNLP 2020. Stroudsburg: ACL, 2020: 441-459. |
| [36] | LIN T Y, MAIRE M, BELONGIE S, et al. Microsoft COCO: common objects in context[C]// Proceedings of the 2014 European Conference on Computer Vision, LNCS 8693. Cham: Springer, 2014: 740-755. |
| [1] | 张纾豪, 何坤金, 徐佳晨, 沙河山, 陈正鸣. 融合透视校正与轻量注意力机制的轮毂缺陷检测方法[J]. 《计算机应用》唯一官方网站, 2026, 46(6): 2007-2015. |
| [2] | 尹秋燕, 丁婧, 聂志刚. 无人机航拍视角下的人体姿态估计算法YOLO-AirPose[J]. 《计算机应用》唯一官方网站, 2026, 46(6): 1989-1997. |
| [3] | 熊珍凯, 徐梦军, 孙胤胤, 王鑫. 基于改进YOLOv8的复杂天气环境下海面船舶检测算法[J]. 《计算机应用》唯一官方网站, 2026, 46(6): 1998-2006. |
| [4] | 何丽丽, 曹勐, 张磊, 潘洪军, 刘义, 孙成心. 基于Kolmogorov-Arnold网络与扩散Transformer的手语生成模型[J]. 《计算机应用》唯一官方网站, 2026, 46(6): 1801-1810. |
| [5] | 盛兴, 翁孙贤, 陈扩松, 王忠平, 任芮锋, 刘勇. 基于深度学习的电网企业专利价值评估[J]. 《计算机应用》唯一官方网站, 2026, 46(5): 1468-1474. |
| [6] | 吴闵奇, 杨元华, 李航, 胡雅琴, 汤智豪, 梅腾. 基于图Transformer和RT-DETR的轻量化水下小目标检测[J]. 《计算机应用》唯一官方网站, 2026, 46(5): 1586-1595. |
| [7] | 张红瑞, 冯威铭, 杨潞霞, 马永杰. 基于YOLO11改进的水下小目标检测算法CSAF-YOLO[J]. 《计算机应用》唯一官方网站, 2026, 46(5): 1578-1585. |
| [8] | 刘馨瑶, 梁军, 龙嘉濠, 颜仁梁. 基于特征融合和通道信息补偿的中草药细粒度图像分类[J]. 《计算机应用》唯一官方网站, 2026, 46(5): 1677-1683. |
| [9] | 郭慧洁, 窦天凤, 张振琳, 亓开元, 吴栋, 曲志坚, 李钊, 任崇广. 基于时间依赖建模的动态贝叶斯网络交通预测[J]. 《计算机应用》唯一官方网站, 2026, 46(5): 1507-1517. |
| [10] | 张永兵, 闫丽蓉, 唐晓芬. 渐进式双阶段模态交互的单域泛化目标检测[J]. 《计算机应用》唯一官方网站, 2026, 46(4): 1264-1274. |
| [11] | 陈浩轩, 叶培昌, 刘磊, 刘承明, 胡文华. 自动代码编辑推荐综述[J]. 《计算机应用》唯一官方网站, 2026, 46(4): 1227-1237. |
| [12] | 陈鹏, 李旭, 余肖生. RGB-D双流镜像伪装目标检测网络[J]. 《计算机应用》唯一官方网站, 2026, 46(4): 1253-1263. |
| [13] | 郭阳, 王海亮, 高需, 王海涛, 王翌博. BEV三维目标检测算法体系综述[J]. 《计算机应用》唯一官方网站, 2026, 46(4): 1238-1252. |
| [14] | 何帅, 邓春华. 基于YOLO-World的少样本学习目标检测算法[J]. 《计算机应用》唯一官方网站, 2026, 46(4): 1275-1282. |
| [15] | 陈敏, 秦小林, 李绍涵, 杨昊, 李韬弘. 深度学习应用于强对流天气预测的综述[J]. 《计算机应用》唯一官方网站, 2026, 46(3): 980-992. |
| 阅读次数 | ||||||
|
全文 |
|
|||||
|
摘要 |
|
|||||