Journal of Computer Applications ›› 2026, Vol. 46 ›› Issue (6): 1738-1745.DOI: 10.11772/j.issn.1001-9081.2025060751

• Artificial intelligence • Previous Articles    

Low-rank adaptive parameter-efficient fine-tuning algorithm based on YOLOv11

Yi DU, Mingjin XU, Jiayi KONG(), Liyao WANG, Chen ZHAO   

  1. School of Mechanical and Electrical Engineering,Beijing Institute of Graphic Communication,Beijing 102600,China
  • Received:2025-07-09 Revised:2025-09-25 Accepted:2025-09-29 Online:2025-10-16 Published:2026-06-10
  • Contact: Jiayi KONG
  • About author:DU Yi, born in 2003, M. S. candidate. His research interests include computer vision, object detection.
    XU Mingjin, born in 1970, M. S., associate professor. His research interests include computer vision, pattern recognition.
    WANG Liyao, born in 2000, M. S. candidate. His research interests include machine learning, power electronics.
    ZHAO Chen, born in 2002, M. S. candidate. His research interests include intelligent control.
    First author contact:KONG Jiayi, born in 1995, Ph. D., lecturer. Her research interests include natural language processing.
  • Supported by:
    National Key Research and Development Program of China(2024YFF0617202);Beijing Key Laboratory Construction Project(KYCPT202508)

基于YOLOv11的低秩自适应参数高效微调算法

杜艺, 续明进, 孔佳仪(), 王力瑶, 赵晨   

  1. 北京印刷学院 机电工程学院,北京 102600
  • 通讯作者: 孔佳仪
  • 作者简介:杜艺(2003—),男,湖北咸宁人,硕士研究生,主要研究方向:计算机视觉、目标检测
    续明进(1970—),男,山东临沂人,副教授,硕士,主要研究方向:计算机视觉、模式识别
    王力瑶(2000—),男,河北秦皇岛人,硕士研究生,主要研究方向:机器学习、电力电子
    赵晨(2002—),男,甘肃兰州人,硕士研究生,主要研究方向:智能控制。
    第一联系人:孔佳仪(1995—),女,山东泰安人,讲师,博士,主要研究方向:自然语言处理
  • 基金资助:
    国家重点研发计划项目(2024YFF0617202);北京市重点实验室建设项目(KYCPT202508)

Abstract:

In view of the limitations of deep learning algorithms’ generalization and robustness, as well as the high computational cost of Full Parameter Fine-Tuning (FPFT) in object detection tasks in complex scenarios, a low-rank adaptive Parameter-Efficient Fine-Tuning (PEFT) algorithm based on YOLOv11 (You Only Look Once version 11) was proposed. Firstly, a Low-Rank Adaptation (LoRA) module was embedded into the backbone and neck networks of YOLOv11. Secondly, three low-rank decomposition algorithms, including LoRA, weight-Decomposed low-Rank Adaptation (DoRA) and Principal Singular values and Singular vectors Adaptation (PiSSA) were combined, and efficient parameter updates were achieved through weight decomposition and dynamic adjustment mechanisms. Finally, during the training process, most of the pre-trained weights of the YOLOv11 network were kept frozen, and only the low-rank matrices generated by the three low-rank decomposition algorithms in the LoRA module were trained, thereby reducing the trainable parameter size to 1.56% of the original algorithm. Experimental results on the COCO (Common Objects in COntext) dataset demonstrate that the proposed algorithm improves the precision, recall and mean Average Precision (mAP) at IoU (Intersection over Union) threshold of 0.5 by 4.18, 7.11 and 7.85 percentage points, respectively, compared with the baseline algorithm YOLOv11. It can be seen that the proposed algorithm provides an effective technical path for lightweight and efficient fine-tuning of large-scale detection algorithms in resource-constrained scenarios.

Key words: YOLOv11 (You Only Look Once version 11), Parameter?Efficient Fine?Tuning (PEFT), object detection, Low-Rank Adaptation (LoRA), deep learning

摘要:

针对复杂场景下目标检测任务中深度学习算法的泛化性和鲁棒性受限以及全参数微调(FPFT)计算成本高的问题,提出一种基于YOLOv11 (You Only Look Once version 11)的低秩自适应参数高效微调(PEFT)算法。首先,在YOLOv11的骨干和颈部网络中嵌入低秩自适应(LoRA)模块;其次,结合LoRA、权重分解低秩自适应(DoRA)和主奇异值与奇异向量自适应(PiSSA)这3种低秩分解算法,通过权重分解与动态调整机制实现参数的高效更新;最后,在训练过程中,将YOLOv11网络的绝大部分预训练权重保持冻结状态,仅对LoRA模块中由3种低秩分解算法生成的低秩矩阵进行训练,将可训练参数规模缩减至原算法的1.56%。COCO (Common Objects in COntext)数据集上的实验结果表明,所提算法相较于基线算法YOLOv11在精确度、召回率和交并比阈值为0.5时的平均精度均值(mAP)上分别提升了4.18、7.11和7.85个百分点。可见,所提算法为资源受限场景下的大型检测算法轻量化与高效微调提供了有效技术路径。

关键词: YOLOv11, 参数高效微调, 目标检测, 低秩自适应, 深度学习

CLC Number: