Efficient fine-tuning algorithm for low-rank adaptive parameters based on YOLOv11

doi:10.11772/j.issn.1001-9081.2025060751

Abstract

Abstract: To address the limited generalization and robustness of deep learning algorithms, and the high computational cost of full-parameter fine-tuning (FPFT), in object detection tasks in complex scenarios, a low-rank adaptive parameter-efficient fine-tuning algorithm based on YOLOv11 (You Only Look Once version 11) was proposed. Firstly, a Low-rank Adaptation (LrAn) module was embedded into the backbone and neck networks of YOLOv11. Secondly, three low-rank decomposition algorithms were combined: Low-Rank Adaptation (LoRA), weight-Decomposed low-Rank Adaptation (DoRA), and Principal Singular values and Singular vectors Adaptation (PiSSA), so that parameters were updated efficiently through weight decomposition and dynamic adjustment mechanisms. Finally, during training, most of the pre-trained weights of the YOLOv11 network were kept frozen and only the low-rank matrices generated by the three low-rank decomposition algorithms in the LrAn module were trained, reducing the number of trainable parameters to 1.56% of that of the original model. Experiments on the COCO (Common Objects in Context) dataset show that, compared with the baseline YOLOv11 algorithm, the proposed algorithm improves precision, recall, and mean average precision by 4.18, 7.11, and 7.85 percentage points, respectively. The proposed algorithm thus provides an effective technical path for lightweight and efficient fine-tuning of large-scale detection algorithms in resource-constrained scenarios.

Key words: YOLOv11, parameter-efficient fine-tuning, object detection, low-rank adaptation, deep learning
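
The freezing scheme described in the abstract can be illustrated with a minimal NumPy sketch of a plain LoRA update on a single linear layer. The layer sizes, rank, and scaling factor below are hypothetical and are not taken from the paper, and the sketch does not reproduce the LrAn module or the reported 1.56% figure; it only shows why training two small matrices instead of the full weight shrinks the trainable parameter count.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes chosen for illustration only (not from the paper).
d_out, d_in, rank, alpha = 256, 512, 8, 16.0

W = rng.standard_normal((d_out, d_in))        # frozen pre-trained weight
A = rng.standard_normal((rank, d_in)) * 0.01  # trainable, small random init
B = np.zeros((d_out, rank))                   # trainable, zero init


def lora_forward(x):
    """Return x W^T + (alpha / rank) * x A^T B^T for a batch of row vectors."""
    base = x @ W.T              # frozen path
    update = (x @ A.T) @ B.T    # low-rank path, never forms the full d_out x d_in update
    return base + (alpha / rank) * update


x = rng.standard_normal((4, d_in))
y = lora_forward(x)

# With B initialised to zero the adapter contributes nothing at the start,
# so the adapted layer reproduces the frozen layer exactly.
same_as_base = np.allclose(y, x @ W.T)

trainable = A.size + B.size      # rank * (d_in + d_out)
fraction = trainable / W.size    # relative to the frozen weight
print(same_as_base, trainable, f"{fraction:.4%}")
```

For these assumed sizes, only `rank * (d_in + d_out)` values are trained against `d_in * d_out` frozen ones, so the fraction falls as the layer grows or the rank shrinks.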

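Abstract (Chinese edition): To address the limited generalization and robustness of deep learning algorithms and the high computational cost of full-parameter fine-tuning (FPFT) in object detection tasks in complex scenes, a low-rank adaptive parameter-efficient fine-tuning algorithm based on YOLOv11 (You Only Look Once version 11) is proposed. First, a low-rank adaptation (LrAn) module is embedded in the backbone and neck networks of YOLOv11. Second, three low-rank decomposition algorithms, namely Low-Rank Adaptation (LoRA), weight-Decomposed low-Rank Adaptation (DoRA), and Principal Singular values and Singular vectors Adaptation (PiSSA), are combined, and parameters are updated efficiently through weight decomposition and dynamic adjustment mechanisms. Finally, during training, the vast majority of the pre-trained weights of the YOLOv11 network are kept frozen and only the low-rank matrices generated by the three low-rank decomposition algorithms in the LrAn module are trained, reducing the trainable parameter count to 1.56% of that of the original algorithm. Experiments on the COCO (Common Objects in Context) dataset show that, compared with the baseline YOLOv11 algorithm, the proposed algorithm improves precision, recall, and mean average precision by 4.18, 7.11, and 7.85 percentage points, respectively. The proposed algorithm therefore provides an effective technical path for making large detection algorithms lightweight and fine-tuning them efficiently in resource-constrained scenarios.

Key words (Chinese edition): YOLOv11, parameter-efficient fine-tuning, object detection, low-rank adaptation, deep learning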
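
The abstract names DoRA and PiSSA alongside LoRA but does not say how the LrAn module combines them. As a hedged illustration, the sketch below shows only the commonly published formulation of each method on one weight matrix, using NumPy; the matrix sizes and rank are hypothetical, and nothing here should be read as the paper's implementation.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes; W stands in for one pre-trained weight matrix.
d_out, d_in, rank = 64, 96, 4
W = rng.standard_normal((d_out, d_in))

# --- PiSSA-style initialisation ---------------------------------------
# The adapter starts from the top-`rank` singular triplets of W, and the
# remainder of W is kept as a frozen residual. (PiSSA writes W = W_res + A B.)
U, S, Vt = np.linalg.svd(W, full_matrices=False)
sqrt_S = np.sqrt(S[:rank])
A_pissa = U[:, :rank] * sqrt_S          # (d_out, rank), trainable
B_pissa = sqrt_S[:, None] * Vt[:rank]   # (rank, d_in), trainable
W_res = W - A_pissa @ B_pissa           # frozen residual

# At initialisation the residual plus the adapter reproduces W.
pissa_ok = np.allclose(W_res + A_pissa @ B_pissa, W)

# --- DoRA-style reparameterisation ------------------------------------
# W is split into a per-column magnitude and a direction; the direction
# receives a LoRA-style update B @ A. (LoRA/DoRA write the update as B A.)
B = np.zeros((d_out, rank))                           # trainable, zero init
A = rng.standard_normal((rank, d_in)) * 0.01          # trainable
magnitude = np.linalg.norm(W, axis=0, keepdims=True)  # (1, d_in), trainable


def dora_weight(magnitude, W, B, A):
    """Return magnitude * (W + B A) / column-wise norm of (W + B A)."""
    direction = W + B @ A
    col_norm = np.linalg.norm(direction, axis=0, keepdims=True)
    return magnitude * direction / col_norm


# With B = 0 the reparameterised weight equals the pre-trained weight.
dora_ok = np.allclose(dora_weight(magnitude, W, B, A), W)
print(pissa_ok, dora_ok)
```

In both cases the adapted layer starts out equal to the pre-trained one; the methods differ in which quantities are frozen (the residual in PiSSA, the base weight in DoRA) and which are trained.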

CLC Number: TP391.4

杜艺, 续明进, 孔佳仪, 王力瑶, 赵晨. Efficient fine-tuning algorithm for low-rank adaptive parameters based on YOLOv11 [J]. Journal of Computer Applications, DOI: 10.11772/j.issn.1001-9081.2025060751.

