Journal of Computer Applications ›› 2025, Vol. 45 ›› Issue (10): 3342-3352.DOI: 10.11772/j.issn.1001-9081.2024101511

• Frontier and comprehensive applications • Previous Articles    

Pavement defect detection algorithm with enhanced morphological perception

Jiahui ZHANG, Xiaoming LI(), Jiaxiang ZHANG   

  1. College of Computer Science and Technology,Taiyuan University of Science and Technology,Taiyuan Shanxi 030024,China
  • Received:2024-10-24 Revised:2025-01-07 Accepted:2025-01-10 Online:2025-01-15 Published:2025-10-10
  • Contact: Xiaoming LI
  • About author:ZHANG Jiahui, born in 1999, M. S. candidate. Her research interests include computer vision, object detection.
    LI Xiaoming, born in 1965, Ph. D., professor. His research interests include computer vision, image processing and analysis.
    ZHANG Jiaxiang, born in 1999, M. S. candidate. His research interests include computer vision, object detection.
  • Supported by:
    National Natural Science Foundation of China(62273248)

强化形态感知的路面缺陷检测算法

张佳慧, 李晓明(), 张嘉祥   

  1. 太原科技大学 计算机科学与技术学院,太原 030024
  • 通讯作者: 李晓明
  • 作者简介:张佳慧(1999—),女,山西榆社人,硕士研究生,CCF会员,主要研究方向:计算机视觉、目标检测
    李晓明(1965—),男,山西太原人,教授,博士,CCF会员,主要研究方向:计算机视觉、图像处理与分析 Email:lixiaoming@tyust.edu.cn
    张嘉祥(1999—),男,山西文水人,硕士研究生,CCF会员,主要研究方向:计算机视觉、目标检测。
  • 基金资助:
    国家自然科学基金资助项目(62273248)

Abstract:

To address the problems of low detection accuracy and high missed detection rate caused by the narrow lateral, multi-scale, and long-range dependency characteristics of pavement defect morphology, a pavement defect detection algorithm improved by YOLOv8_n with enhanced morphological perception was proposed. Firstly, an Edge-Enhancement Focus Module (EEFM) was introduced in the backbone fusion stage, a strip pooling kernel was used to capture directional and position-aware information, thereby enhancing edge details in deep features and improving representation ability of elongated features. Secondly, a Dual Chain Feature Redistribution Pyramid Network (DCFRPN) was designed to reconstruct the fusion method, so as to provide multi-scale features with extensive perception and rich localization information, thereby improving fusion ability for multi-scale defects. Additionally, a Morphological Aware Task Interaction Detection Head (MATIDH) was constructed to enhance task interaction between classification and localization, thereby adjusting data representation dynamically and integrating multi-scale strip convolutions to optimize the classification and regression of elongated defects. Finally, a PWIoU (Penalized Weighted Intersection over Union) loss function was proposed to allocate gradient gains dynamically for prediction boxes of different qualities, thereby optimizing the regression of bounding boxes. Experimental results show that on the RDD2022 dataset, compared to YOLOv8_n, the proposed algorithm has the precision and recall improved by 3.5 and 2.3 percentage points, respectively, and the mean Average Precision (mAP) at 50% Intersection over Union (IoU) increased by 3.2 percentage points, verifying the effectiveness of the proposed algorithm.

Key words: YOLOv8_n, pavement defect detection, long-range dependency, crack, multi-scale

摘要:

针对路面缺陷形态侧向狭窄、尺度多变和长程依赖特性导致检测精度低和漏检率高的问题,提出基于YOLOv8_n改进的强化形态感知的路面缺陷检测算法。首先,在主干网络融合阶段提出边缘增益聚焦模块(EEFM),采用条状池化核捕捉定向和位置感知信息并强化深层特征的边缘细节,增强细长特征的表达能力。其次,设计双链特征重分配金字塔网络(DCFRPN),重构融合方式,提供大范围感知和丰富定位信息的多尺度特征,提升对多尺度缺陷的融合能力。此外,构造形态感知任务交互检测头(MATIDH),增强分类与定位之间的任务交互,动态调整数据表征,融合多尺度带状卷积,优化细长缺陷的分类和回归。最后,提出PWIoU(Penalized Weighted Intersection over Union)损失函数,动态分配不同质量预测框的梯度增益,优化Box框的回归方式。实验结果表明,在RDD2022数据集上,所提算法的精确率和召回率相较于YOLOv8_n分别提升了3.5和2.3个百分点,在50%交并比(IoU)阈值下的平均精度均值(mAP)提升了3.2个百分点,验证了所提算法的有效性。

关键词: YOLOv8_n, 路面缺陷检测, 长程依赖关系, 裂缝, 多尺度

CLC Number: