《计算机应用》唯一官方网站

• •    下一篇

强化形态感知的路面缺陷检测算法

张佳慧,李晓明*,张嘉祥   

  1. 太原科技大学 计算机科学与技术学院,太原 030024
  • 收稿日期:2024-10-23 修回日期:2025-01-07 接受日期:2025-01-10 发布日期:2025-01-15 出版日期:2025-01-15
  • 通讯作者: 李晓明
  • 基金资助:
    国家自然科学基金项目

Pavement defect detection algorithm with enhanced morphological perception

  • Received:2024-10-23 Revised:2025-01-07 Accepted:2025-01-10 Online:2025-01-15 Published:2025-01-15
  • Supported by:
    National Natural Science Foundation of China

摘要: 针对路面缺陷形态侧向狭窄、尺度多变和长程依赖特性导致检测精度低和漏检率高的问题,提出基于YOLOv8_n改进的强化形态感知的路面缺陷检测算法。首先,在主干网络融合阶段提出边缘增益聚焦模块(EEFM),采用条形池化核捕捉定向和位置感知信息并强化深层特征的边缘细节,增强细长特征的表达能力。其次,设计双链特征重分配金字塔网络(DCFRPN),重构融合方式,提供大范围感知和丰富定位信息的多尺度特征,提升对多尺度缺陷的融合能力。此外,构造形态感知任务交互检测头(MATIDH),增强分类与定位之间的任务交互,动态调整数据表征,融合多尺度带状卷积,优化细长缺陷的分类和回归。最后,提出PWIoU(Penalized Weighted Intersection over Union)损失函数,动态分配不同质量预测框的梯度增益,优化Box框的回归方式。实验结果表明,在RDD2022数据集上,所提算法相较于基线的精确率和召回率分别提升3.5和2.3个百分点,平均精度均值在50%交并比阈值下的值提升3.2个百分点,验证了所提算法的有效性。

关键词: YOLOv8_n, 路面缺陷检测, 长程依赖关系, 裂缝, 多尺度

Abstract: To address the problem of low detection accuracy and high missed detection rates caused by the narrow lateral, multi-scale, and long-range dependency characteristics of Pavement defects, a road defect detection algorithm based on the improved YOLOv8_n with enhanced morphological perception was proposed. First, an Edge-Enhancement Focus Module (EEFM) was introduced in the backbone network fusion stage. A strip pooling kernel was used to capture directional and position-aware information, enhancing edge details in deep features and improving the expression of elongated features. Second, a Dual Chain Feature Redistribution Pyramid Network (DCFRPN) was designed to reconstruct the fusion method, providing multi-scale features with extensive perceptual and rich localization information, improving the fusion ability for multi-scale defects. Additionally, a Morphological Aware Task Interaction Detection Head (MATIDH) was constructed to enhance task interaction between classification and localization, dynamically adjusting data representation and integrating multi-scale strip convolutions to optimize the classification and regression of elongated defects. Finally, a PWIoU (Penalized Weighted Intersection over Union) loss function was proposed to dynamically allocate gradient gains for prediction boxes of different qualities, optimizing the regression of bounding boxes. Experimental results showed that, on the RDD2022 dataset, compared to the baseline, precision and recall were improved by 3.5 and 2.3 percentage points, respectively, and mean average precision at 50% IoU increased by 3.2 percentage points, verifying the effectiveness of the algorithm. 

Key words: YOLOv8_n, pavement defect detection, long-range dependency, crack, multi-scale

中图分类号: