Journal of Computer Applications ›› 2026, Vol. 46 ›› Issue (6): 2007-2015.DOI: 10.11772/j.issn.1001-9081.2025050666

• Multimedia computing and computer simulation • Previous Articles    

Wheel hub defect detection method based on perspective correction and lightweight attention mechanism

Shuhao ZHANG1,2, Kunjin HE1,2(), Jiachen XU2, Heshan SHA2, Zhengming CHEN1,2   

  1. 1.Key Laboratory of Maritime Intelligent Cyberspace Technology of Ministry of Education (Hohai University),Changzhou Jiangsu 213200,China
    2.College of Information Science and Engineering,Hohai University,Changzhou Jiangsu 213200,China
  • Received:2025-06-23 Revised:2025-07-21 Accepted:2025-07-23 Online:2025-08-01 Published:2026-06-10
  • Contact: Kunjin HE
  • About author:ZHANG Shuhao, born in 2002, M. S. candidate. His research interests include deep learning, object detection.
    XU Jiachen, born in 2001, M. S. candidate. His research interests include image processing, feature matching.
    SHA Heshan, born in 2003, M. S. candidate. His research interests include image processing, hardware-software co-design.
    CHEN Zhengming, born in 1965, Ph. D., professor. His research interests include computer-aided design, computer graphics.
    First author contact:HE Kunjin, born in 1974, Ph. D., professor. His research interests include computer-aided design, computer graphics.
  • Supported by:
    National Key Research and Development Program of China(2020YFB1708900);Fundamental Research Funds for the Central Universities(B240203012)

融合透视校正与轻量注意力机制的轮毂缺陷检测方法

张纾豪1,2, 何坤金1,2(), 徐佳晨2, 沙河山2, 陈正鸣1,2   

  1. 1.海上智能网信技术教育部重点实验室(河海大学),江苏 常州 213200
    2.河海大学 信息科学与工程学院,江苏 常州 213200
  • 通讯作者: 何坤金
  • 作者简介:张纾豪(2002—),男,云南昆明人,硕士研究生,主要研究方向:深度学习、目标检测
    徐佳晨(2001—),男,江苏苏州人,硕士研究生,主要研究方向:图像处理、特征匹配
    沙河山(2003—),男,河北邢台人,硕士研究生,主要研究方向:图像处理、软硬件协同设计
    陈正鸣(1965—),男,浙江东阳人,教授,博士,CCF高级会员,主要研究方向:计算机辅助设计、计算机图形学。
    第一联系人:何坤金(1974—),男,安徽芜湖人,教授,博士,CCF高级会员,主要研究方向:计算机辅助设计、计算机图形学
  • 基金资助:
    国家重点研发计划项目(2020YFB1708900);中央高校基本科研业务费专项(B240203012)

Abstract:

In wheel hub surface defect detection tasks in industrial visual inspection, geometric distortion caused by shooting angle deviation as well as the small scale and complex morphology of defect objects limit performance of the existing detection methods. To address these challenges, a defect detection method that combines perspective correction and lightweight attention mechanism was proposed. Firstly, the offset relationship between the ellipse center and the geometric center of the wheel hub was utilized to construct a perspective transformation quadrilateral for solving the homography matrix, so as to complete image perspective correction, thereby mitigating the impact of distortion on subsequent feature extraction. Secondly, based on YOLOv11 model, the conventional CBS (Convolution-BatchNorm-SiLU(Sigmoid Linear Unit)) modules in the backbone and neck were replaced by lightweight Ghost convolutions to reduce the number of parameters and the computational cost of the model. Meanwhile, an Efficient Channel Attention (ECA) mechanism was introduced to enhance the network’s perceptual ability to tiny defect regions, leading to the construction of the improved model YOLOv11n-GAConv. Experimental results on a self-built wheel hub defect dataset show that the mean Average Precision of the proposed model when the intersection over union threshold between the predicted bounding boxes and the ground truth bounding boxes is set to 0.5 (mAP@0.5) of 84.7%, with an improvement of 2.4 percentage points compared to that of YOLOv11n, and achieves a recall of 79.5%, with an improvement is 8.6 percentage points compared to that of YOLOv11n. At the same time, the number of parameters and computational cost of the proposed model are reduced by 12.4% and 11.1%, respectively, compared to those of YOLOv11n. It can be seen that the proposed method reduces the model complexity while achieving improved detection precision.

Key words: YOLOv11, wheel hub defect detection, perspective correction, lightweight structure, Efficient Channel Attention (ECA) mechanism

摘要:

工业视觉检测下的轮毂表面缺陷检测任务中,拍摄视角偏差引发的几何畸变,以及缺陷目标尺度小、形态复杂等因素会导致现有检测方法的性能受限。针对上述问题,提出一种融合透视校正与轻量注意力机制的缺陷检测方法。首先,利用椭圆的圆心与轮毂几何中心偏移的关系,构建透视变换四边形,以求解单应性矩阵完成图像视角校正,从而消除畸变对后续特征提取的影响;其次,在YOLOv11模型的基础上,采用轻量化的Ghost卷积替换主干网络与颈部结构中的传统CBS (Convolution-BatchNorm-SiLU(Sigmoid Linear Unit))模块,从而降低模型的参数量与计算量;同时,引入高效通道注意力(ECA)机制,增强网络对微小缺陷区域的感知能力,构建改进模型YOLOv11n-GAConv。实验结果表明,在自建轮毂缺陷数据集上,所提模型预测框与真实框的交并比阈值设为0.5时的平均精度均值(mAP@0.5)达到84.7%,相较于YOLOv11n提升了2.4个百分点,召回率达到79.5%,相较于YOLOv11n 提升了8.6个百分点,而模型的参数量与计算量分别下降12.4%与11.1%。可见,所提方法在提升检测精度的同时降低了模型复杂度。

关键词: YOLOv11, 轮毂缺陷检测, 透视校正, 轻量化结构, 高效通道注意力机制

CLC Number: