Journal of Computer Applications ›› 2026, Vol. 46 ›› Issue (5): 1624-1633.DOI: 10.11772/j.issn.1001-9081.2025050592

• Frontier and comprehensive applications • Previous Articles    

Defect detection algorithm for train bearing rollers based on FHC-DETR

Yuanhao HE, Jun ZHAO()   

  1. School of Mechanical Engineering,Lanzhou Jiaotong University,Lanzhou Gansu 730070,China
  • Received:2025-05-29 Revised:2025-08-17 Accepted:2025-08-29 Online:2025-09-16 Published:2026-05-10
  • Contact: Jun ZHAO
  • About author:HE Yuanhao, born in 2000, M. S. candidate. His research interests include computer vision,deep learning.
  • Supported by:
    National Natural Science Foundation of China(51868037)

基于FHC-DETR的列车轴承滚子缺陷检测算法

贺元昊, 赵军()   

  1. 兰州交通大学 机电工程学院,兰州 730070
  • 通讯作者: 赵军
  • 作者简介:贺元昊(2000—),男,宁夏中卫人,硕士研究生,CCF会员,主要研究方向:计算机视觉、深度学习
  • 基金资助:
    国家自然科学基金资助项目(51868037)

Abstract:

To address the issues such as low precision in small object recognition and large-scale variations in the detection of train bearing roller defects, a detection algorithm based on FHC-DETR(Fourier-fused High-low frequency interactive and Context-aware DEtection TRansformer) was proposed. Firstly, aiming at the problems of complex model computation and small object features being disturbed by noise, a frequency-domain feature extraction module C2f-FG (C2f with Fourier-Gated bottleneck) was designed for feature extraction. It synchronously acquired spatial-domain local features and frequency-domain global features via Fourier transform, and their fusion enhanced the accuracy of small-object detection while reducing computational complexity. Secondly, to tackle feature confusion caused by variations in defect scale, a high-low-frequency feature interaction module, HiLo (High-Low frequency), was introduced. The high-frequency branch focused on local defect textures, while the low-frequency branch captured overall semantics via global attention, thereby improving multi-scale adaptability. Subsequently, to resolve the issue of small object feature attenuation in feature fusion, a Context-aware Multi-scale Bidirectional Feature Pyramid Network (CM-BiFPN) was constructed. By dynamically perceiving context and strengthening cross-layer interaction, it reduced feature transmission loss and improved fusion efficiency. Finally, the EMASlideVarifocalLoss adaptive loss function was adopted to dynamically adjust classification thresholds and optimize weights of hard examples, further enhancing localization and category discrimination capabilities. Experimental results show that FHC-DETR achieves a mean Average Precision (mAP) of 91.5%, which is 2.3 percentage points higher than that of the original RT-DETR (Real-Time DETR). Additionally, its parameter count is reduced by 28.1%, its computational load is reduced by 23.0%, and its memory usage is reduced by 23.7%, demonstrating a balance between precision and efficiency and confirming its practicality in industrial scenarios.

Key words: defect detection, small object, RT-DETR (Real-Time DEtection TRansformer), Feature Pyramid Network (FPN), loss function

摘要:

针对列车轴承滚子缺陷检测中存在的小目标识别精度低和尺度变化大等问题,提出基于FHC-DETR(Fourier-fused High-low frequency interactive and Context-aware DEtection TRansformer)的检测算法。首先,针对模型计算复杂及小目标特征受噪声干扰的问题,设计频域特征提取模块C2f-FG(C2f with Fourier-Gated bottleneck)提取特征,通过傅里叶变换同步获取空间域局部特征与频域全局特征,二者融合可提高小目标检测精度并降低计算复杂度;其次,针对缺陷尺度变化导致的特征混淆,引入高低频特征交互模块HiLo(High-Low frequency),高频分支聚焦局部缺陷纹理,低频分支通过全局注意力捕捉整体语义,从而提升多尺度适应能力;再次,针对特征融合中存在的小目标特征衰减问题,构建上下文感知型多尺度双向特征金字塔网络(CM-BiFPN),通过动态感知上下文并强化跨层交互,减小特征传递损失,提升融合效率;最后,采用EMASlideVarifocalLoss自适应损失函数,动态调整分类阈值并优化难例权重,进一步提升定位与类别区分能力。实验结果显示,所提检测算法的平均精度均值(mAP)达91.5%,比实时检测变换器(RT-DETR)高2.3个百分点,且参数量减少28.1%,计算量降低23.0%,内存占用下降23.7%,实现了精度与效率的平衡,验证了它在工业场景的实用性。

关键词: 缺陷检测, 小目标, 实时检测变换器, 特征金字塔网络, 损失函数

CLC Number: