Journal of Computer Applications ›› 2025, Vol. 45 ›› Issue (2): 633-639.DOI: 10.11772/j.issn.1001-9081.2024020252

• Multimedia computing and computer simulation • Previous Articles    

Object detection in remote sensing image based on multi-scale feature fusion and weighted boxes fusion

Zhongwei ZHANG1, Jun WANG1, Shudong LIU1(), Zhiheng WANG2   

  1. 1.School of Computer and Information Engineering,Tianjin Chengjian University,Tianjin 300384,China
    2.School of Geology and Geomatics,Tianjin Chengjian University,Tianjin 300384,China
  • Received:2024-03-11 Revised:2024-04-23 Accepted:2024-04-25 Online:2024-06-04 Published:2025-02-10
  • Contact: Shudong LIU
  • About author:ZHANG Zhongwei, born in 1986, Ph. D., lecturer. Her research interests include deep learning, image processing, pattern recognition.
    WANG Jun, born in 1996, M. S. candidate. Her research interests include image processing.
    WANG Zhiheng, born in 1983, Ph. D., associate professor. His research interests include geographic information modeling and its application in disaster prevention and reduction.
  • Supported by:
    National Natural Science Foundation of China(41971310)

多尺度特征融合与加权框融合的遥感图像目标检测

张众维1, 王俊1, 刘树东1(), 王志恒2   

  1. 1.天津城建大学 计算机与信息工程学院,天津 300384
    2.天津城建大学 地质与测绘学院,天津 300384
  • 通讯作者: 刘树东
  • 作者简介:张众维(1986—),女,黑龙江齐齐哈尔人,讲师,博士,主要研究方向:深度学习、图像处理、模式识别
    王俊(1996—),女,山东青岛人,硕士研究生,主要研究方向:图像处理
    王志恒(1983—),男,山西阳泉人,副教授,博士,主要研究方向:地理信息建模技术及其在防灾减灾中的应用。
  • 基金资助:
    国家自然科学基金资助项目(41971310)

Abstract:

Significant differences in object scale and aspect ratio in remote sensing images lead to difficult object detection in remote sensing images. Aiming at this characteristic of remote sensing image, in order to improve the precision of object detection in remote sensing images, EW-YOLO (Efficient Weighted-YOLO) was proposed by improving the YOLO framework. Firstly, the multi-level feature fusion structure was introduced in the feature fusion section, so that the dual-branch residual module was utilized to promote the fusion of features at different scales. And by the cascade of feature fusion modules and the cross-layer feature fusion design, the extraction capability of objects at different scales was improved, and the detection capability was further enhanced. Secondly, in the prediction section, the weighted detection head was proposed and Weighted Boxes Fusion (WBF) was introduced, so as to improve the detection precision of objects with different aspect ratios by weighting each candidate box using the confidence scores and generating prediction boxes by fusion. Finally, to address the issue of too large image size, an image resampling technique was proposed, which means that the images were sampled to appropriate sizes and joined into network training, solving the problem of low detection precision of large-size objects caused by cropping. Experimental results on DOTA dataset show that the detection mean Average Precision (mAP) of the proposed method is 77.47%, which is increased by 1.55 percentage points compared to that of the original YOLO framework based method. And compared with the current mainstream methods, the proposed method has superior performance. At the same time, the proposed method’s effectiveness is also verified on HRSC and UCAS-AOD datasets.

Key words: remote sensing image, object detection, deep learning, multi-scale feature fusion, Weighted Boxes Fusion (WBF)

摘要:

遥感图像中目标尺度变化大且目标长宽比差异大,导致遥感图像目标检测困难。针对遥感图像的这一特点,通过改进YOLO框架,提出EW-YOLO(Efficient Weighted-YOLO)提高遥感图像目标检测的精度。首先,在特征融合部分,设计多级特征融合结构,以利用双分支的残差模块促进不同尺度特征的融合,并通过融合模块的级联以及跨层特征的融合设计,增强对不同尺度目标的提取能力,并进一步增强检测能力;其次,在预测部分,提出加权检测头,引入加权检测框融合(WBF),以利用置信度分数对每个候选框进行加权,并融合生成预测框,从而提高不同长宽比目标的检测精度;最后,针对图像尺寸过大的问题,提出图像重采样处理方法,即通过将图像采样至合适大小并参与网络训练,解决由于切割造成的大尺寸目标检测精度较低的问题。在DOTA数据集上进行的实验的结果表明,所提方法的检测平均精度均值(mAP)达到了77.47%,较基于原始YOLO框架的方法提升了1.55个百分点,且优于目前的主流方法。同时,也在HRSC和UCAS-AOD数据集上验证了所提方法的有效性。

关键词: 遥感图像, 目标检测, 深度学习, 多尺度特征融合, 加权检测框融合

CLC Number: