Remote sensing image target detection and identification based on deep learning
SHI Wenxu1,2, BAO Jiahui3, YAO Yu1,2
1. Chengdu Institute of Computer Application, Chinese Academy of Sciences, Chengdu Sichuan 610081, China; 2. University of Chinese Academy of Sciences, Beijing 100049, China; 3. Glasgow College, University of Electronic Science and Technology of China, Chengdu Sichuan 611731, China
Abstract:In order to improve the precision and speed of existing remote sensing image target detection algorithms in small-scale target detection, a remote sensing image target detection and identification algorithm based on deep learning was proposed. Firstly, a dataset of remote sensing images with different scales was constructed for model training and testing. Secondly, based on the original Single Shot multibox Detector (SSD) network model, the shallow feature fusion module, shallow feature enhancement module and deep feature enhancement module were designed and fused. Finally, the focal loss function was introduced into the training strategy to solve the problem of the imbalance of positive and negative samples in the training process, and the experiment was carried out on the remote sensing image dataset. Experimental results on high-resolution remote sensing image dataset show that the detection mean Average Precision (mAP) of the proposed algorithm achieves 77.95%, which is 3.99 percentage points higher than that of SSD network model, and has the detection speed of 33.8 frame/s. In the extended experiment, the performance of the proposed algorithm is better than that of SSD network model for the detection of fuzzy targets in high-resolution remote sensing images. Experimental results show that the proposed algorithm can effectively improve the precision of remote sensing image target detection.
史文旭, 鲍佳慧, 姚宇. 基于深度学习的遥感图像目标检测与识别[J]. 计算机应用, 2020, 40(12): 3558-3562.
SHI Wenxu, BAO Jiahui, YAO Yu. Remote sensing image target detection and identification based on deep learning. Journal of Computer Applications, 2020, 40(12): 3558-3562.
[1] 王港, 陈金勇, 高峰, 等. 基于深度学习的遥感影像基础设施目标检测研究[J]. 无线电工程, 2018, 48(3):219-224.(WANG G, CHEN J Y,GAO F,et al. Research on the infrastructure target detection of remote sensing image based on deep learning[J]. Radio Engineering,2018,48(3):219-224.) [2] MARINO A,SUGIMOTO M,OUCHI K. et al. Validating a notch filter for detection of targets at sea with ALOS-PALSAR data:Tokyo Bay[J]. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing,2014,7(12):74907-74918. [3] PELICH R, LONGÉPÉ N, MERCIER G, et al. AIS-based evaluation of target detectors and SAR sensors characteristics for maritime surveillance[J]. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing,2015,8(8):3892-3901. [4] KRIZHEVSKY A, SUTSKEVER I, HINTON G E. ImageNet classification with deep convolutional neuralnetworks[C]//Proceedings of the 25th International Conference on Neural Information Processing Systems. Red Hook:Curran Associates Inc.,2012:1097-1105. [5] RUSSAKOVSKY O,DENG J,SU H,et al. ImageNet large scale visual recognition challenge[J]. International Journal of Computer Vision,2015,115(3):211-252. [6] 邓志鹏, 孙浩, 雷琳, 等. 基于多尺度形变特征卷积网络的高分辨率遥感影像目标检测[J]. 测绘学报, 2018, 47(9):1216-1227. (DENG Z P,SUN H,LEI L,et al. Object detection in remote sensing imagery with multi-scale deformable convolutionalnetworks[J]. Acta Geodaetica et Cartographica Sinica,2018,47(9):1216-1227.) [7] 王方石, 王坚, 李兵, 等. 基于深度属性学习的交通标志检测[J]. 吉林大学学报(工学版), 2018, 48(1):319-329.(WANG F S, WANG J,LI B,et al. Deep attribute learning based traffic sign detection[J]. Journal of Jilin University (Engineering and Technology Edition),2018,48(1):319-329.) [8] GIRSHICK R. Fast R-CNN[C]//Proceedings of the 2015 IEEE International Conference on Computer Vision. Piscataway:IEEE, 2015:1440-1448. [9] REN S,HE K,GIRSHICK R,et al. Faster R-CNN:towards realtime object detection with region proposalnetworks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence,2017, 39(6):1137-1149. [10] DAI J,LI Y,HE K,et al. R-FCN:object detection via regionbased fully convolutionalnetworks[C]//Proceedings of the 30th International Conference on Neural Information Processing Systems. Red Hook:Curran Associates Inc.,2016:379-387. [11] REDMON J,DIVVALA S,GIRSHICK R,et al. You Only Look Once:unified,real-time object detection[C]//Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE,2016:779-788. [12] LIU W,ANGUELOV D,ERHAN D,et al. SSD:Single Shot MultiBox Detector[C]//Proceedings of the 2014 European Conference on Computer Vision,LNCS 9905. Cham Springer, 2016:21-37. [13] FU C,LIU W,RANGA A,et al. DSSD:Deconvolutional Single Shot Detector[EB/OL].[2020-02-23]. https://arxiv.org/pdf/1701.06659.pdf. [14] 姚群力, 胡显, 雷宏. 基于多尺度融合特征卷积神经网络的遥感图像飞机目标检测[J]. 测绘学报, 2019, 48(10):1266-1274. (YAO Q L,HU X,LEI H. Aircraft detection in remote sensing imagery with multi-scale feature fusion convolutional neuralnetworks[J]. Acta Geodaetica et Cartographica Sinica,2019, 4810:1266-1274. [15] 王俊强, 李建胜, 周学文, 等. 改进的SSD算法及其对遥感影像小目标检测性能的分析[J]. 光学学报, 2019, 39(6):373-382. (WANG J Q,LI J S,ZHOU X W, et al. Improved SSD algorithm and its performance analysis of small target detection in remote sensing images[J]. Acta Optica Sinica,2019,39(6):73-382.) [16] LIN T Y,DOLLÁR P,GIRSHICK R,et al. Feature pyramidnetworks for object detection[C]//Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE,2017:2117-2125. [17] 江源, 李建伟, 张玉婷. 基于特征重用和语义聚合的SAR图像舰船目标检测[J]. 海军航空工程学院学报, 2019, 34(6):470-479, 498.(JIANG Y,LI J W,ZHANG Y T. Ship object detection of SAR images based on feature reuse and semantic aggregation[J]. Journal of Naval Aeronautical and Astronautical University, 2019,34(6):470-479,498.) [18] SIMONYAN K, ZISSERMAN A. Very deep convolutionalnetworks for large-scale image recognition[EB/OL].[2020-02-17]. https://arxiv.org/pdf/1409.1556.pdf. [19] SZEGEDY C, LIU W, JIA Y, et al. Going deeper with convolutions[C]//Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE, 2015:1-9. [20] ZEILER M D, FERGUS R. Visualizing and understanding convolutionalnetworks[C]//Proceedings of the 2014 European Conference on Computer Vision,LNCS 8689. Cham:Springer, 2014:818-833 [21] HE K,ZHANG X,REN S,et al. Deep residual learning for image recognition[C]//Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE, 2016:770-778. [22] IOFFE S,SZEGEDY C. Batch normalization:accelerating deepnetwork training by reducing internal covariate shift[C]//Proceedings of the 32nd International Conference on Machine Learning. New York:ACM,2015:448-456. [23] BWLL S, ZITNICK L, BALA K, et al. Inside-outsidenet:detecting objects in context with skip pooling and recurrent neuralnetworks[C]//Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE, 2016:2874-2883. [24] LIN T Y,GOYAL P,GIRSHICK R,et al. Focal loss for dense object detection[C]//Proceedings of the 2017 IEEE International Conference on Computer Vision. Piscataway:IEEE,2017:2999-3007. [25] EVERINGHAM M,ESLAMI S M A,VAN GOOL L,et al. The PASCAL visual object classes challenge:a retrospective[J]. International Journal of Computer Vision,2015,111(1):98-136. [26] LENG J,LIU Y. An enhanced SSD with feature fusion and visual reasoning for object detection[J]. Neural Computing and Applications,2019,31:6549-6558.