计算机应用 ›› 2017, Vol. 37 ›› Issue (6): 1702-1707.DOI: 10.11772/j.issn.1001-9081.2017.06.1702

• 人工智能 • 上一篇    下一篇

基于深度卷积神经网络的航空器检测与识别

俞汝劼1, 杨贞1, 熊惠霖1,2   

  1. 1. 上海交通大学 电子信息与电气工程学院, 上海 200240;
    2. 上海交通大学 计算机模式识别实验室, 上海 200240
  • 收稿日期:2016-10-12 修回日期:2017-02-10 出版日期:2017-06-10 发布日期:2017-06-14
  • 通讯作者: 俞汝劼
  • 作者简介:俞汝劼(1992-),男,上海人,硕士研究生,主要研究方向:图像解译与评估;杨贞(1985-),男,山东菏泽人,博士研究生,主要研究方向:模式识别、计算机视觉;熊惠霖(1964-),男,湖北黄冈人,教授,博士,主要研究方向:基于核方法的非线性模式识别和机器学习、图像处理、机器视觉、生物信息学。
  • 基金资助:
    国家自然科学基金资助项目(61375008)。

Aircraft detection and recognition based on deep convolutional neural network

YU Rujie1, YANG Zhen1, XIONG Huilin1,2   

  1. 1. School of Electronic Information and Electrical Engineering, Shanghai Jiao Tong University, Shanghai 200240, China;
    2. Computer Pattern Recognition Laboratory, Shanghai Jiao Tong University, Shanghai 200240, China
  • Received:2016-10-12 Revised:2017-02-10 Online:2017-06-10 Published:2017-06-14
  • Supported by:
    This work is partially supported by the National Natural Science Foundation of China (61375008).

摘要: 针对军用机场大尺寸卫星图像中航空器检测识别的具体应用场景,建立了一套实时目标检测识别框架,将深度卷积神经网络应用到大尺寸图像中的航空器目标检测与识别任务中。首先,将目标检测的任务看成空间上独立的bounding-box的回归问题,用一个24层卷积神经网络模型来完成bounding-box的预测;然后,利用图像分类网络来完成目标切片的分类任务。大尺寸图像上的传统目标检测识别算法通常在时间效率上很难突破,而基于卷积神经网络的航空器目标检测识别算法充分利用了计算硬件的优势,大大缩短了任务耗时。在符合应用场景的自采数据集上进行测试,所提算法目标检测实时性达到平均每张5.765 s,在召回率65.1%的工作点上达到了79.2%的精确率,分类网络的实时性达到平均每张0.972 s,Top-1错误率为13%。所提框架在军用机场大尺寸卫星图像中航空器检测识别的具体应用问题上提出了新的解决思路,同时保证了实时性和算法精度。

关键词: 深度学习, 卷积神经网络, 航空器检测, 目标检测识别

Abstract: Aiming at the specific application scenario of aircraft detection in large-scale satellite images of military airports, a real-time target detection and recognition framework was proposed. The deep Convolutional Neural Network (CNN) was applied to the target detection task and recognition task of aircraft in large-scale satellite images. Firstly, the task of aircraft detection was regarded as a regression problem of the spatially independent bounding-box, and a 24-layer convolutional neural network model was used to complete the bounding-box prediction. Then, an image classification network was used to complete the classification task of the target slices. The traditional target detection and recognition algorithm on large-scale images is usually difficult to make a breakthrough in time efficiency. The proposed target detection and recognition framework of aircraft based on CNN makes full use of the advantages of computing hardware greatly and shortens the executing time. The proposed framework was tested on a self-collected data set consistent with application scenarios. The average time of the proposed framework is 5.765 s for processing each input image, meanwhile, the precision is 79.2% at the operating point with the recall of 65.1%. The average time of the classification network is 0.972 s for each image and the Top-1 error rate is 13%. The proposed framework provides a new solution for application problem of aircraft detection in large-scale satellite images of military airports with relatively high efficiency and precision.

Key words: deep learning, Convolutional Neural Network (CNN), aircraft detection, target detection and recognition

中图分类号: