Journal of Computer Applications ›› 2017, Vol. 37 ›› Issue (2): 530-534.DOI: 10.11772/j.issn.1001-9081.2017.02.0530

Previous Articles     Next Articles

Traffic sign recognition based on optimized convolutional neural network architecture

WANG Xiaobin1, HUANG Jinjie1, LIU Wenju2   

  1. 1. School of Automation, Harbin University of Science and Technology, Harbin Heilongjiang 150080, China;
    2. Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
  • Received:2016-07-28 Revised:2016-09-21 Online:2017-02-10 Published:2017-02-11
  • Supported by:
    This work is partially supported by the National Natural Science Foundation of China (61573357,61503382,61403370, 61273267).

基于优化卷积神经网络结构的交通标志识别

王晓斌1, 黄金杰1, 刘文举2   

  1. 1. 哈尔滨理工大学 自动化学院, 哈尔滨 150080;
    2. 中国科学院 自动化研究所, 北京 100190
  • 通讯作者: 刘文举,lwj@nlpr.ia.ac.cn
  • 作者简介:王晓斌(1990-),男,山东潍坊人,硕士研究生,主要研究方向:图像识别、目标检测;黄金杰(1967-),男,山东莱阳人,教授,博士,主要研究方向:智能建模、优化控制、模式识别;刘文举(1960-),男,北京人,教授,博士,主要研究方向:机器学习、语音增强、语音识别、声源定位、声音事件检测、图像识别。
  • 基金资助:
    国家自然科学基金资助项目(61573357,61503382,61403370,61273267)。

Abstract: In the existing algorithms for traffic sign recognition, sometimes the training time is short but the recognition rate is low, and other times the recognition rate is high but the training time is long. To resolve these problems, the Convolutional Neural Network (CNN) architecture was optimized by using Batch Normalization (BN) method, Greedy Layer-Wise Pretraining (GLP) method and replacing classifier with Support Vector Machine (SVM), and a new traffic sign recognition algorithm based on optimized CNN architecture was proposed. BN method was used to change the data distribution of the middle layer, and the output data of convolutional layer was normalized to the mean value of 0 and the variance value of 1, thus accelerating the training convergence and reducing the training time. By using the GLP method, the first layer of convolutional network was trained with its parameters preserved when the training was over, then the second layer was also trained with the parameters preserved until all the convolution layers were trained completely. The GLP method can effectively improve the recognition rate of the convolutional network. The SVM classifier only focused on the samples with error classification and no longer processed the correct samples, thus speeding up the training. The experiments were conducted on Germany traffic sign recognition benchmark, the results showed that compared with the traditional CNN, the training time of the new algorithm was reduced by 20.67%, and the recognition rate of the new algorithm reached 98.24%. The experimental results prove that the new algorithm greatly shortens the training time and reached a high recognition rate by optimizing the structure of the traditional CNN.

Key words: Convolutional Neural Network (CNN), batch normalization, Greedy Layer-wise Pretraining (GLP), Support Vector Machine (SVM)

摘要: 现有算法对交通标志进行识别时,存在训练时间短但识别率低,或识别率高但训练时间长的问题。为此,综合批量归一化(BN)方法、逐层贪婪预训练(GLP)方法,以及把分类器换成支持向量机(SVM)这三种方法对卷积神经网络(CNN)结构进行优化,提出基于优化CNN结构的交通标志识别算法。其中:BN方法可以用来改变中间层的数据分布情况,把卷积层输出数据归一化为均值为0、方差为1,从而提高训练收敛速度,减少训练时间;GLP方法则是先训练第一层卷积网络,训练完把参数保留,继续训练第二层,保留参数,直到把所有卷积层训练完毕,这样可以有效提高卷积网络识别率;SVM分类器只专注于那些分类错误的样本,对已经分类正确的样本不再处理,从而提高了训练速度。使用德国交通标志识别数据库进行训练和识别,新算法的训练时间相对于传统CNN训练时间减少了20.67%,其识别率达到了98.24%。所提算法通过对传统CNN结构进行优化,极大地缩短了训练时间,并具有较高的识别率。

关键词: 卷积神经网络, 批量归一化, 贪婪预训练, 支持向量机

CLC Number: