计算机应用 ›› 2018, Vol. 38 ›› Issue (12): 3403-3408.DOI: 10.11772/j.issn.1001-9081.2018050974

• 人工智能 • 上一篇    下一篇

基于改进卷积神经网络的多源数字识别算法

卜令正, 王洪栋, 朱美强, 代伟   

  1. 中国矿业大学 信息与控制工程学院, 江苏 徐州 221116
  • 收稿日期:2018-05-10 修回日期:2018-06-14 出版日期:2018-12-10 发布日期:2018-12-15
  • 通讯作者: 朱美强
  • 作者简介:卜令正(1991-),男,江苏徐州人,硕士研究生,主要研究方向:机器视觉、机器学习、深度学习;王洪栋(1986-),男,山东临沂人,博士研究生,主要研究方向:机器视觉、图像处理;朱美强(1979-),男,重庆人,副教授,博士,主要研究方向:机器学习、机器视觉、机器人与智能系统、工业综合自动化;代伟(1984-),男,河南安阳人,副教授,博士,主要研究方向:机器学习、机器视觉、人工智能、工业大数据分析。
  • 基金资助:
    国家自然科学基金资助项目(61603393);江苏省自然科学基金资助项目(BK2016275)。

Multi-source digit recognition algorithm based on improved convolutional neural network

BU Lingzheng, WANG Hongdong, ZHU Meiqiang, DAI Wei   

  1. School of Information and Control Engineering, China University of Mining and Technology, Xuzhou Jiangsu 221116, China
  • Received:2018-05-10 Revised:2018-06-14 Online:2018-12-10 Published:2018-12-15
  • Contact: 朱美强
  • Supported by:
    This work is partially supported by the National Natural Science Foundation of China (61603393), the Natural Science Foundation of Jiangsu Province (BK2016275).

摘要: 现有的数字识别算法多是对单一类型数字进行识别,无法应对识别多源数字。针对包含手写体数字与数码管数字的字符识别场景,提出一种基于改进卷积神经网络(CNN)的多源数字识别算法。首先,使用从数显仪表生产企业现场采集的样本,结合MINIST数据集,建立起包含手写体和数码管的混合数据集;然后,考虑更好的鲁棒性,提出一种改进的CNN,并用上述混合数据集对其训练,实现了一个网络识别多类型数字;最后,训练好的神经网络模型被成功应用于RoboMaster机甲大赛的多源数字识别场景中。测试结果表明,所提算法整体识别准确率稳定且较高,具有较好的鲁棒性和泛化能力。

关键词: 卷积神经网络, 多源数字识别, 混合数据集, RoboMaster

Abstract: Most of the existing digit recognition algorithms recognize single-type digits, and can not recognize multi-source digits. Aiming at the character recognition scenarios with handwritten digits and digital tube digits, a multi-source digit recognition algorithm based on improved Convolutional Neural Network (CNN) was proposed. Firstly, a mixed data set consisting of handwritten and digital tube digits was established by using the samples collected from the field of digital display instrument manufacturer and MINIST data set. Then, considering better robustness, an improved CNN was proposed, which was trained by the above mixed data set, and a network was realized to recognize multi-type digits. Finally, the trained neural network model was successfully applied to the multi-source digit recognition scene of RoboMaster robotics competition. The test results show that, the overall recognition accuracy of the proposed algorithm is stable and high, and it has good robustness and generalization ability.

Key words: Convolutional Neural Network (CNN), multi-source digit recognition, mixed data set, RoboMaster

中图分类号: