Pedestrian detection method based on cascade networks

doi:10.11772/j.issn.1001-9081.2018061351

Journal of Computer Applications ›› 2019, Vol. 39 ›› Issue (1): 186-191.DOI: 10.11772/j.issn.1001-9081.2018061351

Previous Articles Next Articles

Pedestrian detection method based on cascade networks

CHEN Guangxi¹, WANG Jiaxin¹, HUANG Yong², ZHAN Yijun¹, ZHAN Baoying¹

1. Guangxi Key Laboratory of Intelligent Processing of Computer Images and Graphics(Guilin University of Electronic Technology), Guilin Guangxi 541004, China;
2. Guangdong Engineering Technology Research Center for Mathematical Educational Software(Guangzhou University), Guangzhou Guangdong 510006, China

Received:2018-06-28 Revised:2018-08-14 Online:2019-01-21 Published:2019-01-10
Supported by:
This work is partially supported by the National Natural Science Foundation of China (61462018), the Open Fund of Guangdong Engineering Technology Research Center for Mathematical Educational Software(LD16124X), the Graduate Education Innovation Project of Guilin University of Electronic Science and Technology (2016XWYJ09).

基于级联网络的行人检测方法

陈光喜¹, 王佳鑫¹, 黄勇², 詹益俊¹, 詹宝莹¹

1. 广西图像图形智能处理重点实验室(桂林电子科技大学), 广西桂林 541004;
2. 广东省数学教育软件工程技术研究中心(广州大学), 广州 510006

通讯作者: 王佳鑫
作者简介:陈光喜(1971-),男,四川金堂人,教授,博士,CCF会员,主要研究方向:可信计算、图像处理;王佳鑫(1992-),男,江苏泰州人,硕士研究生,主要研究方向:图像处理;黄勇(1958-),男,四川达州人,教授,博士,主要研究方向:数学教育智能软件与应用;詹益俊(1990-),男,河南商城人,硕士研究生,主要研究方向:图像处理;詹宝莹(1994-),女,辽宁辽阳人,硕士研究生,主要研究方向:图像处理。
基金资助:
国家自然科学基金资助项目（61462018）；广东省数学教育软件工程技术研究中心开放基金资助项目（LD16124X）；桂林电子科技大学研究生教育创新项目（2016XWYJ09）。

Abstract

Abstract: In complex environment, existing pedestrian detection methods can not be very good to achieve high recall rate and efficient detection. To solve this problem, a pedestrian detection method based on Convolutional Neural Network (CNN) was proposed. Firstly, pedestrian locations in input images were initially detected with single step detection upgrade network (YOLOv2) derived from CNN. Secondly, a network with target classification and bounding box regression was designed to cascade with YOLOv2 network, which made reclassification and regression of pedestrian location initially detected by YOLOv2, to reduce error detections and increase recall rate. Finally, a Non-Maximum Suppression (NMS) method was used to remove redundant bounding boxes. The experimental results show that, in INRIA and Caltech dataset, the proposed method increases recall rate by 3.3 percentage points, and the accuracy is increased by 5.1 percentage points compared with original YOLOv2. It also reached a speed of 11.6FPS (Frames Per Second) to realize real-time detection. Compared with the existing six popular pedestrian detection methods, the proposed method has better overall performance.

Key words: pedestrian detection, Convolutional Neural Network (CNN), cascade network, classification and regression, real-time detection

摘要： 针对复杂环境下行人检测不能同时满足高召回率与高效率检测的问题，提出一种基于卷积神经网络（CNN）的行人检测方法。首先，采用CNN中的单步检测升级版网络YOLOv2初步检测行人；然后，设计一个网络与YOLOv2网络级联。设计的网络具有目标分类和边界框回归的功能，对YOLOv2初步检测出的行人位置进行再分类与回归，以此降低误检，提高召回率；最后，采用非极大值抑制（NMS）处理的方法去除冗余的边界框。实验结果显示，在数据集INRIA和Caltech上，所提方法与原始YOLOv2相比，召回率提高3.3个百分点，准确率提高5.1个百分点，同时速度上达到了11.6帧/s，实现了实时检测。与现有的流行的行人检测方法相比，所提方法具有更好的整体性能。

关键词: 行人检测, 卷积神经网络, 级联网络, 分类回归, 实时检测

CLC Number:

CHEN Guangxi, WANG Jiaxin, HUANG Yong, ZHAN Yijun, ZHAN Baoying. Pedestrian detection method based on cascade networks[J]. Journal of Computer Applications, 2019, 39(1): 186-191.

陈光喜, 王佳鑫, 黄勇, 詹益俊, 詹宝莹. 基于级联网络的行人检测方法[J]. 计算机应用, 2019, 39(1): 186-191.

References

[1] 苏松志,李绍滋,陈淑媛,等.行人检测技术综述[J].电子学报,2012,40(4):814-820.(SU S Z, LI S Z, CHEN S Y, et al. A survey on pedestrian detection[J]. Acta Electronica Sinica, 2012, 40(4):814-820.)
[2] REN S, HE K, GIRSHICK R, et al. Faster R-CNN:towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(6):1137-1149.
[3] GIRSHICK R. Fast R-CNN[C]//Proceedings of the 2015 IEEE International Conference on Computer Vision. Piscataway, NJ:IEEE, 2015:1440-1448.
[4] REDMON J, DIVVALA S, GIRSHICK R, et al. You Only Look Once:unified, real-time object detection[C]//Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. Washington, DC:IEEE Computer Society, 2016:779-788.
[5] REDMON J, FARHADI A. YOLO9000:better, faster, stronger[C]//Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition. Washington, DC:IEEE Computer Society, 2017:6517-6525.
[6] LIU W, ANGUELOV D, ERHAN D, et al. SSD:Single Shot multibox Detector[C]//Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. Washington, DC:IEEE Computer Society, 2016:21-37.
[7] DALAL N, TRIGGS B. Histograms of oriented gradients for human detection[C]//Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Washington, DC:IEEE Computer Society, 2005:886-893.
[8] DOLLAR P, WOJEK C, SCHIELE B, et al. Pedestrian detection:a benchmark[C]//Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition. Washington, DC:IEEE Computer Society, 2009:304-311.
[9] KONG T, YAO A, CHEN Y, et al. HyperNet:towards accurate region proposal generation and joint object detection[C]//Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. Washington, DC:IEEE Computer Society,2016:845-853.
[10] LIN T Y, DOLLAR P, GIRSHICK R, et al. Feature pyramid networks for object detection[C]//Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition. Washington, DC:IEEE Computer Society, 2017:936-944.
[11] MAO J, XIAO T, JIANG Y, et al. What can help pedestrian detection?[C]//Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition. Washington, DC:IEEE Computer Society, 2017:6034-6043.
[12] ZHANG K, ZHANG Z, LI Z, et al. Joint face detection and alignment using multitask cascaded convolutional networks[J]. IEEE Signal Processing Letters, 2016, 23(10):1499-1503.
[13] HE K, ZHANG X, REN S, et al. Delving deep into rectifiers:surpassing human-level performance on ImageNet classification[C]//ICCV 2015:Proceedings of the 2015 IEEE International Conference on Computer Vision. Piscataway, NJ:IEEE, 2015:1026-1034.
[14] SZEGEDY C, TOSHEV A, ERHAN D. Deep neural networks for object detection[J]. Advances in Neural Information Processing Systems, 2013, 26(1):2553-2561.
[15] TOMÈ D, MONTI F, BAROFFIO L, et al. Deep convolutional neural networks for pedestrian detection[J]. Signal Processing:Image Communication, 2016, 47(1):482-489.
[16] ROTHE R, GUILLAUMIN M, VAN GOOL L. Non-maximum suppression for object detection by passing messages between windows[C]//Proceedings of the 2014 Asian Conference on Computer Vision. Berlin:Springer, 2014:290-306.
[17] FELZENSZWALB P, MCALLESTER D, RAMANAN D. A dis-criminatively trained, multiscale, deformable part model[C]//Proceedings of the 2008 IEEE Conference on Computer Vision and Pattern Recognition. Washington, DC:IEEE Computer Society, 2008:1-8.
[18] DOLLAR P, APPEL R, BELONGIE S, et al. Fast feature pyramids for object detection[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2014, 36(8):1532-1545.
[19] ZHANG L, LIN L, LIANG X, et al. Is Faster R-CNN doing well for pedestrian detection?[C]//ECCV 2016:Proceedings of the 14th European Conference on Computer Vision. Berlin:Springer, 2016:443-457.

Pedestrian detection method based on cascade networks

基于级联网络的行人检测方法

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics

[1]	Yun LI, Fuyou WANG, Peiguang JING, Su WANG, Ao XIAO. Uncertainty-based frame associated short video event detection method [J]. Journal of Computer Applications, 2024, 44(9): 2903-2910.
[2]	Hong CHEN, Bing QI, Haibo JIN, Cong WU, Li’ang ZHANG. Class-imbalanced traffic abnormal detection based on 1D-CNN and BiGRU [J]. Journal of Computer Applications, 2024, 44(8): 2493-2499.
[3]	Cui WANG, Miaolei DENG, Dexian ZHANG, Lei LI, Xiaoyan YANG. Review of end-to-end person search algorithms based on images [J]. Journal of Computer Applications, 2024, 44(8): 2544-2550.
[4]	Dongwei WANG, Baichen LIU, Zhi HAN, Yanmei WANG, Yandong TANG. Deep network compression method based on low-rank decomposition and vector quantization [J]. Journal of Computer Applications, 2024, 44(7): 1987-1994.
[5]	Yangyi GAO, Tao LEI, Xiaogang DU, Suiyong LI, Yingbo WANG, Chongdan MIN. Crowd counting and locating method based on pixel distance map and four-dimensional dynamic convolutional network [J]. Journal of Computer Applications, 2024, 44(7): 2233-2242.
[6]	Mengyuan HUANG, Kan CHANG, Mingyang LING, Xinjie WEI, Tuanfa QIN. Progressive enhancement algorithm for low-light images based on layer guidance [J]. Journal of Computer Applications, 2024, 44(6): 1911-1919.
[7]	Jianjing LI, Guanfeng LI, Feizhou QIN, Weijun LI. Multi-relation approximate reasoning model based on uncertain knowledge graph embedding [J]. Journal of Computer Applications, 2024, 44(6): 1751-1759.
[8]	Yaping DENG, Yingjiang LI. Review of YOLO algorithm and its applications to object detection in autonomous driving scenes [J]. Journal of Computer Applications, 2024, 44(6): 1949-1958.
[9]	Wenshuo GAO, Xiaoyun CHEN. Point cloud classification network based on node structure [J]. Journal of Computer Applications, 2024, 44(5): 1471-1478.
[10]	Min SUN, Qian CHENG, Xining DING. CBAM-CGRU-SVM based malware detection method for Android [J]. Journal of Computer Applications, 2024, 44(5): 1539-1545.
[11]	Jie WANG, Hua MENG. Image classification algorithm based on overall topological structure of point cloud [J]. Journal of Computer Applications, 2024, 44(4): 1107-1113.
[12]	Tianhua CHEN, Jiaxuan ZHU, Jie YIN. Bird recognition algorithm based on attention mechanism [J]. Journal of Computer Applications, 2024, 44(4): 1114-1120.
[13]	Lijun XU, Hui LI, Zuyang LIU, Kansong CHEN, Weixuan MA. 3D-GA-Unet： MRI image segmentation algorithm for glioma based on 3D-Ghost CNN [J]. Journal of Computer Applications, 2024, 44(4): 1294-1302.
[14]	Jingxian ZHOU, Xina LI. UAV detection and recognition based on improved convolutional neural network and radio frequency fingerprint [J]. Journal of Computer Applications, 2024, 44(3): 876-882.
[15]	Ruifeng HOU, Pengcheng ZHANG, Liyuan ZHANG, Zhiguo GUI, Yi LIU, Haowen ZHANG, Shubin WANG. Iterative denoising network based on total variation regular term expansion [J]. Journal of Computer Applications, 2024, 44(3): 916-921.