Dense crowd counting model based on spatial dimensional recurrent perception network

doi:10.11772/j.issn.1001-9081.2020050623

Journal of Computer Applications, 2021, Vol. 41, Issue 2: 544-549. DOI: 10.11772/j.issn.1001-9081.2020050623

Special Issue: Multimedia computing and computer simulation

• Multimedia computing and computer simulation •

FU Qianhui, LI Qingkui, FU Jingnan, WANG Yu

School of Automation, Beijing Information Science and Technology University, Beijing 100192, China

Received: 2020-05-12 Revised: 2020-09-18 Online: 2020-10-20 Published: 2021-02-10
Supported by:
This work is partially supported by the Promoting Connotative Development of University-Postgraduate Science and Technology Innovation Project (5121911048).

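Corresponding author: LI Qingkui
About the authors: FU Qianhui (born 1996), female, from Liaocheng, Shandong, M.S. candidate; research interests: image processing, supply chain systems. LI Qingkui (born 1971), male, from Linyi, Shandong, professor, Ph.D.; research interests: switched time-delay systems, supply chain systems. FU Jingnan (born 1993), male, from Putian, Fujian, M.S. candidate; research interests: image processing, deep learning. WANG Yu (born 1996), female, from Beijing, M.S. candidate; research interests: image processing, supply chain systems.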

Abstract

Abstract: To address the limitations of feature extraction for high-density crowd images with perspective distortion, a crowd counting model named LMCNN is proposed, combining a Global Feature Perception Network (GFPNet) with a Local Association Feature Perception Network (LAFPNet). GFPNet is the backbone network of LMCNN; its output feature map is serialized and used as the input of LAFPNet. LAFPNet exploits the ability of a Recurrent Neural Network (RNN) to perceive local association features along the time-series dimension, mapping a single static spatial feature into a feature space with local sequence association features and thereby reducing the impact of perspective distortion on crowd density estimation. To verify the effectiveness of the proposed model, experiments were conducted on the ShanghaiTech Part A and UCF_CC_50 datasets. Compared with the Atrous Convolutions Spatial Pyramid Network (ACSPNet), LMCNN reduced the Mean Absolute Error (MAE) by at least 18.7% and 20.3% on the two datasets, respectively, and the Mean Square Error (MSE) by at least 22.3% and 22.6%, respectively. LMCNN focuses on the association between preceding and following features along the spatial dimension. By fully integrating spatial features with sequence features within a single image, it reduces the counting error caused by perspective distortion, predicts the number of people in dense areas more accurately, and improves the regression accuracy of crowd density.
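
The serialization step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the row-by-row ordering, the plain tanh recurrence (standing in for the LSTM named in the keywords), and the names `serialize_rows` and `rnn_over_sequence` are all assumptions made for illustration, and the weights are random rather than trained.

```python
import math
import random


def serialize_rows(feature_map):
    """Turn an H x W feature map into a sequence of H row vectors.

    Assumption: rows are read top to bottom; the abstract does not
    state the ordering used by LMCNN.
    """
    return [list(row) for row in feature_map]


def rnn_over_sequence(sequence, w_in, w_rec, bias):
    """Plain Elman recurrence: h_t = tanh(W_in x_t + W_rec h_{t-1} + b).

    Returns the hidden state after each sequence element, so every
    output mixes the current row with context from earlier rows.
    """
    hidden_size = len(bias)
    h = [0.0] * hidden_size
    outputs = []
    for x in sequence:
        # The comprehension reads the previous h before it is rebound.
        h = [
            math.tanh(
                sum(w_in[j][i] * x[i] for i in range(len(x)))
                + sum(w_rec[j][k] * h[k] for k in range(hidden_size))
                + bias[j]
            )
            for j in range(hidden_size)
        ]
        outputs.append(h)
    return outputs


# Hypothetical sizes and random values, for illustration only.
random.seed(0)
H, W, HIDDEN = 4, 6, 3
feature_map = [[random.random() for _ in range(W)] for _ in range(H)]
w_in = [[random.uniform(-0.5, 0.5) for _ in range(W)] for _ in range(HIDDEN)]
w_rec = [[random.uniform(-0.5, 0.5) for _ in range(HIDDEN)] for _ in range(HIDDEN)]
bias = [0.0] * HIDDEN

sequence = serialize_rows(feature_map)
states = rnn_over_sequence(sequence, w_in, w_rec, bias)
print(len(states), len(states[0]))
```

Each hidden state depends on the current row and on the rows read before it, which is one way a recurrent pass over a single image could relate regions of different apparent scale.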

Key words: crowd counting, crowd density estimation, Convolutional Neural Network (CNN), Multi-column Convolutional Neural Network (MCNN), Long Short-Term Memory (LSTM) neural network
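
The error figures quoted in the abstract are relative reductions in MAE and MSE. The sketch below shows how such numbers are typically computed. It rests on assumptions: crowd counting papers commonly report the root of the mean squared count error under the name MSE, and whether this paper follows that convention is not stated here. The function names and the sample counts are hypothetical and are not results from the paper.

```python
import math


def mae(true_counts, predicted_counts):
    """Mean absolute error between true and predicted per-image counts."""
    n = len(true_counts)
    return sum(abs(t - p) for t, p in zip(true_counts, predicted_counts)) / n


def root_mse(true_counts, predicted_counts):
    """Root of the mean squared count error (assumed convention for 'MSE')."""
    n = len(true_counts)
    return math.sqrt(
        sum((t - p) ** 2 for t, p in zip(true_counts, predicted_counts)) / n
    )


def relative_reduction(baseline_error, new_error):
    """Percentage by which new_error is lower than baseline_error."""
    return 100.0 * (baseline_error - new_error) / baseline_error


# Made-up counts for three images, for illustration only.
true_counts = [100, 200, 300]
predicted_counts = [110, 190, 330]

print(mae(true_counts, predicted_counts))
print(root_mse(true_counts, predicted_counts))
print(relative_reduction(200.0, 150.0))
```

A reduction of "at least 18.7%" in MAE therefore means that the model's MAE is no more than 81.3% of the baseline's MAE on the same dataset.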

CLC Number: TP391.4

FU Qianhui, LI Qingkui, FU Jingnan, WANG Yu. Dense crowd counting model based on spatial dimensional recurrent perception network[J]. Journal of Computer Applications, 2021, 41(2): 544-549.
