Journal of Computer Applications (计算机应用) ›› 2014, Vol. 34 ›› Issue (10): 2934-2937. DOI: 10.11772/j.issn.1001-9081.2014.10.2934

• Virtual Reality and Digital Media •

Motion detection based on deep auto-encoder networks (基于深度自编码网络的运动目标检测)

XU Pei (徐培), CAI Xiaolu (蔡小路), HE Wenwei (何文伟), XIE Yidao (谢易道)

  1. School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, Sichuan 611731, China
  • Received: 2014-05-05  Revised: 2014-06-16  Online: 2014-10-01  Published: 2014-10-30
  • Contact: XU Pei
  • About the authors: XU Pei (1986-), male, born in Zigong, Sichuan, Ph.D. candidate, research interests: computer vision and machine learning; CAI Xiaolu (1990-), male, born in Huanggang, Hubei, M.S. candidate, research interests: computer vision and machine learning; HE Wenwei (1988-), male, born in Luzhou, Sichuan, M.S. candidate, research interests: computer vision and machine learning; XIE Yidao (1988-), male, born in Chengdu, Sichuan, M.S. candidate, research interests: computer vision and machine learning.
  • Supported by: the Fundamental Research Funds for the Central Universities

Abstract:

To address the poor performance of foreground extraction from dynamic backgrounds, a motion detection method based on deep auto-encoder networks was proposed. Firstly, background images containing no moving objects were extracted from the video frames by a three-layer deep auto-encoder network whose cost function took the background image as a variable. Then, a separating function was constructed to obtain the background of each input frame, and another three-layer deep auto-encoder network was used to learn the extracted background images. To enable the network to extract moving objects online during learning, an online learning algorithm was also proposed: weights with low sensitivity to the cost function were located and merged, so that more video frames could be processed. The experimental results show that, in extracting foreground moving objects from dynamic backgrounds, the proposed method improves detection accuracy by 6% and reduces the false alarm rate by 4.5% compared with the foreground detection work of Lu et al. (LU C, SHI J, JIA J. Online robust dictionary learning. Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2013: 415-422). The method also achieves better foreground-background separation in practical applications, laying a better foundation for research such as video analysis.
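
A minimal NumPy sketch may help make the summarized idea concrete: a three-layer (input-hidden-output) auto-encoder whose cost couples the reconstruction both to the input frame and to an explicit background variable B, plus a simple separating function that thresholds the residual against the learned background. The cost form J(x) = ||x_hat - x||^2 + lam * ||x_hat - B||^2, the threshold tau, and all names (BackgroundAutoEncoder, separate_foreground, lam, lr) are illustrative assumptions for this sketch, not the authors' exact formulation.

    import numpy as np

    rng = np.random.default_rng(0)

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    class BackgroundAutoEncoder:
        """Input -> hidden -> output auto-encoder with an explicit background variable B."""

        def __init__(self, n_pixels, n_hidden, lam=0.5, lr=0.05):
            self.W1 = 0.01 * rng.standard_normal((n_hidden, n_pixels))
            self.b1 = np.zeros(n_hidden)
            self.W2 = 0.01 * rng.standard_normal((n_pixels, n_hidden))
            self.b2 = np.zeros(n_pixels)
            self.B = np.zeros(n_pixels)   # background image, optimized together with the weights
            self.lam = lam                # weight of the background-coupling term (assumed form)
            self.lr = lr

        def forward(self, x):
            h = sigmoid(self.W1 @ x + self.b1)
            x_hat = sigmoid(self.W2 @ h + self.b2)
            return h, x_hat

        def step(self, x):
            """One gradient step on J(x) = ||x_hat - x||^2 + lam * ||x_hat - B||^2."""
            h, x_hat = self.forward(x)
            d_out = 2 * (x_hat - x) + 2 * self.lam * (x_hat - self.B)   # dJ/dx_hat
            delta2 = d_out * x_hat * (1 - x_hat)                        # output-layer error
            delta1 = (self.W2.T @ delta2) * h * (1 - h)                 # hidden-layer error
            self.W2 -= self.lr * np.outer(delta2, h)
            self.b2 -= self.lr * delta2
            self.W1 -= self.lr * np.outer(delta1, x)
            self.b1 -= self.lr * delta1
            self.B -= self.lr * (-2 * self.lam * (x_hat - self.B))      # dJ/dB pulls B toward x_hat
            return x_hat

    def separate_foreground(frame, background, tau=0.15):
        """Illustrative separating function: threshold the residual against the background."""
        return (np.abs(frame - background) > tau).astype(np.uint8)

    # Toy usage on random 8x8 "frames" flattened to length-64 vectors in [0, 1);
    # with real video, x would be a flattened, normalized grayscale frame.
    ae = BackgroundAutoEncoder(n_pixels=64, n_hidden=16)
    for _ in range(200):
        ae.step(rng.random(64))
    mask = separate_foreground(rng.random(64), ae.B)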

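For the online learning step, the abstract only states that weights with low sensitivity to the cost function are found and merged so that more frames can be processed. The sketch below shows one plausible reading under stated assumptions: per-hidden-unit sensitivity is approximated by the accumulated gradient magnitude of the cost with respect to the encoder weights, and the two least sensitive hidden units are merged into one. The sensitivity measure, the merging rule, and the helper name merge_least_sensitive_units are all hypothetical, not taken from the paper.

    import numpy as np

    def merge_least_sensitive_units(W1, b1, W2, grad_W1_acc):
        """Merge the two hidden units whose weights show the lowest accumulated
        cost-function sensitivity, shrinking the hidden layer by one unit.

        W1: (n_hidden, n_in) encoder weights      b1: (n_hidden,) encoder biases
        W2: (n_in, n_hidden) decoder weights
        grad_W1_acc: (n_hidden, n_in) accumulated |dJ/dW1| over recent frames,
                     used here as the sensitivity measure (an assumption).
        """
        sensitivity = np.abs(grad_W1_acc).sum(axis=1)   # per-unit sensitivity score
        i, j = np.argsort(sensitivity)[:2]              # the two least sensitive units
        W1[i] = 0.5 * (W1[i] + W1[j])                   # average their encoder parameters
        b1[i] = 0.5 * (b1[i] + b1[j])
        W2[:, i] = W2[:, i] + W2[:, j]                  # preserve their joint reconstruction
        keep = np.ones(W1.shape[0], dtype=bool)
        keep[j] = False                                 # drop the merged-away unit
        return W1[keep], b1[keep], W2[:, keep]

    # Toy usage: a 16-unit hidden layer shrinks to 15 units after one merge.
    rng = np.random.default_rng(0)
    W1, b1, W2 = rng.random((16, 64)), np.zeros(16), rng.random((64, 16))
    W1, b1, W2 = merge_least_sensitive_units(W1, b1, W2, rng.random((16, 64)))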
CLC Number: