Fast image background automatic replacement based on dilated convolution

doi:10.11772/j.issn.1001-9081.2017081966

Journal of Computer Applications, 2018, Vol. 38, Issue (2): 405-409. DOI: 10.11772/j.issn.1001-9081.2017081966

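Corresponding author: ZHOU Wei
About the authors: ZHANG Hao (born 1992), male, from Yuxi, Yunnan, M.S. candidate, main research interest: deep learning; DOU Qiwei (born 1994), male, from Baotou, Inner Mongolia, M.S. candidate, main research interest: deep learning; LUAN Guikai (born 1993), male, from Yantai, Shandong, M.S. candidate, main research interest: deep learning; YAO Shaowen (born 1966), male, from Kunming, Yunnan, professor, Ph.D., main research interests: workflow, Petri nets; ZHOU Wei (born 1974), male, from Kunming, Yunnan, professor, Ph.D., main research interests: distributed processing, bioinformatics.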

ZHANG Hao, DOU Qiwei, LUAN Guikai, YAO Shaowen, ZHOU Wei

School of Software, Yunnan University, Kunming, Yunnan 650091, China

Received: 2017-08-11 Revised: 2017-09-09 Online: 2018-02-10 Published: 2018-02-10
Supported by:
This work is partially supported by the National Natural Science Foundation of China (61762089, 61363021, 61640306).

Abstract

Abstract: Background replacement is a complex process, so traditional methods are inefficient and their accuracy is difficult to improve. To address these problems, a fast image background replacement method based on dilated convolution, called FABRNet, was proposed. First, the first three parts of the VGG (Visual Geometry Group network) model were used to perform convolution and pooling operations on the input image. Second, multiple groups of dilated convolutions were combined in parallel so that the network has a receptive field that is both sufficiently large and sufficiently fine; in addition, a residual network structure was added to preserve the accuracy of the positional distribution of information during convolution. Finally, bilinear interpolation was used to scale the result back to the original image size for output. In experiments comparing FABRNet with three classical methods, KNN (K-Nearest Neighbors) matting, Portrait matting and Deep matting, the results show that FABRNet can effectively perform automatic background replacement and has an advantage in running speed.

Key words: deep learning, dilated convolution, residual network, background replacement, bilinear interpolation
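
To make the role of the dilated convolutions concrete, the following is a minimal pure-Python sketch of the general technique, not the FABRNet implementation. It assumes a 1-D signal instead of an image, and the function names, the kernel values and the dilation rates are all hypothetical choices made for illustration. It shows how a dilated kernel widens the span it covers without adding weights, and how several dilation rates can be run in parallel and added to an identity (residual) shortcut.

```python
def dilated_conv1d(signal, kernel, dilation):
    """Length-preserving 1-D dilated convolution (cross-correlation form).

    Assumes an odd kernel length; the input is zero-padded on both sides.
    """
    k = len(kernel)
    span = (k - 1) * dilation            # distance between first and last tap
    pad = span // 2                      # left padding; the rest goes on the right
    padded = [0.0] * pad + list(signal) + [0.0] * (span - pad)
    return [
        sum(kernel[j] * padded[i + j * dilation] for j in range(k))
        for i in range(len(signal))
    ]


def effective_kernel_size(kernel_size, dilation):
    """Width covered by one dilated kernel: k + (k - 1) * (d - 1)."""
    return kernel_size + (kernel_size - 1) * (dilation - 1)


def parallel_dilated_block(signal, kernel, dilations):
    """Sum parallel dilated branches and add the input back (residual shortcut)."""
    branches = [dilated_conv1d(signal, kernel, d) for d in dilations]
    return [x + sum(b[i] for b in branches) for i, x in enumerate(signal)]


if __name__ == "__main__":
    impulse = [0, 0, 0, 0, 1, 0, 0, 0, 0]
    kernel = [1, 1, 1]                   # hypothetical weights, for illustration only
    for d in (1, 2, 4):
        print(d, effective_kernel_size(len(kernel), d), dilated_conv1d(impulse, kernel, d))
    print(parallel_dilated_block(impulse, kernel, [1, 2]))
```

Feeding in a single impulse makes the effect visible: with dilation 1 the response covers three adjacent positions, while with dilation 2 the same three weights are spread over a span of five positions. Combining small and large rates in parallel is one way to obtain a receptive field that is both fine and large.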

CLC number: TP391.41

ZHANG Hao, DOU Qiwei, LUAN Guikai, YAO Shaowen, ZHOU Wei. Fast image background automatic replacement based on dilated convolution[J]. Journal of Computer Applications, 2018, 38(2): 405-409.
