计算机应用 ›› 2018, Vol. 38 ›› Issue (2): 405-409.DOI: 10.11772/j.issn.1001-9081.2017081966

• 人工智能 • 上一篇    下一篇

基于空洞卷积的快速背景自动更换

张浩, 窦奇伟, 栾桂凯, 姚绍文, 周维   

  1. 云南大学 软件学院, 昆明 650091
  • 收稿日期:2017-08-11 修回日期:2017-09-09 出版日期:2018-02-10 发布日期:2018-02-10
  • 通讯作者: 周维
  • 作者简介:张浩(1992-),男,云南玉溪人,硕士研究生,主要研究方向:深度学习;窦奇伟(1994-),男,内蒙古包头人,硕士研究生,主要研究方向:深度学习;栾桂凯(1993-),男,山东烟台人,硕士研究生,主要研究方向:深度学习;姚绍文(1966-),男,云南昆明人,教授,博士,主要研究方向:工作流、Petri网;周维(1974-),男,云南昆明人,教授,博士,主要研究方向:分布式处理、生物信息学。
  • 基金资助:
    国家自然科学基金资助项目(61762089,61363021,61640306)。

Fast image background automatic replacement based on dilated convolution

ZHANG Hao, DOU Qiwei, LUAN Guikai, YAO Shaowen, ZHOU Wei   

  1. School of Software, Yunnan University, Kunming Yunnan 650091, China
  • Received:2017-08-11 Revised:2017-09-09 Online:2018-02-10 Published:2018-02-10
  • Supported by:
    This work is partially supported by the National Natural Science Foundation of China (61762089, 61363021, 61640306).

摘要: 针对背景更换过程复杂性较高导致传统方法效率低下并且精确度难以提高的问题,提出一种基于空洞卷积的快速图像背景更换方法——FABRNet。首先,采用VGG(Visual Geometry Group network)模型中前三部分网络结构对输入图片进行卷积和池化操作;其次,多组空洞卷积并联组合使得网络拥有足够大和足够细的感受野,并且加上残差网络结构来保证卷积过程中信息位置分布的准确性;最后,通过双线性插值算法将图片缩放到原图尺寸输出。在实验部分,与三种经典方法KNN(K-Nearest Neighbor)matting、Portrait matting和Deep matting进行了对比,结果表明,FABRNet能够有效地完成背景自动更换的操作,并且在速度方面有一定的优势。

关键词: 深度学习, 空洞卷积, 残差网络, 背景更换, 双线性插值

Abstract: Because of complexity of background replacement, the traditional method is inefficient and the accuracy is difficult to improve. To solve these problems, a fast image background replacement method based on dilated convolution, called FABRNet, was proposed. First of all, the first three parts of VGG (Visual Geometry Group network) model were used for convolution and pooling operations of input images. Secondly, the combination of multiple sets of dilated convolutions were embedded into convolution neural network to make the network have a large and fine enough receptive field; meanwhile, the residual network structure was used to ensure the accuracy of the information distribution in the convolution process. Finally, the image was scaled to the original size and output by bilinear interpolation algorithm. Compared with three classical methods such as KNN (K-Nearest Neighbors) matting, Portrait matting and Deep matting, the experimental results show that FABRNet can effectively complete the background automatic replacement, and has advantages in running speed.

Key words: deep learning, dilated convolution, residual network, background replacement, bilinear interpolation

中图分类号: