融合时空多特征表示的无监督视频分割算法

doi:10.11772/j.issn.1001-9081.2017.11.3134

计算机应用 ›› 2017, Vol. 37 ›› Issue (11): 3134-3138.DOI: 10.11772/j.issn.1001-9081.2017.11.3134

• 第十六届中国机器学习会议(CCML 2017) • 上一篇下一篇

融合时空多特征表示的无监督视频分割算法

李雪君, 张开华, 宋慧慧

江苏省大数据分析技术重点实验室(南京信息工程大学), 南京 210044

收稿日期:2017-05-16 修回日期:2017-05-31 出版日期:2017-11-10 发布日期:2017-11-11
通讯作者: 张开华
作者简介:李雪君(1993-),女,江苏南京人,硕士研究生,主要研究方向:视频分割;张开华(1983-),男,山东日照人,教授,博士,CCF会员,主要研究方向:目标跟踪、水平集图像分割;宋慧慧(1986-),女,山东聊城人,教授,博士,主要研究方向:遥感影像处理。
基金资助:
国家自然科学基金资助项目（61402233，41501377）；江苏省自然科学基金资助项目（BK20151529，BK20150906）。

Unsupervised video segmentation by fusing multiple spatio-temporal feature representations

LI Xuejun, ZHANG Kaihua, SONG Huihui

Jiangsu Key Laboratory of Big Data Analysis Technology(Nanjing University of Information Science and Technology), Nanjing Jiangsu 210044, China

Received:2017-05-16 Revised:2017-05-31 Online:2017-11-10 Published:2017-11-11
Supported by:
This work is partially supported by the National Natural Science Foundation of China (61402233, 41501377), the Natural Science Foundation of Jiangsu Province (BK20151529,BK20150906).

摘要/Abstract

摘要： 针对视频分割的难点在于分割目标的无规则运动、快速变换的背景、目标外观的任意变化与形变等，提出了一种基于时空多特征表示的无监督视频分割算法，通过融合像素级、超像素级以及显著性三类特征设计由细粒度到粗粒度的稳健特征表示。首先，采用超像素分割对视频序列进行处理以提高运算效率，并设计图割算法进行快速求解；其次，利用光流法对相邻帧信息进行匹配，并通过K-D树算法实现最近邻搜索以引入各超像素的非局部时空颜色特征，从而增强分割的鲁棒性；然后，对采用超像素计算得到的分割结果，设计混合高斯模型进行完善；最后，引入图像的显著性特征，协同超像素分割与混合高斯模型的分割结果，设计投票获得更加准确的视频分割结果。实验结果表明，所提算法是一种稳健且有效的分割算法，其结果优于当前大部分无监督视频分割算法及部分半监督视频分割算法。

关键词: 超像素分割, K-D树, 混合高斯模型, 图割算法, 光流法

Abstract: Due to random movement of the segmented target, rapid change of background, arbitrary variation and shape deformation of object appearance, in this paper, a new unsupervised video segmentation algorithm based on multiple spatial-temporal feature representations was presented. By combination of salient features and other features obtained from pixels and superpixels, a coarse-to-fine-grained robust feature representation was designed to represent each frame in a video sequence. Firstly, a set of superpixels was generated to represent foreground and background in order to improve computational efficiency and get segmentation results by graph-cut algorithm. Then, the optical flow method was used to propagate information between adjacent frames, and the appearance of each superpixel was updated by its non-local sptatial-temporal features generated by nearest neighbor searching method with efficient K-Dimensional tree (K-D tree) algorithm, so as to improve robustness of segmentation. After that, for segmentation results generated in superpixel-level, a new Gaussian mixture model based on pixels was constructed to achieve pixel-level refinement. Finally, the significant feature of image was introduced, as well as segmentation results generated by graph-cut and Gaussian mixture model, to obtain more accurate segmentation results by voting scheme. The experimental results show that the proposed algorithm is a robust and effective segmentation algorithm, which is superior to most unsupervised video segmentation algorithms and some semi-supervised video segmentation algorithms.

Key words: superpixel segmentation, K-Dimensional tree (K-D tree), Gaussian Mixture Model (GMM), graph-cut algorithm, optical flow method

中图分类号:

TP312

李雪君, 张开华, 宋慧慧. 融合时空多特征表示的无监督视频分割算法[J]. 计算机应用, 2017, 37(11): 3134-3138.

LI Xuejun, ZHANG Kaihua, SONG Huihui. Unsupervised video segmentation by fusing multiple spatio-temporal feature representations[J]. Journal of Computer Applications, 2017, 37(11): 3134-3138.

参考文献

[1] YONG J L, KIM J, GRAUMAN K. Key-segments for video object segmentation[C]//Proceedings of the 2011 International Conference on Computer Vision. Washington, DC:IEEE Computer Society, 2011:1995-2002.
[2] PAPAZOGLOU A, FERRARI V. Fast object segmentation in unconstrained video[C]//Proceedings of the 2013 IEEE International Conference on Computer Vision. Washington, DC:IEEE Computer Society, 2013:1777-1784.
[3] WEN L, DU D, LEI Z, et al. JOTS:joint online tracking and segmentation[C]//Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway, NJ:IEEE, 2015:2226-2234.
[4] BROX T, MALIK J. Object segmentation by long term analysis of point trajectories[C]//Proceedings of the 11th European Conference on Computer Vision. Berlin:Springer-Verlag, 2010:282-295.
[5] NAGARAGA N S, SCHMIDT F R, BROX T. Video segmentation with just a few strokes[C]//Proceedings of the 2015 IEEE International Conference on Computer Vision. Piscataway, NJ:IEEE, 2015:3235-3243.
[6] 于跃龙, 卢焕章. 基于背景构造的视频对象分割技术[J]. 计算机工程与科学, 2006, 28(1):36-38.(YU Y L, LU H Z. Video object segmentation technology based on background construction[J]. Computer Engineering and Science, 2006, 28(1):36-38.)
[7] CULIBRK D, MARQUES O, SOCEK D, et al. Neural network approach to background modeling for video object segmentation[J]. IEEE Transactions on Neural Networks, 2007, 18(6):1614-1627.
[8] 纪腾飞,王世刚,周茜,等. 基于动静背景下的视频对象自适应提取算法[J].吉林大学学报(信息科学版), 2007, 25(1):73-77.(JI T F, WANG S G, ZHOU Q, et al. Adaptive algorithm of video object segmentation under moving and static background[J]. Journal of Jilin University (Information Science Edition), 2007, 25(1):73-77.)
[9] 马丽红, 张宇, 邓健平. 基于形态开闭滤波二值标记和纹理特征合并的分水岭算法[J]. 中国图象图形学报, 2003, 8(1):80-86.(MA L H, ZHANG Y, DENG J P. A target segmentation algorithm based on opening closing binary marker on watersheds and texture merging[J]. Journal of Image and Graphics, 2003, 8(1):80-86.)
[10] CHOI J G, LEE S W, KIM S D. Spatio-temporal video segmentation using a joint similarity measure[J]. IEEE Transactions on Circuits and Systems for Video Technology, 1997, 7(2):279-286.
[11] 黄波, 杨勇, 王桥,等. 一种基于时空联合的视频分割算法[J]. 电子学报, 2001, 29(11):1491-1494.(HUANG B, YANG Y,WANG Q, et al. Video segmentation based on spatio-temporal information[J]. Acta Electronica Sinica, 2001, 29(11):1491-1494.)
[12] 维基百科. 光流法[EB/OL].[2017-05-01].http://zh.wikipedia.org/wiki/%E5%85%89%E6%B5%81%E6%B3%95. (Wikipedia. Optical Flow Method[EB/OL].[2017-05-01].http://zh.wikipedia.org/wiki/%E5%85%89%E6%B5%81%E6%B3%95.
[13] LEVINSHTEIN A, STERE A, KUTULAKOS K N, et al. TurboPixels:fast superpixels using geometric flows[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2009, 31(12):2290-2297.
[14] BOYKOV Y, VEKSLER O, ZABIH R. Fast approximate energy minimization via graph cuts[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2001, 23(11):1222-1239.
[15] LI F, KIM T, HUMAYUN A, et al. Video segmentation by tracking many figure-ground segments[C]//Proceedings of the 2014 IEEE International Conference on Computer Vision. Piscataway, NJ:IEEE, 2014:2192-2199.
[16] VEDALDI A, FULKERSON B. Vlfeat:an open and portable library of computer vision algorithms[C]//Proceedings of the 18th ACM International Conference on Multimedia. New York:ACM, 2010:1469-1472.
[17] GOFERMAN S, ZELINKMANOR L, TAL A. Context-aware saliency detection[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2012, 34(10):1915-1926.
[18] TSAI D, FLAGG M, NAKAZAWA A, et al. Motion coherent tracking using multi-label MRF optimization[J]. International Journal of Computer Vision, 2012, 100(2):190-202.
[19] CAIZ, WEN L, LEI Z, et al. Robust deformable and occluded object tracking with dynamic graph[J]. IEEE Transactions on Image Processing, 2014, 23(12):5497.
[20] JAIN S D, GRAUMAN K. Supervoxel-consistent foreground propagation in video[C]//Proceedings of the 13th European Conference on Computer Vision. Berlin:Springer, 2014:656-671.
[21] OCHS P, BROX T. Higher order motion models and spectral clustering[C]//Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition. Washington, DC:IEEE Computer Society, 2012:614-621.

融合时空多特征表示的无监督视频分割算法

Unsupervised video segmentation by fusing multiple spatio-temporal feature representations

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

[1]	张家岗, 李达平, 杨晓东, 邹茂扬, 吴锡, 胡金蓉. 基于深度卷积特征光流的形变医学图像配准算法[J]. 计算机应用, 2020, 40(6): 1799-1805.
[2]	王书朋, 赵瑶. 基于自适应分割的多曝光图像融合算法[J]. 计算机应用, 2020, 40(1): 252-257.
[3]	李登刚, 陈香香, 李华丽, 王忠美. 基于超像素的流形正则化稀疏约束NMF混合像元分解算法[J]. 计算机应用, 2019, 39(10): 3100-3106.
[4]	胡学敏, 易重辉, 陈钦, 陈茜, 陈龙. 基于运动显著图的人群异常行为检测[J]. 计算机应用, 2018, 38(4): 1164-1169.
[5]	刘宇, 金伟正, 范赐恩, 邹炼. 使用超像素分割与图割的网状遮挡物检测算法[J]. 计算机应用, 2018, 38(1): 238-245.
[6]	朱明敏, 胡茂海. 基于相关滤波器的长时视觉目标跟踪方法[J]. 计算机应用, 2017, 37(5): 1466-1470.
[7]	孙少超. 基于带权核范数最小化和混合高斯模型的去噪模型[J]. 计算机应用, 2017, 37(5): 1471-1474.
[8]	姬丽娜, 陈庆奎, 陈圆金, 赵德玉, 方玉玲, 赵永涛. 基于GPU的视频流人群实时计数[J]. 计算机应用, 2017, 37(1): 145-152.
[9]	高智勇, 唐文峰, 贺良杰. 基于运动显著性的移动镜头下的运动目标检测[J]. 计算机应用, 2016, 36(6): 1692-1698.
[10]	潘磊, 周欢, 王明辉. 适用于密集人群的异常事件实时检测方法[J]. 计算机应用, 2016, 36(6): 1719-1723.
[11]	何传阳, 王平, 张晓华, 宋丹妮. 基于智能监控的中小人群异常行为检测[J]. 计算机应用, 2016, 36(6): 1724-1729.
[12]	刘志远, 李华锋. 对比度与空间位置关系驱动的显著性检测[J]. 计算机应用, 2016, 36(3): 795-799.
[13]	兰丽, 何小海, 吴晓红, 滕奇志. 基于超像素和局部颜色恒常性的自适应阴影去除[J]. 计算机应用, 2016, 36(10): 2837-2841.
[14]	姜伟, 吕晓琪, 任晓颖, 任国印. 结合区域生长与图割算法的冠状动脉CT血管造影图像三维分割[J]. 计算机应用, 2015, 35(5): 1462-1466.
[15]	华媛蕾刘万军. 改进混合高斯模型的运动目标检测算法[J]. 计算机应用, 2014, 34(2): 580-584.