CCML2017+218+联合时空多特征表示的无监督视频分割方法

摘要/Abstract

摘要： 摘要: 视频分割是计算机视觉技术中最困难的问题之一，分割的难点在于分割目标的无规则运动、快速变换的背景、目标外观的任意变化与形变等。本文提出了一种基于时空多特征表示的无监督视频分割算法，通过融合像素级、超像素级以及显著性三类特征设计由细粒度到粗粒度的稳健特征表示。首先，采用超像素分割对视频序列进行处理以提高运算效率，并设计图割算法进行快速求解；其次，利用光流法对相邻帧信息进行匹配，并通过KD树算法（K-dimension tree）实现最近邻搜索以引入各超像素的非局部时空颜色特征，从而增强分割的鲁棒性；然后，对采用超像素计算得到的分割结果，设计混合高斯模型进行完善；最后，引入图像的显著性特征，协同超像素分割与混合高斯模型的分割结果，设计投票获得更加准确的视频分割结果。实验结果表明该算法是一种稳健且有效的分割算法，其结果优于当前大部分无监督视频分割算法及部分半监督视频分割算法。

关键词: 关键词: 超像素分割, KD树, 混合高斯模型, 图割算法, 光流法

Abstract: Abstract: Video object segmentation is one of the most difficult problems in computer vision due to the factors like fast moving objects, cluttered backgrounds, arbitrary object appearance variation and shape deformation. In this paper we present a new unsupervised video segmentation algorithm based on multiple spatio-temporal feature representations. By the combination of saliency characteristics and other features obtained from pixels and superpixels, we design a coarse to fine-grained robust feature representation to represent each frame in a video sequence. First, we generate a set of superpixels to represent the foreground and background in order to improve computational efficiency and get segmentation results by graph-cut algorithm; then, the optical flow can be used to propagate information between adjacent frames, and the appearance of each superpixel can be updated by its non-local sptatio-temporally counterparts generated by the nearest neighboring searching method with the efficient K-dimension tree (KD-tree ) algorithm, so as to improve the robustness of the segmentation; after that, for segmentation results generated in superpixel-level, we construct a new Gaussian mixture model based on pixels to achieve pixel level refinement; finally, the algorithm calculate the saliency characteristics of each frame, as well as segmentation results generated by graph cut and Gaussian mixture model, to obtain a more accurate segmentation results by voting scheme. Extensive evaluations on the SegTrack dataset demonstrate the effectiveness of the proposed method, which performs favorably against some state-of-art methods.

Key words: Keywords：superpixel segmentation, KD-tree, gaussian mixture model, graph cut algorithm, optical flow

中图分类号:

中图分类号:TP312

李雪君张开华宋慧慧. CCML2017+218+联合时空多特征表示的无监督视频分割方法[J]. 计算机应用.

[1]	张家岗, 李达平, 杨晓东, 邹茂扬, 吴锡, 胡金蓉. 基于深度卷积特征光流的形变医学图像配准算法[J]. 计算机应用, 2020, 40(6): 1799-1805.
[2]	胡学敏, 易重辉, 陈钦, 陈茜, 陈龙. 基于运动显著图的人群异常行为检测[J]. 计算机应用, 2018, 38(4): 1164-1169.
[3]	朱明敏, 胡茂海. 基于相关滤波器的长时视觉目标跟踪方法[J]. 计算机应用, 2017, 37(5): 1466-1470.
[4]	孙少超. 基于带权核范数最小化和混合高斯模型的去噪模型[J]. 计算机应用, 2017, 37(5): 1471-1474.
[5]	李雪君, 张开华, 宋慧慧. 融合时空多特征表示的无监督视频分割算法[J]. 计算机应用, 2017, 37(11): 3134-3138.
[6]	刘彤, 黄修添, 马建设, 苏萍. 基于完全联系的条件随机场的图像标注[J]. 计算机应用, 2017, 37(10): 2841-2846.
[7]	姬丽娜, 陈庆奎, 陈圆金, 赵德玉, 方玉玲, 赵永涛. 基于GPU的视频流人群实时计数[J]. 计算机应用, 2017, 37(1): 145-152.
[8]	高智勇, 唐文峰, 贺良杰. 基于运动显著性的移动镜头下的运动目标检测[J]. 计算机应用, 2016, 36(6): 1692-1698.
[9]	潘磊, 周欢, 王明辉. 适用于密集人群的异常事件实时检测方法[J]. 计算机应用, 2016, 36(6): 1719-1723.
[10]	何传阳, 王平, 张晓华, 宋丹妮. 基于智能监控的中小人群异常行为检测[J]. 计算机应用, 2016, 36(6): 1724-1729.
[11]	姜伟, 吕晓琪, 任晓颖, 任国印. 结合区域生长与图割算法的冠状动脉CT血管造影图像三维分割[J]. 计算机应用, 2015, 35(5): 1462-1466.
[12]	华媛蕾刘万军. 改进混合高斯模型的运动目标检测算法[J]. 计算机应用, 2014, 34(2): 580-584.
[13]	邵楠张科. 基于投影熵特征的图像识别算法[J]. 计算机应用, 2013, 33(10): 2874-2877.
[14]	李鸿生薛月菊黄晓琳黄珂何金辉. 改进的自适应混合高斯前景检测方法[J]. 计算机应用, 2013, 33(09): 2610-2613.
[15]	黄杨昱胡伟袁国栋. 基于全局路径规划的相互速度障碍物人群疏散方法[J]. 计算机应用, 2013, 33(06): 1753-1758.

CCML2017+218+联合时空多特征表示的无监督视频分割方法

CCML2017+218+Fusing Multiple Spatio-Temporal Feature representations Based Unsupervised Video Segmentation

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics