• •    

CCML2021+276: 基于时空上下文信息增强的目标跟踪算法

温静,李强   

  1. 山西大学
  • 收稿日期:2021-06-18 修回日期:2021-07-18 发布日期:2021-07-18
  • 通讯作者: 温静

Object tracking algorithm based on spatio-temporal context information enhancement

  • Received:2021-06-18 Revised:2021-07-18 Online:2021-07-18

摘要: 摘 要: 充分利用视频中的时空上下文信息能明显提高目标跟踪性能。然而,目前大多数基于深度学习的目标跟踪算法仅利用当前帧的特征信息来定位目标,而没有利用同一目标在视频前后帧的时空上下文特征信息,导致跟踪目标易受到邻近相似目标的干扰,从而在跟踪定位时会引入一个潜在的累计误差。为了保留时空上下文信息,本文引入一个短期记忆存储池来存储历史帧特征。同时,本文提出了外观显著性特征增强模块,一方面增强跟踪目标的显著性特征,另一方面抑制周围相似目标对目标的干扰。基于此,本文提出一种基于时空上下文信息增强的目标跟踪算法。在VOT2016、VOT2018、DAVIS-2016和DAVIS-2017等四个数据集上进行实验与分析,本文方法相较于SiamMask算法在VOT2016上准确率和平均重叠率分别提高了4%和2%,在VOT2018的准确率、鲁棒性和平均重叠率提高了3.7%、4.2%和1%。在DAVIS-2016和DAVIS-2017的区域相似度,轮廓精度分别提高了2%和1%。

关键词: 目标跟踪, 上下文信息, 显著特征, 特征增强, 深度学习

Abstract: Abstract: Making full use of the temporal and spatial context information in the video can significantly improve the performance of object tracking. However, most of the current algorithms based on deep learning only use the feature and information of the current frame to locate the object, the tracking object is apt to interference from the similar object nearby, as well as the potential error will be cumulatively introduced into tracking process. Therefore, it is necessary to utilize the spatio-temporal context information of the object of interest in the previous and latter frame of the video. In order to retain temporal and spatial context information, a short-term memory storage pool is introduced to store features of the previous frames. At the same time, we propose an appearance saliency feature boosting module (ASBM), which not only enhances the saliency feature of the tracking object, but also restrains the interference from similar object around the tracking object. In this paper, we propose an object tracking algorithm based on spatio-temporal context information enhancement. We have designed experiments on four data sets, which are VOT2016, VOT2018, DAVIS-2016 and DAVIS-2017, to verify and compare the performance of our method. Compared with the SiamMask algorithm, the accuracy and average overlap rate of the proposed method in this paper are increased by 4% and 2%, respectively. The accuracy, robustness, and average overlap rate are improved by 3.7%, 4.2% and 1%. In the regional similarity of DAVIS-2016 and DAVIS-2017, the contour accuracy has been improved by 2% and 1%, respectively.

Key words: object tracking, contextual information, salient features, feature enhancement, deep learning

中图分类号: