《计算机应用》唯一官方网站 ›› 2021, Vol. 41 ›› Issue (12): 3565-3570.DOI: 10.11772/j.issn.1001-9081.2021061034

• 第十八届中国机器学习会议(CCML 2021) • 上一篇    

基于时空上下文信息增强的目标跟踪算法

温静(), 李强   

  1. 山西大学 计算机与信息技术学院,太原 030006
  • 收稿日期:2021-05-12 修回日期:2021-07-18 接受日期:2021-07-22 发布日期:2021-12-28 出版日期:2021-12-10
  • 通讯作者: 温静
  • 作者简介:李强(1995—),男,山西大同人,硕士研究生,主要研究方向:计算机视觉、图像处理。
  • 基金资助:
    山西省研究生教育改革研究课题(2020YJJG030)

Object tracking algorithm based on spatio-temporal context information enhancement

Jing WEN(), Qiang LI   

  1. School of Computer and Information Technology,Shanxi University,Taiyuan Shanxi 030006,China
  • Received:2021-05-12 Revised:2021-07-18 Accepted:2021-07-22 Online:2021-12-28 Published:2021-12-10
  • Contact: Jing WEN
  • About author:LI Qiang, born in 1995, M. S. candidate. His research interests include computer vision, image processing.
  • Supported by:
    Research Project of Postgraduate Education Reform in Shanxi Province(2020YJJG030)

摘要:

充分利用视频中的时空上下文信息能明显提高目标跟踪性能,但目前大多数基于深度学习的目标跟踪算法仅利用当前帧的特征信息来定位目标,没有利用同一目标在视频前后帧的时空上下文特征信息,导致跟踪目标易受到邻近相似目标的干扰,从而在跟踪定位时会引入一个潜在的累计误差。为了保留时空上下文信息,在SiamMask算法的基础上引入一个短期记忆存储池来存储历史帧特征;同时,提出了外观显著性增强模块(ASBM),一方面增强跟踪目标的显著性特征,另一方面抑制周围相似目标对目标的干扰。基于此,提出一种基于时空上下文信息增强的目标跟踪算法。在VOT2016、VOT2018、DAVIS-2016和DAVIS-2017等四个数据集上进行实验与分析,结果表明所提出的算法相较于SiamMask算法在VOT2016上的准确率和平均重叠率(EAO)分别提升了4个百分点和2个百分点;在VOT2018上的准确率、鲁棒性和EAO分别提升了3.7个百分点、2.8个百分点和1个百分点;在DAVIS-2016上的区域相似度、轮廓精度指标中的下降率均分别降低了0.2个百分点;在DAVIS-2017上的区域相似度、轮廓精度指标中的下降率分别降低了1.3和0.9个百分点。

关键词: 目标跟踪, 上下文信息, 显著特征, 特征增强, 深度学习

Abstract:

Making full use of the spatio-temporal context information in the video can significantly improve the performance of object tracking, but most of the current object tracking algorithms based on deep learning only use the feature information of the current frame to locate the object, without using the spatio-temporal context information of the same object in the video frames before and after the current frame, which leads to the tracking object being susceptible to the interference from the similar object nearby, so a potential cumulative error will be introduced during tracking and locating. In order to retain spatio-temporal context information, a short-term memory storage pool was introduced based on SiamMask algorithm to store features of the historical frames; meanwhile, an Appearance Saliency Boosting Module (ASBM) was proposed, which not only enhanced the saliency features of the tracking object, but also suppressed the interference from similar object around the tracking object. On the basis of the above, an object tracking algorithm based on spatio-temporal context information enhancement was proposed. To verify the performance of the proposed algorithm, experiments were carried out on four datasets, including VOT2016, VOT2018, DAVIS-2016 and DAVIS-2017. Experimental results show that compared with SiamMask algorithm, the proposed algorithm has the accuracy and Expected Average Overlap rate (EAO) increased by 4 percentage points and 2 percentage points respectively on VOT2016 dataset, and has the accuracy, robustness and EAO improved by 3.7 percentage points, 2.8 percentage points and 1 percentage point respectively on VOT2018 dataset, and has the decay of the regional similarity and contour accuracy indicators on DAVIS-2016 datasets both reduced by 0.2 percentage points, and has the decay of the regional similarity and contour progress indicators on DAVIS-2017 datasets reduced by 1.3 and 0.9 percentage points respectively.

Key words: object tracking, context information, salient feature, feature enhancement, deep learning

中图分类号: