《计算机应用》唯一官方网站 ›› 2023, Vol. 43 ›› Issue (12): 3733-3739.DOI: 10.11772/j.issn.1001-9081.2022111790

• 人工智能 • 上一篇    下一篇

基于孪生网络和Transformer的小目标跟踪算法SiamTrans

公海涛1, 陈志华1(), 盛斌2, 祝冰艳1   

  1. 1.华东理工大学 信息科学与工程学院,上海 200237
    2.上海交通大学 电子信息与电气工程学院,上海 200240
  • 收稿日期:2022-12-06 修回日期:2023-02-23 接受日期:2023-02-27 发布日期:2023-03-13 出版日期:2023-12-10
  • 通讯作者: 陈志华
  • 作者简介:公海涛(1998—),男,山东临沂人,硕士研究生,主要研究方向:计算机视觉、深度学习
    陈志华(1969—),男,江西上饶人,教授,博士,CCF杰出会员,主要研究方向:计算机视觉、机器学习;Email:czh@ecust.edu.cn
    盛斌(1981—),男,湖北武汉人,教授,博士,CCF会员,主要研究方向:虚拟现实、计算机图形学
    祝冰艳(1998—),女,安徽六安人,硕士研究生,主要研究方向:计算机视觉、深度学习。
  • 基金资助:
    空间智能控制技术全国重点实验室开放基金课题(HTKJ2022KL502010)

SiamTrans: tiny object tracking algorithm based on Siamese network and Transformer

Haitao GONG1, Zhihua CHEN1(), Bin SHENG2, Bingyan ZHU1   

  1. 1.School of Information Science and Engineering,East China University of Science and Technology,Shanghai 200237,China
    2.School of Electronic Information and Electrical Engineering,Shanghai Jiao Tong University,Shanghai 200240,China
  • Received:2022-12-06 Revised:2023-02-23 Accepted:2023-02-27 Online:2023-03-13 Published:2023-12-10
  • Contact: Zhihua CHEN
  • About author:GONG Haitao, born in 1998, M. S. candidate. His research interests include computer vision, deep learning.
    SHENG Bin, born in 1981, Ph. D., professor. His research interests include virtual reality, computer graphics.
    ZHU Bingyan, born in 1998, M. S. candidate. Her research interests include computer vision, deep learning.
  • Supported by:
    Fund Project of National Key Laboratory of Space Intelligent Control(HTKJ2022KL502010)

摘要:

针对现有小目标跟踪算法的鲁棒性差、精度及成功率低的问题,提出一种基于孪生网络和Transformer的小目标跟踪算法SiamTrans。首先,基于Transformer机制设计一种相似度响应图计算模块。该模块叠加若干层特征编码-解码结构,并利用多头自注意力机制和多头跨注意力机制在不同层次的搜索区域特征图中查询模板特征图信息,从而避免陷入局部最优解,并获得一个高质量的相似度响应图;其次,在预测子网中设计一个基于Transformer机制的预测模块(PM),并利用自注意力机制处理预测分支特征图中的冗余特征信息,以提高不同预测分支的预测精度。在Small90数据集上,相较于TransT(Transformer Tracking)算法,所提算法的跟踪精度和跟踪成功率分别高8.0和9.5个百分点。可见,所提出的算法具有更优异的小目标跟踪性能。

关键词: 目标跟踪, 小目标, 孪生网络, 注意力机制, Transformer

Abstract:

Aiming at the problems of poor robustness, low precision and success rate in the existing tiny object tracking algorithms, a tiny object tracking algorithm, SiamTrans, was proposed on the basis of Siamese network and Transformer. Firstly, a similarity response map calculation module was designed based on the Transformer mechanism. In the module, several layers of feature encoding-decoding structures were superimposed, and multi-head self-attention and multi-head cross-attention mechanisms were used to query template feature map information in feature maps of different levels of search regions, which avoided falling into local optimal solutions and obtained a high-quality similarity response map. Secondly, a Prediction Module (PM) based on Transformer mechanism was designed in the prediction subnetwork, and the self-attention mechanism was used to process redundant feature information in the prediction branch feature maps to improve the prediction precisions of different prediction branches. Experimental results on Small90 dataset show that, compared to the TransT (Transformer Tracking) algorithm, the tracking precision and tracking success rate of the proposed algorithm are 8.0 and 9.5 percentage points higher, respectively. It can be seen that the proposed algorithm has better tracking performance for tiny objects.

Key words: object tracking, tiny object, Siamese network, attention mechanism, Transformer

中图分类号: