• •    

CCDM2022+220_融合人体全身表观信息的行人头部跟踪

张广耀,宋纯锋   

  1. 中国科学院自动化研究所
  • 收稿日期:2022-03-28 修回日期:2022-05-14 发布日期:2022-06-29
  • 通讯作者: 宋纯锋
  • 基金资助:
    国家自然科学基金

Tracking Pedestrian Heads in Crowd with Global Body Appearance Feature Fusion

  • Received:2022-03-28 Revised:2022-05-14 Online:2022-06-29
  • Supported by:
    National Natural Science Foundation of China

摘要: 随着深度神经网络的广泛应用,多目标跟踪这一任务已经取得了巨大的进展。然而,密集场景下的行人跟踪仍然是一个具有挑战性的问题,其主要原因是行人之间严重的遮挡给运动模型和行人表观信息提取带来了巨大的挑战。行人头部跟踪相比于行人身体跟踪遮挡的情况更少,因此头部跟踪任务在最近引起了研究者的广泛关注。本文设计了一种联合身体表观特征的行人头部跟踪模型,本文设计的模型总共有两个主要的模块。第一,行人身体的检测框内含有更丰富的纹理信息,从而可以提取更好的表观特征,因此本文设计了一种动态的由头部检测框生成全身检测框的生成网络。第二,全身检测框之间存在互相重叠,导致较为严重的遮挡问题。为了能够提取低噪声的行人表观信息,本文利用人体姿态估计的信息作为引导,使得行人重识别网络更好的关注到非遮挡部分,进而提取到去噪声的行人表观信息。通过上述两个模块,本文设计的模型在行人头部跟踪的基准数据集上取得了当前最好的效果。此外,本文设计的模型还可以应用于行人的全身跟踪任务。

关键词: 多目标跟踪, 动态模型, 特征匹配, 人头跟踪, 行人重识别

Abstract: Substantial improvement has been achieved in the field of Multi-Object-Tracking due to the successful application of deep neural networks. However, tracking pedestrians in crowd scene remains a challenging problem in computer vision. Previous methods work well in common scenes but failed in the crowded situation since severe occlusion between pedestrians that makes the motion model and appearance feature extraction failed. Inspired by the less occluded head part, we propose a part-based motion and Re-ID model which makes full use of both the head part and full body for tracking. There are mainly two modules in the proposed method. Firstly, instead of using full body bounding box for motion modeling, we use the head part to model the pedestrian position since the head part has few occlusions. Secondly, in order to get denoised full-body appearance feature in crowded scene, we propose an adaptive full-body bounding box generator to obtain full-body bounding box from head bounding box and use the human pose to guide the Re-ID model to extract appearance features. Our model achieves the state-of-the-art result on Head Tracking 21 dataset. We show the necessity of each module of our approach with extensive ablative study. Moreover, by adaptively generating full-body bounding box, our approach could also be used for full-body tracking.

Key words: Multi-object-tracking, Motion Model, Feature Matching, Head Tracking, Occluded Person Re-ID

中图分类号: