《计算机应用》唯一官方网站 ›› 2023, Vol. 43 ›› Issue (6): 1803-1810.DOI: 10.11772/j.issn.1001-9081.2022050665

• 人工智能 • 上一篇    下一篇

基于双流结构的跨模态行人重识别关系网络

郭玉彬1,2, 文向1, 刘攀1, 李西明1,2()   

  1. 1.华南农业大学 数学与信息学院,广州 510642
    2.广州市智慧农业重点实验室(华南农业大学),广州 510642
  • 收稿日期:2022-05-08 修回日期:2022-08-09 接受日期:2022-08-11 发布日期:2023-06-08 出版日期:2023-06-10
  • 通讯作者: 李西明
  • 作者简介:郭玉彬(1973—),女,山东高唐人,副教授,博士,主要研究方向:数据库、大数据、数据挖掘、深度学习
    文向(1998—),男,湖南长沙人,硕士研究生,主要研究方向:深度学习、数据挖掘、计算机视觉
    刘攀(1992—),男,湖南耒阳人,硕士研究生,主要研究方向:深度学习、数据挖掘、计算机视觉
    李西明(1974—),男,山东临清人,高级工程师,博士,主要研究方向:深度学习、计算机视觉、信息安全Email:liximing@scau.edu.cn
  • 基金资助:
    国家自然科学基金资助项目(61872152);广州市科技计划项目(201902010081)

Cross-modal person re-identification relation network based on dual-stream structure

Yubin GUO1,2, Xiang WEN1, Pan LIU1, Ximing LI1,2()   

  1. 1.College of Mathematics and Informatics,South China Agricultural University,Guangzhou Guangdong 510642,China
    2.Guangzhou Key Laboratory of Intelligent Agriculture (South China Agricultural University),Guangzhou Guangdong 510642,China
  • Received:2022-05-08 Revised:2022-08-09 Accepted:2022-08-11 Online:2023-06-08 Published:2023-06-10
  • Contact: Ximing LI
  • About author:GUO Yubin, born in 1973, Ph. D., associate professor. Her research interests include database, big data, data mining, deep learning.
    WEN Xiang, born in 1998, M. S. candidate. His research interests include deep learning, data mining, computer vision.
    LIU Pan, born in 1992, M. S. candidate. His research interests include deep learning, data mining, computer vision.
  • Supported by:
    National Natural Science Foundation of China(61872152);Science and Technology Program of Guangzhou(201902010081)

摘要:

针对可见光-红外跨模态行人重识别中模态差异导致的识别精确率低的问题,提出了一种基于双流结构的跨模态行人重识别关系网络(IVRNBDS)。首先,利用双流结构分别提取可见光模态和红外模态行人图像的特征;然后,将行人图像的特征图水平切分为6个片段,以提取行人的每个片段的局部特征和其他片段的特征之间的关系,以及行人的核心特征和平均特征之间的关系;最后,在设计损失函数时,引入异质中心三元组损失(HC Loss)函数放松普通三元组损失函数的严格约束,从而使不同模态的图像特征可以更好地映射到同一特征空间中。在公开数据集SYSU-MM01(SunYat-Sen University MultiModal re-identification)和RegDB(Dongguk Body-based person Recognition)上的实验结果表明,虽然IVRNBDS的计算量略高于当前主流的跨模态行人重识别算法,但所提网络在相似度排名第1(Rank-1)指标和平均精度均值(mAP)指标上都有所提高,提高了跨模态行人重识别算法的识别精确率。

关键词: 行人重识别, 可见光-红外跨模态, 双流结构, 异质中心三元组损失, 局部特征

Abstract:

In visible-infrared cross-modal person re-identification, the modal differences will lead to low identification accuracy. Therefore, a dual-stream structure based cross-modal person re-identification relation network, named IVRNBDS (Infrared and Visible Relation Network Based on Dual-stream Structure), was proposed. Firstly, the dual-stream structure was used to extract the features of the visible light modal and the infrared modal person images respectively. Then, the feature map of the person image was divided into six segments horizontally to extract relationships between the local features of each segment and the features of other segments of the person and the relationship between the core features and average features of the person. Finally, when designing loss function, the Hetero-Center triplet Loss (HC Loss) function was introduced to relax the strict constraints of the ordinary triplet loss function, so that image features of different modals were able to be better mapped into the same feature space. Experimental results on public datasets SYSU-MM01 (SunYat-Sen University MultiModal re-identification) and RegDB (Dongguk Body-based person Recognition) show that the computational cost of IVRNBDS is slightly higher than those of the mainstream cross-modal person re-identification algorithms, but the proposed network has the Rank-1 (similarity Rank 1) and mAP (mean Average Precision) improved compared to the mainstream algorithms, increasing the recognition accuracy of the cross-modal people re-identification algorithm.

Key words: person re-identification, visible-infrared cross-modal, dual-stream structure, hetero-center triplet loss, local feature

中图分类号: