基于孪生网络和双向最大边界排序损失的行人再识别

doi:10.11772/j.issn.1001-9081.2018091889

计算机应用 ›› 2019, Vol. 39 ›› Issue (4): 977-983.DOI: 10.11772/j.issn.1001-9081.2018091889

基于孪生网络和双向最大边界排序损失的行人再识别

祁子梁¹, 曲寒冰^1,2, 赵传虎¹, 董良², 李博昭², 王长生²

1. 河北工业大学人工智能与数据科学学院, 天津 300401;
2. 北京市科学技术研究院北京市新技术应用研究所, 北京 100035

收稿日期:2018-09-10 修回日期:2018-10-29 发布日期:2019-04-10 出版日期:2019-04-10
通讯作者: 曲寒冰
作者简介:祁子梁(1993-),男,河北邯郸人,硕士研究生,主要研究方向:计算机视觉、行人再识别;曲寒冰(1977-),男,黑龙江哈尔滨人,副研究员,博士,CCF会员,主要研究方向:机器学习、计算机视觉、生物识别、图像处理;赵传虎(1993-),男,河南平顶山人,硕士,主要研究方向:机器学习、数据挖掘;董良(1990-),男,河北邢台人,硕士,主要研究方向:数据挖掘、知识发现、机器学习、时空模式、社会网络;李博昭(1993-),女,河北邢台人,硕士,主要研究方向:机器学习、图像处理、模式识别;王长生(1989-),男,山东潍坊人,硕士研究生,主要研究方向:数据挖掘、机器学习。
基金资助:
国家重点研发计划项目（2018YFC08097000，2018YFC0704800，2018YFF0301000）；国家自然科学基金资助项目（91746207）；北京市科学技术研究院萌芽计划项目（GS201817）。

Person re-identification based on Siamese network and bidirectional max margin ranking loss

QI Ziliang¹, QU Hanbing^1,2, ZHAO Chuanhu¹, DONG Liang², LI Bozhao², WANG changsheng²

1. School of Artificial Intelligence, Hebei University of Technology, Tianjin 300401, China;
2. Beijing Institute of New Technology Applications, Beijing Academy of Science and Technology, Beijing 100035, China

Received:2018-09-10 Revised:2018-10-29 Online:2019-04-10 Published:2019-04-10
Supported by:
This work is partially supported by the National Key R&D Program of China (2018YFC08097000, 2018YFC0704800, 2018YFF0301000), the National Natural Science Foundation of China (91746207), the Beijing Academy of Science and Technology Budding Plan (GS201817).

摘要/Abstract

摘要： 针对在实际场景中存在的不同行人图像之间比相同行人图像之间更相似所造成的行人再识别准确率较低的问题，提出一种基于孪生网络并结合识别损失和双向最大边界排序损失的行人再识别方法。首先，对在超大数据集上预训练过的神经网络模型进行结构改造，主要是对最后的全连接层进行改造，使模型可以在行人再识别数据集上进行识别判断；其次，联合识别损失和排序损失监督网络在训练集上的训练，并通过正样本对的相似度值减去负样本对的相似度值大于预定阈值这一判定条件，来使得负例图像对之间的距离大于正例图像对之间的距离；最后，使用训练好的神经网络模型在测试集上测试，提取特征并比对特征之间的余弦相似度。在公开数据集Market-1501、CUHK03和DukeMTMC-reID上进行的实验结果表明，所提方法分别取得了89.4%、86.7%、77.2%的rank-1识别率，高于其他典型的行人再识别方法，并且该方法在基准网络结构下最高达到了10.04%的rank-1识别率提升。

关键词: 行人再识别, 孪生网络, 双向最大边界, 排序损失, 卷积神经网络

Abstract: Focusing on the low accuracy of person re-identification caused by that the similarity between different pedestrians' images is more than that between the same pedestrians' images in reality, a person re-identification method based on Siamese network combined with identification loss and bidirectional max margin ranking loss was proposed. Firstly, a neural network model which was pre-trained on a huge dataset, especially its final full-connected layer was structurally modified so that it can output correct results on the person re-identification dataset. Secondly, training of the network on the training set was supervised by the combination of identification loss and ranking loss. And according to that the difference between the similarity of the positive and negative sample pairs is greater than the predetermined value, the distance between negative sample pair was made to be larger than that of positive sample pair. Finally, a trained neural network model was used to test on the test set, extracting features and comparing the cosine similarity between the features. Experimental result on the open datasets Market-1501, CUHK03 and DukeMTMC-reID show that rank-1 recognition rates of the proposed method reach 89.4%, 86.7%, and 77.2% respectively, which are higher than those of other classical methods. Moreover, the proposed method can achieve a rank-1 rate improvement of up to 10.04% under baseline network structure.

Key words: person re-identification, Siamese network, bidirectional max margin, ranking loss, Convolutional Neural Network (CNN)

中图分类号:

TP391.4

祁子梁, 曲寒冰, 赵传虎, 董良, 李博昭, 王长生. 基于孪生网络和双向最大边界排序损失的行人再识别[J]. 计算机应用, 2019, 39(4): 977-983.

QI Ziliang, QU Hanbing, ZHAO Chuanhu, DONG Liang, LI Bozhao, WANG changsheng. Person re-identification based on Siamese network and bidirectional max margin ranking loss[J]. Journal of Computer Applications, 2019, 39(4): 977-983.

参考文献

[1] ZHENG L, SHEN L, TIAN L, et al. Scalable person re-identification:a benchmark[C]//ICCV 2015:Proceedings of the 2015 IEEE International Conference on Computer Vision. Piscataway, NJ:IEEE, 2015:1116-1124.
[2] MATSUKAWA T, OKABE T, SUZUKI E, et al. Hierarchical Gaussian descriptor for person re-identification[C]//CVPR 2016:Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. Washington, DC:IEEE Computer Society, 2016:1363-1372.
[3] LIAO S, HU Y, ZHU X, et al. Person re-identification by local maximal occurrence representation and metric learning[C]//CVPR 2015:Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition. Washington, DC:IEEE Computer Society, 2015:2197-2206.
[4] KOESTINGER M, HIRZER M, WOHLHART P, et al. Large scale metric learning from equivalence constraints[C]//CVPR 2012:Proceedings of the 2012 IEEE International Conference on Computer Vision. Washington, DC:IEEE Computer Society, 2012:2288-2295.
[5] WEINBERGER K Q, SAUL L K. Distance metric learning for large margin nearest neighbor classification[J]. Journal of Machine Learning Research, 2009, 10(2):207-244.
[6] ZHENG L, YANG Y, HAUPTMANN A G. Person re-identification:past, present and future[EB/OL].[2018-05-10]. https://arxiv.org/pdf/1610.02984.
[7] VARIOR R R, HALOI M, WANG G. Gated siamese convolutional neural network architecture for human re-identification[C]//ECCV 2016:Proceedings of the 2016 European Conference on Computer Vision. Berlin:Springer, 2016:791-808.
[8] 陈首兵, 王洪元, 金翠, 等. 基于孪生网络和重排序的行人重识别[J]. 计算机应用, 2018, 38(11):3161-3166. (CHEN S B, WANG H Y, JIN C, et al. Person re-identification based on siamese network and reranking[J]. Journal of Computer Applications, 2018, 38(11):3161-3166.)
[9] ZHENG Z, ZHENG L, YANG Y. A discriminatively learned CNN embedding for person reidentification[J]. ACM Transactions on Multimedia Computing, Communications, and Applications, 2017, 14(1):13.
[10] ZHENG Z, ZHENG L, YANG Y. Unlabeled samples generated by GAN improve the person re-identification baseline in vitro[C]//Proceedings of the 2017 IEEE International Conference on Computer Vision. Piscataway, NJ:IEEE, 2017:3774-3782.
[11] WANG F, ZUO W, LIN L, et al. Joint learning of single-image and cross-image representations for person re-identification[C]//CVPR 2016:Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. Washington, DC:IEEE Computer Society, 2016:1288-1296.
[12] SIMONYAN K, ZISSERMAN A. Very deep convolutional networks for large-scale image recognition[EB/OL].[2018-05-10]. https://arxiv.org/pdf/1409.1556.
[13] SZEGEDY C, LIU W, JIA Y, et al. Going deeper with convolutions[C]//CVPR 2015:Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition. Washington, DC:IEEE Computer Society, 2015:1-9.
[14] HE K, ZHANG X, REN S, et al. Deep residual learning for image recognition[C]//CVPR 2016:Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. Washington, DC:IEEE Computer Society, 2016:770-778.
[15] ZHENG Z, ZHENG L, GARRETT M, et al. Dual-path convolutional image-text embedding with instance loss[EB/OL].[2018-05-10]. https://arxiv.org/pdf/1711.05535.
[16] van der MAATEN L. Accelerating t-SNE using tree-based algorithms[J]. The Journal of Machine Learning Research, 2014, 15(1):3221-3245.
[17] LI W, ZHAO R, XIAO T, et al. DeepReid:deep filter pairing neural network for person re-identification[C]//CVPR 2014:Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition. Washington, DC:IEEE Computer Society, 2014:152-159.
[18] RISTANI E, SOLERA F, ZOU R, et al. Performance measures and a data set for multi-target, multi-camera tracking[C]//ECCV 2016:Proceedings of the 2016 European Conference on Computer Vision. Berlin:Springer, 2016:17-35.
[19] ZHANG L, XIANG T, GONG S. Learning a discriminative null space for person re-identification[C]//CVPR 2016:Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. Washington, DC:IEEE Computer Society, 2016:1239-1248.
[20] USTINOVA E, GANIN Y, LEMPITSKY V. Multi-region bilinear convolutional neural networks for person re-identification[C]//AVSS 2017:Proceedings of the 2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance. Washington, DC:IEEE Computer Society, 2017:1-6.
[21] VARIOR R R, SHUAI B, LU J, et al. A siamese long short-term memory architecture for human re-identification[C]//ECCV 2016:Proceedings of the 2016 European Conference on Computer Vision. Berlin:Springer, 2016:135-153.
[22] BARBOSA I B, CRISTANI M, CAPUTO B, et al. Looking beyond appearances:Synthetic training data for deep cnns in re-identification[J]. Computer Vision and Image Understanding, 2018, 167:50-62.
[23] WANG Y, CHEN Z, WU F, et al. Person re-identification with cascaded pairwise convolutions[C]//CVPR 2018:Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition. Washington, DC:IEEE Computer Society, 2018:1470-1478.
[24] LV J, CHEN W, LI Q, et al. Unsupervised cross-dataset person re-identification by transfer learning of spatial-temporal patterns[C]//CVPR 2018:Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition. Washington, DC:IEEE Computer Society, 2018:7948-7956.

基于孪生网络和双向最大边界排序损失的行人再识别

Person re-identification based on Siamese network and bidirectional max margin ranking loss

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

[1]	秦璟, 秦志光, 李发礼, 彭悦恒. 基于概率稀疏自注意力神经网络的重性抑郁疾患诊断[J]. 《计算机应用》唯一官方网站, 2024, 44(9): 2970-2974.
[2]	李云, 王富铕, 井佩光, 王粟, 肖澳. 基于不确定度感知的帧关联短视频事件检测方法[J]. 《计算机应用》唯一官方网站, 2024, 44(9): 2903-2910.
[3]	张春雪, 仇丽青, 孙承爱, 荆彩霞. 基于两阶段动态兴趣识别的购买行为预测模型[J]. 《计算机应用》唯一官方网站, 2024, 44(8): 2365-2371.
[4]	赵宇博, 张丽萍, 闫盛, 侯敏, 高茂. 基于改进分段卷积神经网络和知识蒸馏的学科知识实体间关系抽取[J]. 《计算机应用》唯一官方网站, 2024, 44(8): 2421-2429.
[5]	陈虹, 齐兵, 金海波, 武聪, 张立昂. 融合1D-CNN与BiGRU的类不平衡流量异常检测[J]. 《计算机应用》唯一官方网站, 2024, 44(8): 2493-2499.
[6]	熊武, 曹从军, 宋雪芳, 邵云龙, 王旭升. 基于多尺度混合域注意力机制的笔迹鉴别方法[J]. 《计算机应用》唯一官方网站, 2024, 44(7): 2225-2232.
[7]	王东炜, 刘柏辰, 韩志, 王艳美, 唐延东. 基于低秩分解和向量量化的深度网络压缩方法[J]. 《计算机应用》唯一官方网站, 2024, 44(7): 1987-1994.
[8]	高阳峄, 雷涛, 杜晓刚, 李岁永, 王营博, 闵重丹. 基于像素距离图和四维动态卷积网络的密集人群计数与定位方法[J]. 《计算机应用》唯一官方网站, 2024, 44(7): 2233-2242.
[9]	姚迅, 秦忠正, 杨捷. 生成式标签对抗的文本分类模型[J]. 《计算机应用》唯一官方网站, 2024, 44(6): 1781-1785.
[10]	黄梦源, 常侃, 凌铭阳, 韦新杰, 覃团发. 基于层间引导的低光照图像渐进增强算法[J]. 《计算机应用》唯一官方网站, 2024, 44(6): 1911-1919.
[11]	李健京, 李贯峰, 秦飞舟, 李卫军. 基于不确定知识图谱嵌入的多关系近似推理模型[J]. 《计算机应用》唯一官方网站, 2024, 44(6): 1751-1759.
[12]	沈君凤, 周星辰, 汤灿. 基于改进的提示学习方法的双通道情感分析模型[J]. 《计算机应用》唯一官方网站, 2024, 44(6): 1796-1806.
[13]	高文烁, 陈晓云. 基于节点结构的点云分类网络[J]. 《计算机应用》唯一官方网站, 2024, 44(5): 1471-1478.
[14]	孙子文, 钱立志, 杨传栋, 高一博, 陆庆阳, 袁广林. 基于Transformer的视觉目标跟踪方法综述[J]. 《计算机应用》唯一官方网站, 2024, 44(5): 1644-1654.
[15]	孙敏, 成倩, 丁希宁. 基于CBAM-CGRU-SVM的Android恶意软件检测方法[J]. 《计算机应用》唯一官方网站, 2024, 44(5): 1539-1545.