Pedestrian re-identification feature extraction method based on attention mechanism

doi:10.11772/j.issn.1001-9081.2019081356

Journal of Computer Applications ›› 2020, Vol. 40 ›› Issue (3): 672-676. DOI: 10.11772/j.issn.1001-9081.2019081356

LIU Ziyan, WAN Peipei

College of Big Data and Information Engineering, Guizhou University, Guiyang, Guizhou 550025, China

Received: 2019-08-05 Revised: 2019-10-13 Online: 2019-10-31 Published: 2020-03-10
Corresponding author: LIU Ziyan
About the authors: LIU Ziyan (born 1974), female, from Duyun, Guizhou; associate professor, M.S., CCF member. Research interests: mobile robots, deep learning, big data analysis, wireless communication systems. WAN Peipei (born 1994), male, from Anlu, Hubei; M.S. candidate. Research interests: deep learning, pedestrian re-identification.
Supported by:
This work is partially supported by the National Natural Science Foundation of China (61863006), the Joint Foundation of Guizhou Province (LH-[2017]7226), the Science and Technology Foundation of Guizhou Province ([2016]1054), the Guizhou Provincial Science and Technology Plan Key Project (20191416), and the Academic Talent Training and Innovation Exploration Special Project of Guizhou University in 2017 ([2017]5788).

Abstract

Abstract: To address the low accuracy of pedestrian re-identification across non-overlapping cameras in real environments, which is caused by differences in camera scenes, viewpoints, illumination, and other factors, a pedestrian re-identification feature extraction method based on an attention mechanism is proposed. First, random erasing is applied to the input pedestrian images as data augmentation to improve the robustness of the network. Then, a top-down attention network is constructed to enhance the saliency of spatial pixel features, and this attention network is embedded in ResNet50 to extract salient features of the whole pedestrian. Finally, similarity is measured between these whole-pedestrian salient features and the results are ranked to obtain the re-identification result. The method achieves a Rank-1 accuracy of 88.53% and a mean Average Precision (mAP) of 70.70% on the Market1501 dataset, and a Rank-1 accuracy of 77.33% and an mAP of 59.47% on the DukeMTMC-reID dataset. The proposed method clearly improves performance on both of these major pedestrian re-identification datasets and has some application value.
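The random erasing step named in the abstract can be illustrated with a minimal sketch. The abstract gives no hyperparameters, so the function name `random_erase` and the parameters `p`, `area_range`, `min_aspect`, and `max_tries` (with their default values) are illustrative assumptions, not settings taken from the paper. For simplicity the image is assumed to be a grayscale grid stored as a list of rows of 0-255 integers.

```python
import math
import random


def random_erase(image, rng, p=0.5, area_range=(0.02, 0.4),
                 min_aspect=0.3, max_tries=100):
    """Return a copy of a grayscale image (list of rows of 0-255 ints)
    with one randomly placed rectangle overwritten by random values.

    All default values are assumptions chosen for illustration only.
    """
    out = [row[:] for row in image]
    if rng.random() >= p:
        return out  # leave the image untouched with probability 1 - p
    h, w = len(image), len(image[0])
    for _ in range(max_tries):
        # Sample the rectangle's area and aspect ratio, then derive its size.
        area = rng.uniform(*area_range) * h * w
        aspect = rng.uniform(min_aspect, 1.0 / min_aspect)
        eh = round(math.sqrt(area * aspect))
        ew = round(math.sqrt(area / aspect))
        if 0 < eh < h and 0 < ew < w:
            top = rng.randint(0, h - eh)    # randint is inclusive on both ends
            left = rng.randint(0, w - ew)
            for y in range(top, top + eh):
                for x in range(left, left + ew):
                    out[y][x] = rng.randint(0, 255)
            return out
    return out  # no rectangle fit within max_tries; return the plain copy
```

Passing an explicit `random.Random` instance as `rng` keeps the augmentation reproducible. In a training pipeline the same idea would normally be applied to image tensors by a library transform rather than to nested lists.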

Key words: pedestrian re-identification, feature learning, attention mechanism, data augmentation, salient feature
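The final step in the abstract, measuring similarity, ranking, and reporting Rank-1 and mAP, can be sketched as follows. The abstract does not say which similarity measure is used, so cosine similarity is an assumption here, and all function names are hypothetical. The sketch also omits the same-camera and junk-image filtering that the standard benchmark protocols apply.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two feature vectors (assumed metric)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def rank_gallery(query_feat, gallery_feats):
    """Gallery indices sorted from most to least similar to the query."""
    sims = [cosine_similarity(query_feat, g) for g in gallery_feats]
    return sorted(range(len(gallery_feats)), key=lambda i: -sims[i])


def average_precision(ranked_ids, query_id):
    """Mean of the precision values at each rank where a true match occurs."""
    hits, precisions = 0, []
    for rank, gid in enumerate(ranked_ids, start=1):
        if gid == query_id:
            hits += 1
            precisions.append(hits / rank)
    return sum(precisions) / hits if hits else 0.0


def evaluate(query_feats, query_ids, gallery_feats, gallery_ids):
    """Return (Rank-1 accuracy, mAP) over all queries."""
    rank1_hits, aps = 0, []
    for feat, qid in zip(query_feats, query_ids):
        ranked_ids = [gallery_ids[i] for i in rank_gallery(feat, gallery_feats)]
        if ranked_ids[0] == qid:
            rank1_hits += 1  # top-ranked gallery image has the right identity
        aps.append(average_precision(ranked_ids, qid))
    n = len(query_ids)
    return rank1_hits / n, sum(aps) / n
```

Rank-1 only checks the single best match for each query, while mAP rewards placing every gallery image of the same identity near the top of the list, which is why the two figures in the abstract differ.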

CLC number:

TP391.41

LIU Ziyan, WAN Peipei. Pedestrian re-identification feature extraction method based on attention mechanism[J]. Journal of Computer Applications, 2020, 40(3): 672-676.


