《计算机应用》唯一官方网站 ›› 2022, Vol. 42 ›› Issue (6): 1876-1883.DOI: 10.11772/j.issn.1001-9081.2021040545

• 人工智能 • 上一篇    

基于小波特征与注意力机制结合的卷积网络车辆重识别

廖光锴1, 张正1, 宋治国2()   

  1. 1.吉首大学 信息科学与工程学院,湖南 吉首 416000
    2.吉首大学 物理与机电工程学院,湖南 吉首 416000
  • 收稿日期:2021-04-12 修回日期:2021-07-09 接受日期:2021-07-09 发布日期:2022-06-22 出版日期:2022-06-10
  • 通讯作者: 宋治国
  • 作者简介:廖光锴(1993—),男,四川内江人,硕士研究生,主要研究方向:车辆重识别、图像检索
    张正(1981—),男,湖南吉首人,副教授,博士,主要研究方向:矩阵计算
  • 基金资助:
    国家自然科学基金资助项目(32060238)

Convolutional network-based vehicle re-identification combining wavelet features and attention mechanism

Guangkai LIAO1, Zheng ZHANG1, Zhiguo SONG2()   

  1. 1.College of Information Science and Engineering,Jishou University,Jishou Hunan 416000,China
    2.College of Physics and Mechanical and Electrical Engineering,Jishou University,Jishou Hunan 416000,China
  • Received:2021-04-12 Revised:2021-07-09 Accepted:2021-07-09 Online:2022-06-22 Published:2022-06-10
  • Contact: Zhiguo SONG
  • About author:LIAO Guangkai,born in 1993,M. S. candidate. His research interests include vehicle re-identification,image retrieval.
    ZHANG Zheng,born in 1981,Ph. D.,associate professor. His research interests include matrix computation
  • Supported by:
    National Natural Science Foundation(32060238)

摘要:

针对现有的基于卷积神经网络(CNN)的车辆重识别方法所提取的特征表达力不足的问题,提出一种基于小波特征与注意力机制相结合的车辆重识别方法。首先,将单层小波模块嵌入到卷积模块中代替池化层进行下采样,减少细粒度特征的丢失;其次,结合通道注意力(CA)机制和像素注意力(PA)机制提出一种新的局部注意力模块——特征提取模块(FEM)嵌入到卷积网络中,对关键信息进行加权强化。在VeRi数据集上与基准残差网络ResNet-50、ResNet-101进行对比。实验结果表明,在ResNet-50中增加小波变换层数能提高平均精度均值(mAP);在消融实验中,虽然ResNet-50+离散小波变换(DWT)比ResNet-101的mAP降低了0.25个百分点,但是其参数量和计算复杂度都比ResNet-101低,且mAP、Rank-1和Rank-5均比单独的ResNet-50高,说明该模型在车辆重识别中能够有效提高车辆检索精度。

关键词: 车辆重识别, 通道注意力, 像素注意力, 小波变换, 卷积神经网络

Abstract:

Aiming at the problem of insufficient representation ability of features extracted by the existing vehicle re-identification methods based on convolution Neural Network (CNN), a vehicle re-identification method based on the combination of wavelet features and attention mechanism was proposed. Firstly, the single-layer wavelet module was embedded in the convolution module to replace the pooling layer for subsampling, thereby reducing the loss of fine-grained features. Secondly, a new local attention module named Feature Extraction Module (FEM) was put forward by combining Channel Attention (CA) mechanism and Pixel Attention (PA) mechanism, which was embedded into CNN to weight and strengthen the key information. Comparison experiments with the benchmark residual convolutional network ResNet-50 and ResNet-101 were conducted on VeRi dataset. Experimental results show that increasing the number of wavelet decomposition layers in ResNet-50 can improve mean Average Precision (mAP). In the ablation experiment, although ResNet-50+Discrete Wavelet Transform (DWT) has the mAP reduced by 0.25 percentage points compared with ResNet-101, it has the number of parameters and computational complexity lower than those of ResNet-101, and has the mAP, Rank-1 and Rank-5 higher than those of ResNet-50 without DWT, verifying that the proposed model can effectively improve the accuracy of vehicle retrieval in vehicle re-identification.

Key words: vehicle re-identification, Channel Attention (CA), Pixel Attention (PA), wavelet transform, Convolutional Neural Network (CNN)

中图分类号: