基于挤压激励的轻量化注意力机制模块

doi:10.11772/j.issn.1001-9081.2021061037

《计算机应用》唯一官方网站 ›› 2022, Vol. 42 ›› Issue (8): 2353-2360.DOI: 10.11772/j.issn.1001-9081.2021061037

基于挤压激励的轻量化注意力机制模块

吕振虎¹, 许新征¹^,²(), 张芳艳³

^1.中国矿业大学计算机科学与技术学院, 江苏徐州 221116
^2.光电技术与智能控制教育部重点实验室(兰州交通大学), 兰州 730070
^3.宁夏大学智能工程与技术学院, 宁夏中卫 755000

收稿日期:2021-06-21 修回日期:2021-09-04 接受日期:2021-09-14 发布日期:2021-10-18 出版日期:2022-08-10
通讯作者: 许新征
作者简介:吕振虎（1995—），男，山东枣庄人，硕士研究生，主要研究方向：机器学习、计算机视觉；
许新征（1980—），男，安徽宿州人，教授，博士，CCF高级会员，主要研究方向：机器学习、模式识别；
张芳艳（1990—），女，甘肃宁县人，硕士，主要研究方向：图像处理、模式识别。
基金资助:
国家自然科学基金资助项目(61976217);光电技术与智能控制教育部重点实验室（兰州交通大学）开放课题(KFKT2020-03)

Lightweight attention mechanism module based on squeeze and excitation

Zhenhu LYU¹, Xinzheng XU¹^,²(), Fangyan ZHANG³

^1.School of Computer Science and Technology，China University of Mining and Technology，Xuzhou Jiangsu 221116，China
^2.Key Laboratory of Opt-Electronic Technology and Intelligent Control of Ministry of Education （Lanzhou Jiaotong University），Lanzhou Gansu 730070，China
^3.School of Intelligent Engineering and Technology，Ningxia University，Zhongwei Ningxia 755000，China

Received:2021-06-21 Revised:2021-09-04 Accepted:2021-09-14 Online:2021-10-18 Published:2022-08-10
Contact: Xinzheng XU
About author:LYU Zhenhu， born in 1995， M. S. candidate. His research interests include machine learning， computer vision.
XU Xinzheng， born in 1980， Ph. D.， professor. His research interests include machine learning， pattern recognition.
ZHANG Fangyan， born in 1990， M. S. Her research interests include image processing， pattern recognition.
Supported by:
National Natural Science Foundation of China(61976217);Opening Project of Key Laboratory of Opt-Electronic Technology and Intelligent Control of Ministry of Education （Lanzhou Jiaotong University）(KFKT2020-03)

摘要/Abstract

摘要：

针对向卷积神经网络（CNN）中嵌入注意力机制模块以提高模型应用精度导致参数和计算量增加的问题，提出基于挤压激励的轻量化高度维度挤压激励（HD-SE）模块和宽度维度挤压激励（WD-SE）模块。为了充分利用特征图中潜在的信息，HD-SE对卷积层输出的特征图在高度维度上进行挤压激励操作，获得高度维度上的权重信息；而WD-SE在宽度维度上进行挤压激励操作，以得到特征图宽度维度上的权重信息；然后，将得到的权重信息分别应用于对应维度的特征图张量，以提高模型的应用精度。将HD-SE与WD-SE分别嵌入VGG16、ResNet56、MobileNetV1和MobileNetV2模型中，在CIFAR10和CIFAR100数据集上进行的实验结果表明，与挤压激励（SE）模块、协调注意力（CA）模块、卷积块注意力模块（CBAM）和高效通道注意力（ECA）模块等先进的注意力机制模块相比，HD-SE与WD-SE在向网络模型中增加的参数和计算量更少的同时得到的精度相似或者更高。

关键词: 卷积神经网络, 挤压激励, 轻量化, 多维度, 注意力机制模块

Abstract:

Focusing on the issue that embedding the attention mechanism module into Convolutional Neural Network （CNN） to improve the application accuracy will increase the parameters and the computational cost， the lightweight Height Dimensional Squeeze and Excitation （HD-SE） module and Width Dimensional Squeeze and Excitation （WD-SE） module based on squeeze and excitation were proposed. To make full use of the potential information in the feature maps， two kinds of height and width dimensional weight information of feature maps was respectively extracted by HD-SE and WD-SE through squeeze and excitation operations， then the obtained weight information was respectively applied to corresponding tensors of the feature maps of two dimensions to improve the application accuracy of the model. Experiments were implemented on CIFAR10 and CIFAR100 datasets after embedding HD-SE and WD-SE into Visual Geometry Group 16 （VGG16）， Residual Network 56 （ResNet56）， MobileNetV1 and MobileNetV2 models respectively. Experimental results show fewer parameters and computational cost added by HD-SE and WD-SE to the network models when the models achieve the same or even better accuracy， compared with the state-of-the-art attention mechanism modules， such as Squeeze and Excitation （SE） module， Coordinate Attention （CA） block， Convolutional Block Attention Module （CBAM） and Efficient Channel Attention （ECA） module.

Key words: Convolutional Neural Network (CNN), squeeze and excitation, lightweight, multi-dimension, attention mechanism module

中图分类号:

TP181

吕振虎, 许新征, 张芳艳. 基于挤压激励的轻量化注意力机制模块[J]. 计算机应用, 2022, 42(8): 2353-2360.

Zhenhu LYU, Xinzheng XU, Fangyan ZHANG. Lightweight attention mechanism module based on squeeze and excitation[J]. Journal of Computer Applications, 2022, 42(8): 2353-2360.

图/表 12

参考文献 19

1	KRIZHEVSKY A， SUTSKEVER I， HINTON G E. ImageNet classification with deep convolutional neural networks［J］. Communications of the ACM， 2017， 60（6）： 84-90. 10.1145/3065386
2	HAN K， GUO J Y， ZHANG C， et al. Attribute-aware attention model for fine-grained representation learning ［C］// Proceedings of the 26th ACM International Conference on Multimedia. New York： ACM， 2018： 2040-2048. 10.1145/3240508.3240550
3	REN S Q， HE K M， GIRSHICK R， et al. Faster R-CNN： towards real-time object detection with region proposal networks［J］. IEEE Transactions on Pattern Analysis and Machine Intelligence， 2017， 39（6）： 1137-1149. 10.1109/tpami.2016.2577031
4	张顺，龚怡宏，王进军.深度卷积神经网络的发展及其在计算机视觉领域的应用［J］.计算机学报， 2019， 42（3）： 453-482. 10.11897/SP.J.1016.2019.00453
	ZHANG S， GONG Y H， WANG J J. The development of deep convolution neural networks and its applications on computer vision［J］. Chinese Journal of Computers， 2019， 42（3）： 453-482. 10.11897/SP.J.1016.2019.00453
5	CHEN L C， PAPANDREOU G， KOKKINOS I， et al. DeepLab： semantic image segmentation with deep convolutional nets， atrous convolution， and fully connected CRFs［J］. IEEE Transactions on Pattern Analysis and Machine Intelligence， 2018， 40（4）： 834-848. 10.1109/tpami.2017.2699184
6	HU J， SHEN L， SUN G. Squeeze-and-excitation networks ［C］// Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2018： 7132-7141. 10.1109/cvpr.2018.00745
7	PARK J， WOO S， LEE J Y， et al. BAM： bottleneck attention module ［C］// Proceedings of the 2018 British Machine Vision Conference. Durham： BMVA Press， 2018： No.92.
8	WOO S， PARK J， LEE J Y， et al. CBAM： convolutional block attention module ［C］// Proceedings of the 2018 European Conference on Computer Vision， LNCS 11211. Cham： Springer， 2018： 3-19.
9	WANG Q L， WU B G， ZHU P F， et al. ECA-Net： efficient channel attention for deep convolutional neural networks ［C］// Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2020： 13708-13717. 10.1109/cvpr42600.2020.01155
10	HOU Q B， ZHOU D Q， FENG J S. Coordinate attention for efficient mobile network design ［C］// Proceedings of the 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2021： 11531-11539. 10.1109/cvpr46437.2021.01350
11	MEHTA S， HAJISHIRZI H， RASTEGARI M. DiCENet： dimension-wise convolutions for efficient networks［J］. IEEE Transactions on Pattern Analysis and Machine Intelligence， 2022， 44（5）： 2416-2425.
12	SIMONYAN K， ZISSERMAN A. Very deep convolutional networks for large-scale image recognition［EB/OL］. （2015-04-10）［2021-04-19］. .
13	HE K M， ZHANG X Y， REN S Q， et al. Deep residual learning for image recognition ［C］// Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2016： 770-778. 10.1109/cvpr.2016.90
14	HOWARD A G， ZHU M L， CHEN B， et al. MobileNets： efficient convolutional neural networks for mobile vision applications［EB/OL］. （2017-04-17）［2021-06-20］. .
15	SANDLER M， HOWARD A， ZHU M L， et al. MobileNetV2： inverted residuals and linear bottlenecks ［C］// Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2018： 4510-4520. 10.1109/cvpr.2018.00474
16	PASZKE A， GROSS S， CHINTALA S， et al. Automatic differentiation in PyTorch［EB/OL］. ［2021-06-28］. .
17	KRIZHEVSKY A. Learning multiple layers of features from tiny images［R/OL］. （2009-04-08）［2021-02-19］. .
18	MENG F X， CHENG H， LI K， et al. Pruning filter in filter［C/OL］// Proceedings of the 34th Conference on Neural Information Processing Systems. ［2021-02-20］. . 10.1109/cvpr42600.2020.00663
19	KUANG L. PyTorch-CIFAR［CP/OL］. ［2021-06-20］. .

模型	测试精度/%		参数量/10⁶		计算量/10⁶
模型	CIFAR10	CIFAR100	CIFAR10	CIFAR100	CIFAR10	CIFAR100
VGG16	93.25	72.57	14.73	14.77	313.33	313.33
VGG16+SE	93.80	73.37	15.63	15.68	314.24	314.24
VGG16+CA	93.86	73.51	14.91	14.96	314.41	314.41
VGG16+CBAM	93.71	72.61	14.96	15.01	313.61	313.61
VGG16+ECA	93.65	71.51	14.73	14.77	313.33	313.33
VGG16+HD-SE	93.97	73.83	14.73	14.78	313.34	313.34
VGG16+WD-SE	93.98	74.14	14.73	14.78	313.34	313.34

模型	测试精度/%		参数量/10⁶		计算量/10⁶
模型	CIFAR10	CIFAR100	CIFAR10	CIFAR100	CIFAR10	CIFAR100
VGG16	93.25	72.57	14.73	14.77	313.33	313.33
VGG16+SE	93.80	73.37	15.63	15.68	314.24	314.24
VGG16+CA	93.86	73.51	14.91	14.96	314.41	314.41
VGG16+CBAM	93.71	72.61	14.96	15.01	313.61	313.61
VGG16+ECA	93.65	71.51	14.73	14.77	313.33	313.33
VGG16+HD-SE	93.97	73.83	14.73	14.78	313.34	313.34
VGG16+WD-SE	93.98	74.14	14.73	14.78	313.34	313.34

模型	测试精度/%		参数量/10⁶		计算量/10⁶
模型	CIFAR10	CIFAR100	CIFAR10	CIFAR100	CIFAR10	CIFAR100
ResNet56	93.10	71.43	0.85	0.86	125.49	125.49
ResNet56+SE	93.67	72.26	0.88	0.89	125.50	125.50
ResNet56+CA	94.06	72.22	0.88	0.89	125.95	125.95
ResNet56+CBAM	93.90	72.25	0.86	0.87	126.67	126.67
ResNet56+ECA	93.84	72.21	0.85	0.86	125.49	125.49
ResNet56+HD-SE	93.76	72.39	0.86	0.86	125.49	125.49
ResNet56+WD-SE	93.84	72.53	0.86	0.86	125.49	125.49

模型	测试精度/%		参数量/10⁶		计算量/10⁶
模型	CIFAR10	CIFAR100	CIFAR10	CIFAR100	CIFAR10	CIFAR100
ResNet56	93.10	71.43	0.85	0.86	125.49	125.49
ResNet56+SE	93.67	72.26	0.88	0.89	125.50	125.50
ResNet56+CA	94.06	72.22	0.88	0.89	125.95	125.95
ResNet56+CBAM	93.90	72.25	0.86	0.87	126.67	126.67
ResNet56+ECA	93.84	72.21	0.85	0.86	125.49	125.49
ResNet56+HD-SE	93.76	72.39	0.86	0.86	125.49	125.49
ResNet56+WD-SE	93.84	72.53	0.86	0.86	125.49	125.49

模型	测试精度/%		参数量/10⁶		计算量/10⁶
模型	CIFAR10	CIFAR100	CIFAR10	CIFAR100	CIFAR10	CIFAR100
MobileNetV1	91.24	67.89	3.22	3.31	46.34	46.34
MobileNetV1+SE	91.88	69.16	5.14	5.23	48.26	48.26
MobileNetV1+CA	91.92	69.52	3.59	3.69	48.01	48.01
MobileNetV1+CBAM	91.73	68.55	3.70	3.80	46.52	46.52
MobileNetV1+ECA	91.54	68.03	3.22	3.31	46.34	46.34
MobileNetV1+HD-SE	92.19	69.99	3.22	3.31	46.35	46.35
MobileNetV1+WD-SE	91.92	69.84	3.22	3.31	46.35	46.35

基于挤压激励的轻量化注意力机制模块

Lightweight attention mechanism module based on squeeze and excitation

RichHTML

PDF

可视化

摘要/Abstract

引用本文

使用本文

图/表 12

参考文献 19

相关文章 15

编辑推荐

Metrics

[1]	徐成霞, 阎庆, 李腾, 苗开超. 基于联合注意力机制的单幅图像去雨算法[J]. 《计算机应用》唯一官方网站, 2022, 42(8): 2578-2585.
[2]	张显杰, 张之明. 基于卷积神经网络和Transformer的手写体英文文本识别[J]. 《计算机应用》唯一官方网站, 2022, 42(8): 2394-2400.
[3]	程南江, 余贞侠, 陈琳, 乔贺辙. 基于领域自适应的多源多标签行人属性识别[J]. 《计算机应用》唯一官方网站, 2022, 42(8): 2401-2406.
[4]	邓杰航, 郭文权, 陈汉杰, 顾国生, 刘景建, 杜宇坤, 刘超, 康晓东, 赵建. 融合多尺度多头自注意力和在线难例挖掘的小样本硅藻检测[J]. 《计算机应用》唯一官方网站, 2022, 42(8): 2593-2600.
[5]	靳华中, 张修洋, 叶志伟, 张闻其, 夏小鱼. 基于近似U型网络结构的图像去噪模型[J]. 《计算机应用》唯一官方网站, 2022, 42(8): 2571-2577.
[6]	谭湘粤, 胡晓, 杨佳信, 向俊将. 基于递进式特征增强聚合的伪装目标检测[J]. 《计算机应用》唯一官方网站, 2022, 42(7): 2192-2200.
[7]	刘万军, 王佳铭, 曲海成, 董利兵, 曹欣宇. 基于频谱空间域特征注意的音乐流派分类算法[J]. 《计算机应用》唯一官方网站, 2022, 42(7): 2072-2077.
[8]	钟志峰, 夏一帆, 周冬平, 晏阳天. 基于改进YOLOv4的轻量化目标检测算法[J]. 《计算机应用》唯一官方网站, 2022, 42(7): 2201-2209.
[9]	王海起, 王志海, 李留珂, 孔浩然, 王琼, 徐建波. 基于网格划分的城市短时交通流量时空预测模型[J]. 《计算机应用》唯一官方网站, 2022, 42(7): 2274-2280.
[10]	董宁, 程晓荣, 张铭泉. 基于物联网平台的动态权重损失函数入侵检测系统[J]. 《计算机应用》唯一官方网站, 2022, 42(7): 2118-2124.
[11]	陈荣源, 姚剑敏, 严群, 林志贤. 基于深度神经网络的视频播放速度识别[J]. 《计算机应用》唯一官方网站, 2022, 42(7): 2043-2051.
[12]	王震宇, 张雷, 高文彬, 权威铭. 基于渐进式神经网络架构搜索的人体运动识别[J]. 《计算机应用》唯一官方网站, 2022, 42(7): 2058-2064.
[13]	苏珊, 张杨, 张冬雯. 基于深度学习的耦合度相关代码坏味检测方法[J]. 《计算机应用》唯一官方网站, 2022, 42(6): 1702-1707.
[14]	杨磊, 赵红东, 于快快. 基于多头注意力机制的端到端语音情感识别[J]. 《计算机应用》唯一官方网站, 2022, 42(6): 1869-1875.
[15]	廖光锴, 张正, 宋治国. 基于小波特征与注意力机制结合的卷积网络车辆重识别[J]. 《计算机应用》唯一官方网站, 2022, 42(6): 1876-1883.

模型	测试精度/%		参数量/10⁶		计算量/10⁶
模型	CIFAR10	CIFAR100	CIFAR10	CIFAR100	CIFAR10	CIFAR100
MobileNetV2	93.33	74.83	2.30	2.41	91.14	91.14
MobileNetV2+SE	93.41	75.65	4.55	4.67	93.40	93.40
MobileNetV2+CA	93.61	75.05	2.74	2.86	94.60	94.60
MobileNetV2+CBAM	93.52	75.34	2.87	2.99	91.57	91.57
MobileNetV2+ECA	93.72	75.20	2.30	2.41	91.14	91.14
MobileNetV2+HD-SE	93.54	75.01	2.30	2.41	91.14	91.14
MobileNetV2+WD-SE	93.43	75.15	2.30	2.41	91.14	91.14

模型	测试精度/%		参数量/10⁶		计算量/10⁶
模型	CIFAR10	CIFAR100	CIFAR10	CIFAR100	CIFAR10	CIFAR100
MobileNetV2	93.33	74.83	2.30	2.41	91.14	91.14
MobileNetV2+SE	93.41	75.65	4.55	4.67	93.40	93.40
MobileNetV2+CA	93.61	75.05	2.74	2.86	94.60	94.60
MobileNetV2+CBAM	93.52	75.34	2.87	2.99	91.57	91.57
MobileNetV2+ECA	93.72	75.20	2.30	2.41	91.14	91.14
MobileNetV2+HD-SE	93.54	75.01	2.30	2.41	91.14	91.14
MobileNetV2+WD-SE	93.43	75.15	2.30	2.41	91.14	91.14