基于挤压激励的轻量化注意力机制模块

doi:10.11772/j.issn.1001-9081.2021061037

《计算机应用》唯一官方网站 ›› 2022, Vol. 42 ›› Issue (8): 2353-2360.DOI: 10.11772/j.issn.1001-9081.2021061037

所属专题：人工智能

基于挤压激励的轻量化注意力机制模块

吕振虎¹, 许新征¹^,²(), 张芳艳³

^1.中国矿业大学计算机科学与技术学院, 江苏徐州 221116
^2.光电技术与智能控制教育部重点实验室(兰州交通大学), 兰州 730070
^3.宁夏大学智能工程与技术学院, 宁夏中卫 755000

收稿日期:2021-06-21 修回日期:2021-09-04 接受日期:2021-09-14 发布日期:2021-10-18 出版日期:2022-08-10
通讯作者: 许新征
作者简介:吕振虎（1995—），男，山东枣庄人，硕士研究生，主要研究方向：机器学习、计算机视觉；
许新征（1980—），男，安徽宿州人，教授，博士，CCF高级会员，主要研究方向：机器学习、模式识别；
张芳艳（1990—），女，甘肃宁县人，硕士，主要研究方向：图像处理、模式识别。
基金资助:
国家自然科学基金资助项目(61976217);光电技术与智能控制教育部重点实验室（兰州交通大学）开放课题(KFKT2020-03)

Lightweight attention mechanism module based on squeeze and excitation

Zhenhu LYU¹, Xinzheng XU¹^,²(), Fangyan ZHANG³

^1.School of Computer Science and Technology，China University of Mining and Technology，Xuzhou Jiangsu 221116，China
^2.Key Laboratory of Opt-Electronic Technology and Intelligent Control of Ministry of Education （Lanzhou Jiaotong University），Lanzhou Gansu 730070，China
^3.School of Intelligent Engineering and Technology，Ningxia University，Zhongwei Ningxia 755000，China

Received:2021-06-21 Revised:2021-09-04 Accepted:2021-09-14 Online:2021-10-18 Published:2022-08-10
Contact: Xinzheng XU
About author:LYU Zhenhu， born in 1995， M. S. candidate. His research interests include machine learning， computer vision.
XU Xinzheng， born in 1980， Ph. D.， professor. His research interests include machine learning， pattern recognition.
ZHANG Fangyan， born in 1990， M. S. Her research interests include image processing， pattern recognition.
Supported by:
National Natural Science Foundation of China(61976217);Opening Project of Key Laboratory of Opt-Electronic Technology and Intelligent Control of Ministry of Education （Lanzhou Jiaotong University）(KFKT2020-03)

摘要/Abstract

摘要：

针对向卷积神经网络（CNN）中嵌入注意力机制模块以提高模型应用精度导致参数和计算量增加的问题，提出基于挤压激励的轻量化高度维度挤压激励（HD-SE）模块和宽度维度挤压激励（WD-SE）模块。为了充分利用特征图中潜在的信息，HD-SE对卷积层输出的特征图在高度维度上进行挤压激励操作，获得高度维度上的权重信息；而WD-SE在宽度维度上进行挤压激励操作，以得到特征图宽度维度上的权重信息；然后，将得到的权重信息分别应用于对应维度的特征图张量，以提高模型的应用精度。将HD-SE与WD-SE分别嵌入VGG16、ResNet56、MobileNetV1和MobileNetV2模型中，在CIFAR10和CIFAR100数据集上进行的实验结果表明，与挤压激励（SE）模块、协调注意力（CA）模块、卷积块注意力模块（CBAM）和高效通道注意力（ECA）模块等先进的注意力机制模块相比，HD-SE与WD-SE在向网络模型中增加的参数和计算量更少的同时得到的精度相似或者更高。

关键词: 卷积神经网络, 挤压激励, 轻量化, 多维度, 注意力机制模块

Abstract:

Focusing on the issue that embedding the attention mechanism module into Convolutional Neural Network （CNN） to improve the application accuracy will increase the parameters and the computational cost， the lightweight Height Dimensional Squeeze and Excitation （HD-SE） module and Width Dimensional Squeeze and Excitation （WD-SE） module based on squeeze and excitation were proposed. To make full use of the potential information in the feature maps， two kinds of height and width dimensional weight information of feature maps was respectively extracted by HD-SE and WD-SE through squeeze and excitation operations， then the obtained weight information was respectively applied to corresponding tensors of the feature maps of two dimensions to improve the application accuracy of the model. Experiments were implemented on CIFAR10 and CIFAR100 datasets after embedding HD-SE and WD-SE into Visual Geometry Group 16 （VGG16）， Residual Network 56 （ResNet56）， MobileNetV1 and MobileNetV2 models respectively. Experimental results show fewer parameters and computational cost added by HD-SE and WD-SE to the network models when the models achieve the same or even better accuracy， compared with the state-of-the-art attention mechanism modules， such as Squeeze and Excitation （SE） module， Coordinate Attention （CA） block， Convolutional Block Attention Module （CBAM） and Efficient Channel Attention （ECA） module.

Key words: Convolutional Neural Network (CNN), squeeze and excitation, lightweight, multi-dimension, attention mechanism module

中图分类号:

TP181

吕振虎, 许新征, 张芳艳. 基于挤压激励的轻量化注意力机制模块[J]. 计算机应用, 2022, 42(8): 2353-2360.

Zhenhu LYU, Xinzheng XU, Fangyan ZHANG. Lightweight attention mechanism module based on squeeze and excitation[J]. Journal of Computer Applications, 2022, 42(8): 2353-2360.

图/表 12

参考文献 19

1	KRIZHEVSKY A， SUTSKEVER I， HINTON G E. ImageNet classification with deep convolutional neural networks［J］. Communications of the ACM， 2017， 60（6）： 84-90. 10.1145/3065386
2	HAN K， GUO J Y， ZHANG C， et al. Attribute-aware attention model for fine-grained representation learning ［C］// Proceedings of the 26th ACM International Conference on Multimedia. New York： ACM， 2018： 2040-2048. 10.1145/3240508.3240550
3	REN S Q， HE K M， GIRSHICK R， et al. Faster R-CNN： towards real-time object detection with region proposal networks［J］. IEEE Transactions on Pattern Analysis and Machine Intelligence， 2017， 39（6）： 1137-1149. 10.1109/tpami.2016.2577031
4	张顺，龚怡宏，王进军.深度卷积神经网络的发展及其在计算机视觉领域的应用［J］.计算机学报， 2019， 42（3）： 453-482. 10.11897/SP.J.1016.2019.00453
	ZHANG S， GONG Y H， WANG J J. The development of deep convolution neural networks and its applications on computer vision［J］. Chinese Journal of Computers， 2019， 42（3）： 453-482. 10.11897/SP.J.1016.2019.00453
5	CHEN L C， PAPANDREOU G， KOKKINOS I， et al. DeepLab： semantic image segmentation with deep convolutional nets， atrous convolution， and fully connected CRFs［J］. IEEE Transactions on Pattern Analysis and Machine Intelligence， 2018， 40（4）： 834-848. 10.1109/tpami.2017.2699184
6	HU J， SHEN L， SUN G. Squeeze-and-excitation networks ［C］// Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2018： 7132-7141. 10.1109/cvpr.2018.00745
7	PARK J， WOO S， LEE J Y， et al. BAM： bottleneck attention module ［C］// Proceedings of the 2018 British Machine Vision Conference. Durham： BMVA Press， 2018： No.92.
8	WOO S， PARK J， LEE J Y， et al. CBAM： convolutional block attention module ［C］// Proceedings of the 2018 European Conference on Computer Vision， LNCS 11211. Cham： Springer， 2018： 3-19.
9	WANG Q L， WU B G， ZHU P F， et al. ECA-Net： efficient channel attention for deep convolutional neural networks ［C］// Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2020： 13708-13717. 10.1109/cvpr42600.2020.01155
10	HOU Q B， ZHOU D Q， FENG J S. Coordinate attention for efficient mobile network design ［C］// Proceedings of the 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2021： 11531-11539. 10.1109/cvpr46437.2021.01350
11	MEHTA S， HAJISHIRZI H， RASTEGARI M. DiCENet： dimension-wise convolutions for efficient networks［J］. IEEE Transactions on Pattern Analysis and Machine Intelligence， 2022， 44（5）： 2416-2425.
12	SIMONYAN K， ZISSERMAN A. Very deep convolutional networks for large-scale image recognition［EB/OL］. （2015-04-10）［2021-04-19］. .
13	HE K M， ZHANG X Y， REN S Q， et al. Deep residual learning for image recognition ［C］// Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2016： 770-778. 10.1109/cvpr.2016.90
14	HOWARD A G， ZHU M L， CHEN B， et al. MobileNets： efficient convolutional neural networks for mobile vision applications［EB/OL］. （2017-04-17）［2021-06-20］. .
15	SANDLER M， HOWARD A， ZHU M L， et al. MobileNetV2： inverted residuals and linear bottlenecks ［C］// Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2018： 4510-4520. 10.1109/cvpr.2018.00474
16	PASZKE A， GROSS S， CHINTALA S， et al. Automatic differentiation in PyTorch［EB/OL］. ［2021-06-28］. .
17	KRIZHEVSKY A. Learning multiple layers of features from tiny images［R/OL］. （2009-04-08）［2021-02-19］. .
18	MENG F X， CHENG H， LI K， et al. Pruning filter in filter［C/OL］// Proceedings of the 34th Conference on Neural Information Processing Systems. ［2021-02-20］. . 10.1109/cvpr42600.2020.00663
19	KUANG L. PyTorch-CIFAR［CP/OL］. ［2021-06-20］. .

模型	测试精度/%		参数量/10⁶		计算量/10⁶
模型	CIFAR10	CIFAR100	CIFAR10	CIFAR100	CIFAR10	CIFAR100
VGG16	93.25	72.57	14.73	14.77	313.33	313.33
VGG16+SE	93.80	73.37	15.63	15.68	314.24	314.24
VGG16+CA	93.86	73.51	14.91	14.96	314.41	314.41
VGG16+CBAM	93.71	72.61	14.96	15.01	313.61	313.61
VGG16+ECA	93.65	71.51	14.73	14.77	313.33	313.33
VGG16+HD-SE	93.97	73.83	14.73	14.78	313.34	313.34
VGG16+WD-SE	93.98	74.14	14.73	14.78	313.34	313.34

模型	测试精度/%		参数量/10⁶		计算量/10⁶
模型	CIFAR10	CIFAR100	CIFAR10	CIFAR100	CIFAR10	CIFAR100
VGG16	93.25	72.57	14.73	14.77	313.33	313.33
VGG16+SE	93.80	73.37	15.63	15.68	314.24	314.24
VGG16+CA	93.86	73.51	14.91	14.96	314.41	314.41
VGG16+CBAM	93.71	72.61	14.96	15.01	313.61	313.61
VGG16+ECA	93.65	71.51	14.73	14.77	313.33	313.33
VGG16+HD-SE	93.97	73.83	14.73	14.78	313.34	313.34
VGG16+WD-SE	93.98	74.14	14.73	14.78	313.34	313.34

模型	测试精度/%		参数量/10⁶		计算量/10⁶
模型	CIFAR10	CIFAR100	CIFAR10	CIFAR100	CIFAR10	CIFAR100
ResNet56	93.10	71.43	0.85	0.86	125.49	125.49
ResNet56+SE	93.67	72.26	0.88	0.89	125.50	125.50
ResNet56+CA	94.06	72.22	0.88	0.89	125.95	125.95
ResNet56+CBAM	93.90	72.25	0.86	0.87	126.67	126.67
ResNet56+ECA	93.84	72.21	0.85	0.86	125.49	125.49
ResNet56+HD-SE	93.76	72.39	0.86	0.86	125.49	125.49
ResNet56+WD-SE	93.84	72.53	0.86	0.86	125.49	125.49

模型	测试精度/%		参数量/10⁶		计算量/10⁶
模型	CIFAR10	CIFAR100	CIFAR10	CIFAR100	CIFAR10	CIFAR100
ResNet56	93.10	71.43	0.85	0.86	125.49	125.49
ResNet56+SE	93.67	72.26	0.88	0.89	125.50	125.50
ResNet56+CA	94.06	72.22	0.88	0.89	125.95	125.95
ResNet56+CBAM	93.90	72.25	0.86	0.87	126.67	126.67
ResNet56+ECA	93.84	72.21	0.85	0.86	125.49	125.49
ResNet56+HD-SE	93.76	72.39	0.86	0.86	125.49	125.49
ResNet56+WD-SE	93.84	72.53	0.86	0.86	125.49	125.49

模型	测试精度/%		参数量/10⁶		计算量/10⁶
模型	CIFAR10	CIFAR100	CIFAR10	CIFAR100	CIFAR10	CIFAR100
MobileNetV1	91.24	67.89	3.22	3.31	46.34	46.34
MobileNetV1+SE	91.88	69.16	5.14	5.23	48.26	48.26
MobileNetV1+CA	91.92	69.52	3.59	3.69	48.01	48.01
MobileNetV1+CBAM	91.73	68.55	3.70	3.80	46.52	46.52
MobileNetV1+ECA	91.54	68.03	3.22	3.31	46.34	46.34
MobileNetV1+HD-SE	92.19	69.99	3.22	3.31	46.35	46.35
MobileNetV1+WD-SE	91.92	69.84	3.22	3.31	46.35	46.35

基于挤压激励的轻量化注意力机制模块

Lightweight attention mechanism module based on squeeze and excitation

RichHTML

PDF

可视化

摘要/Abstract

引用本文

使用本文

图/表 12

参考文献 19

相关文章 15

编辑推荐

Metrics

[1]	李云, 王富铕, 井佩光, 王粟, 肖澳. 基于不确定度感知的帧关联短视频事件检测方法[J]. 《计算机应用》唯一官方网站, 2024, 44(9): 2903-2910.
[2]	秦璟, 秦志光, 李发礼, 彭悦恒. 基于概率稀疏自注意力神经网络的重性抑郁疾患诊断[J]. 《计算机应用》唯一官方网站, 2024, 44(9): 2970-2974.
[3]	赵宇博, 张丽萍, 闫盛, 侯敏, 高茂. 基于改进分段卷积神经网络和知识蒸馏的学科知识实体间关系抽取[J]. 《计算机应用》唯一官方网站, 2024, 44(8): 2421-2429.
[4]	张春雪, 仇丽青, 孙承爱, 荆彩霞. 基于两阶段动态兴趣识别的购买行为预测模型[J]. 《计算机应用》唯一官方网站, 2024, 44(8): 2365-2371.
[5]	陈虹, 齐兵, 金海波, 武聪, 张立昂. 融合1D-CNN与BiGRU的类不平衡流量异常检测[J]. 《计算机应用》唯一官方网站, 2024, 44(8): 2493-2499.
[6]	王东炜, 刘柏辰, 韩志, 王艳美, 唐延东. 基于低秩分解和向量量化的深度网络压缩方法[J]. 《计算机应用》唯一官方网站, 2024, 44(7): 1987-1994.
[7]	高阳峄, 雷涛, 杜晓刚, 李岁永, 王营博, 闵重丹. 基于像素距离图和四维动态卷积网络的密集人群计数与定位方法[J]. 《计算机应用》唯一官方网站, 2024, 44(7): 2233-2242.
[8]	张勇进, 徐健, 张明星. 面向轻量化的改进YOLOv7棉杂检测算法[J]. 《计算机应用》唯一官方网站, 2024, 44(7): 2271-2278.
[9]	姚迅, 秦忠正, 杨捷. 生成式标签对抗的文本分类模型[J]. 《计算机应用》唯一官方网站, 2024, 44(6): 1781-1785.
[10]	程小辉, 黄云天, 张瑞芳. 基于多尺度和加权坐标注意力的轻量化红外道路场景检测模型[J]. 《计算机应用》唯一官方网站, 2024, 44(6): 1927-1934.
[11]	沈君凤, 周星辰, 汤灿. 基于改进的提示学习方法的双通道情感分析模型[J]. 《计算机应用》唯一官方网站, 2024, 44(6): 1796-1806.
[12]	黄梦源, 常侃, 凌铭阳, 韦新杰, 覃团发. 基于层间引导的低光照图像渐进增强算法[J]. 《计算机应用》唯一官方网站, 2024, 44(6): 1911-1919.
[13]	李健京, 李贯峰, 秦飞舟, 李卫军. 基于不确定知识图谱嵌入的多关系近似推理模型[J]. 《计算机应用》唯一官方网站, 2024, 44(6): 1751-1759.
[14]	封筠, 毕健康, 霍一儒, 李家宽. 轻量化沥青路面裂缝图像分割网络PIPNet[J]. 《计算机应用》唯一官方网站, 2024, 44(5): 1520-1526.
[15]	席治远, 唐超, 童安炀, 王文剑. 基于双路时空网络的驾驶员行为识别[J]. 《计算机应用》唯一官方网站, 2024, 44(5): 1511-1519.

模型	测试精度/%		参数量/10⁶		计算量/10⁶
模型	CIFAR10	CIFAR100	CIFAR10	CIFAR100	CIFAR10	CIFAR100
MobileNetV2	93.33	74.83	2.30	2.41	91.14	91.14
MobileNetV2+SE	93.41	75.65	4.55	4.67	93.40	93.40
MobileNetV2+CA	93.61	75.05	2.74	2.86	94.60	94.60
MobileNetV2+CBAM	93.52	75.34	2.87	2.99	91.57	91.57
MobileNetV2+ECA	93.72	75.20	2.30	2.41	91.14	91.14
MobileNetV2+HD-SE	93.54	75.01	2.30	2.41	91.14	91.14
MobileNetV2+WD-SE	93.43	75.15	2.30	2.41	91.14	91.14

模型	测试精度/%		参数量/10⁶		计算量/10⁶
模型	CIFAR10	CIFAR100	CIFAR10	CIFAR100	CIFAR10	CIFAR100
MobileNetV2	93.33	74.83	2.30	2.41	91.14	91.14
MobileNetV2+SE	93.41	75.65	4.55	4.67	93.40	93.40
MobileNetV2+CA	93.61	75.05	2.74	2.86	94.60	94.60
MobileNetV2+CBAM	93.52	75.34	2.87	2.99	91.57	91.57
MobileNetV2+ECA	93.72	75.20	2.30	2.41	91.14	91.14
MobileNetV2+HD-SE	93.54	75.01	2.30	2.41	91.14	91.14
MobileNetV2+WD-SE	93.43	75.15	2.30	2.41	91.14	91.14