基于改进Res-UNet的昼夜地基云图分割网络

doi:10.11772/j.issn.1001-9081.2023040453

《计算机应用》唯一官方网站 ›› 2024, Vol. 44 ›› Issue (4): 1310-1316.DOI: 10.11772/j.issn.1001-9081.2023040453

所属专题：多媒体计算与计算机仿真

• 多媒体计算与计算机仿真 • 上一篇下一篇

基于改进Res-UNet的昼夜地基云图分割网络

王铂越, 李英祥(), 钟剑丹

成都信息工程大学通信工程学院，成都 610200

收稿日期:2023-04-21 修回日期:2023-07-05 接受日期:2023-07-05 发布日期:2023-12-04 出版日期:2024-04-10
通讯作者: 李英祥
作者简介:王铂越（1999—），男，河南洛阳人，硕士研究生，主要研究方向：智能图像处理、人工智能
钟剑丹（1985—），男，甘肃张掖人，讲师，博士，主要研究方向：智能图像处理、人工智能。
基金资助:
四川省科技计划项目(2023YFS0428)

Segmentation network for day and night ground-based cloud images based on improved Res-UNet

Boyue WANG, Yingxiang LI(), Jiandan ZHONG

College of Communication Engineering，Chengdu University of Information Technology，Chengdu Sichuan 610200，China

Received:2023-04-21 Revised:2023-07-05 Accepted:2023-07-05 Online:2023-12-04 Published:2024-04-10
Contact: Yingxiang LI
About author:WANG Boyue， born in 1999， M. S. candidate. His research interests include intelligent image processing， artificial intelligence.
ZHONG Jiandan， born in 1985， Ph. D.， lecturer. His research interests include intelligent image processing， artificial intelligence.
Supported by:
Science and Technology Plan of Sichuan Province(2023YFS0428)

摘要/Abstract

摘要：

针对昼夜地基云图在分割中细节信息丢失、分割精度低等问题，提出一种基于改进Res-UNet（Residual network-UNetwork）的昼夜地基云图分割网络CloudRes-UNet（Cloud ResNet-UNetwork），整体采用编码器-解码器的网络结构。首先，编码器使用ResNet50提取特征，增强特征提取能力；其次，设计多级特征提取（Multi-Stage）模块，该模块结合分组卷积、膨胀卷积和通道打乱这3种技巧，获取高强度语义信息；再次，加入高效通道注意力（ECA?Net）模块，在通道维度上聚焦重要信息，加强对地基云图中云区域的关注，提高分割精度；最后，解码器使用双线性插值对特征进行上采样，提高分割图像的清晰度并减少目标和位置信息丢失。实验结果表明，与当前基于深度学习表现较好的地基云图分割网络（Cloud-UNet）相比，CloudRes-UNet在昼夜地基云图分割数据集上的分割准确率提升了1.5个百分点，平均交并比（MIoU）上升了1.4个百分点，更准确地获取了云量信息，对天气预报、气候研究和光伏发电等方面具有积极意义。

关键词: 地基云图, 语义分割, 深度学习, 高效通道注意力网络, ResNet50, Res-UNet

Abstract:

Aiming at the problems of detail information loss and low segmentation accuracy in the segmentation of day and night ground-based cloud images， a segmentation network called CloudResNet-UNetwork （CloudRes-UNet） for day and night ground-based cloud images based on improved Res-UNet （Residual network-UNetwork） was proposed， in which the overall network structure of encoder-decoder was adopted. Firstly， ResNet50 was used by the encoder to extract features to enhance the feature extraction ability. Then， a Multi-Stage feature extraction （Multi-Stage） module was designed， which combined three techniques of group convolution， dilated convolution and channel shuffle to obtain high-intensity semantic information. Secondly， Efficient Channel Attention Network （ECA?Net） module was added to focus on the important information in the channel dimension， strengthen the attention to the cloud region in the ground-based cloud image， and improve the segmentation accuracy. Finally， bilinear interpolation was used by the decoder to upsample the features， which improved the clarity of the segmented image and reduced the loss of object and position information. The experimental results show that， compared with the state-of-the-art ground-based cloud image segmentation network Cloud-UNetwork （Cloud-UNet） based on deep learning， the segmentation accuracy of CloudRes-UNet on the day and night ground-based cloud image segmentation dataset is increased by 1.5 percentage points， and the Mean Intersection over Union （MIoU） is increased by 1.4 percentage points， which indicates that CloudRes-UNet obtains cloud information more accurately. It has positive significance for weather forecast， climate research， photovoltaic power generation and so on.

Key words: ground-based cloud image, semantic segmentation, deep learning, Efficient Channel Attention Network (ECA-Net), ResNet50, Res-UNet (Residual network-UNetwork)

中图分类号:

TP391

王铂越, 李英祥, 钟剑丹. 基于改进Res-UNet的昼夜地基云图分割网络[J]. 计算机应用, 2024, 44(4): 1310-1316.

Boyue WANG, Yingxiang LI, Jiandan ZHONG. Segmentation network for day and night ground-based cloud images based on improved Res-UNet[J]. Journal of Computer Applications, 2024, 44(4): 1310-1316.

图/表 15

图1 改进后的CloudRes-UNet结构

Fig. 1 Improved CloudRes-UNet structure

图2 ResNet50第3层残差块内部结构

Fig. 2 Internal structure of the 3rd layer’s residual block in ResNet50

表1 ResNet50特征提取网络结构

Tab. 1 ResNet50 feature extraction network structure

卷积层名称	输出尺寸	层内部结构
Conv1	160×160	7×7，64，步长=2
Conv2_x	80×80	残差块×3
Conv3_x	40×40	残差块×4
Conv4_x	20×20	残差块×6
Conv5_x	10×10	残差块×3

图3 分组卷积

Fig. 3 Grouped convolution

图4 3种膨胀卷积

Fig. 4 Three dilation convolutions

图5 ECA-Net结构

Fig. 5 ECA-Net structure

图6 双线性插值原理

Fig. 6 Principle of bilinear interpolation

图7 白天和夜间地基云图和标签图像

Fig. 7 Ground-based cloud images and label images in daytime and nighttime

图8 夜间地基云图数据增强可视化结果

Fig. 8 Data enhanced visualization results of nighttime ground-based cloud images

图9 训练过程中MIoU变化

Fig. 9 MIoU change during training process

图10 昼夜地基云图分割结果

Fig. 10 Segmentation results of day and night ground-based cloud images images

图11 分割结果代表性区域展示

Fig. 11 Representative region display of segmentation results

表2 不同分割网络在3个数据集上的实验结果对比

Tab. 2 Comparison of experimental results of different segmentation networks on three datasets

数据集	分割网络	准确率	精确率	召回率	F1值	ER	MIoU
白天地基云图	U-Net	0.942	0.918	0.917	0.916	0.058	0.847
	PSPNet	0.941	0.941	0.941	0.941	0.059	0.889
	DeepLabv3+	0.934	0.933	0.934	0.933	0.066	0.875
	CloudU-Net	0.951	0.951	0.951	0.951	0.049	0.906
	CloudRes-UNet	0.958	0.958	0.949	0.953	0.042	0.912
夜间地基云图	U-Net	0.952	0.951	0.951	0.951	0.048	0.907
	PSPNet	0.963	0.962	0.963	0.962	0.037	0.927
	DeepLabv3+	0.958	0.958	0.957	0.958	0.042	0.919
	CloudU-Net	0.973	0.972	0.973	0.972	0.027	0.947
	CloudRes-UNet	0.986	0.983	0.981	0.982	0.014	0.965
昼夜地基云图	U-Net	0.931	0.931	0.931	0.931	0.069	0.871
	PSPNet	0.947	0.947	0.947	0.947	0.053	0.900
	DeepLabv3+	0.937	0.936	0.937	0.936	0.063	0.882
	CloudU-Net	0.951	0.954	0.954	0.954	0.049	0.908
	CloudRes-UNet	0.966	0.964	0.958	0.961	0.034	0.922

表3 几种网络的参数量、训练时间和测试时间对比

Tab. 3 Comparison of parameters， training time and test time among several networks

网络模型	参数量/MB	训练时间/h	测试时间/s
CloudRes-UNet	156.8	2.8	105
U-Net	94.9	3.3	218
PSPNet	9.3	2.1	47
DeepLabv3+	22.4	0.9	63
CloudU-Net	138.6	2.6	91

表4 消融实验结果

Tab. 4 Ablation experiment results

网络	Multi-Stage	ECA-Net	准确率	ER	MIoU
网络a			0.931	0.069	0.871
网络b	√		0.954	0.046	0.912
网络c		√	0.957	0.043	0.916
本文网络	√	√	0.966	0.034	0.922

参考文献 35

1	STEPHENS G L. Cloud feedbacks in the climate system： a critical review［J］. Journal of Climate， 2005， 18（2）：237-273. 10.1175/jcli-3243.1
2	刘翼飞，崔承刚. 基于地基云图云量特征的光伏发电功率区间预测［J］. 南方电网技术， 2023， 17（2）：92-100.
	LIU Y F， CUI C G. Interval prediction for short-term solar power based on cloud features of ground-based cloud images［J］. Southern Power System Technology， 2023， 17（2）： 92-100.
3	顾轶，韩潮，刘建勋，等.基于卫星云图的大区域云层预测方法［J］.中国空间科学技术， 2023，43（2）：165-173.
	GU Y， HAN C， LIU J X， et al. Research on large area cloud forecasting method based on satellite cloud images［J］. Chinese Space Science and Technology， 2023， 43（2）：165-173.
4	WANG Y， WANG C， SHI C， et al. Short-term cloud coverage prediction using the ARIMA time series model［J］. Remote Sensing Letters， 2018， 9（3）：274-283. 10.1080/2150704x.2017.1418992
5	LONG C N， SABBURG J M， CALBÓ J， et al. Retrieving cloud characteristics from ground-based daytime color all-sky images［J］. Journal Atmospheric Oceanic Technolog， 2006， 23：633-652. 10.1175/jtech1875.1
6	HEINLE A， MACKE A， SRIVASTAV A. Automatic cloud classification of whole sky images［J］. Atmospheric Measurement Techniques， 2010， 3（3）：557-567. 10.5194/amt-3-557-2010
7	LI Q， LU W， YANG J. A hybrid thresholding algorithm for cloud detection on ground-based color images［J］. Journal of Atmospheric & Oceanic Technology， 2011， 28：1286-1296. 10.1175/jtech-d-11-00009.1
8	LIU S， ZHANG L， ZHANG Z， et al. Automatic cloud detection for all-sky images using superpixel segmentation［J］. IEEE Geoscience and Remote Sensing Letters， 2015， 12（2）：354-358. 10.1109/lgrs.2014.2341291
9	SHI C， WANG Y， WANG C， et al. Ground-based cloud detection using graph model built upon superpixels［J］. IEEE Geoscience & Remote Sensing Letters， 2017， 14（5）： 719-723. 10.1109/lgrs.2017.2676007
10	吉茹，张银胜，杨宇龙，等. 基于多尺度特征融合的改进型云图分割方法［J］. 国外电子测量技术， 2022， 41（11）：37-44.
	JI R， ZHANG Y S， YANG Y L，et al. Improved cloud image segmentation method based on multi-scale feature fusion［J］. Foreign Electronic Measurement Technology， 2022， 41（11）：37-44.
11	张雪，贾克斌，刘钧，等. 面向轻量化的地基云图分割技术研究［J］. 测控技术， 2022， 41（9）：37-43.
	ZHANG X， JIA K B， LIU J，et al. Segmentation technology of ground-based cloud image for lightweight［J］. Measurement & Control Technology， 2022， 41（9）：37-43.
12	GACAL G F B， ANTIOQUIA C， LAGROSAS N. Ground-based detection of nighttime clouds above Manila Observatory （14.64°N， 121.07°E） using a digital camera［J］. Applied Optics， 2016， 55（22）：6040-6045. 10.1364/ao.55.006040
13	DEV S， LEE Y H， WINKLER S. Color-based segmentation of sky/cloud images from ground-based cameras［J］. IEEE Journal Selected Topics in Applied Earth Observations and Remote Sensing， 2017， 10（1）：231-242. 10.1109/jstars.2016.2558474
14	SHI C， ZHOU Y， QIU B， et al. Diurnal and nocturnal cloud segmentation of All-Sky Imager （ASI） images using enhancement fully convolutional networks［J］. Atmospheric Measurement Techniques， 2019， 12（9）：4713-4724. 10.5194/amt-12-4713-2019
15	DEV S， NAUTIYAL A， LEE Y H， et al. CloudSegNet： a deep network for nychthemeron cloud image segmentation［J］. IEEE Geoscience and Remote Sensing Letters， 2019， 16（12）：1814-1818. 10.1109/lgrs.2019.2912140
16	张男男，李丽莎，王宝珠，等. 基于MobileNet的地基云图分割方法研究［J］. 电子技术与软件工程， 2022， 18：129-132.
	ZHANG N N， LI L S， WANG B Z，et al. Research on MobileNet-based ground-based cloud segmentation method［J］. Electronic Technology & Software Engineering， 2022， 18：129-132.
17	HE K， ZHANG X， REN S， et al. Deep residual learning for image recognition［C］// Proceeding of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2016：770-778. 10.1109/cvpr.2016.90
18	WANG Q， WU B， ZHU P， et al. ECA-Net： efficient channel attention for deep convolutional neural networks［C］// Proceedings of the 2020 IEEE/CVF International Conference on Computer Vision and Pattern Recognition. Washington， DC： IEEE Computer Society， 2020：11531-11539. 10.1109/cvpr42600.2020.01155
19	SHI C， ZHOU Y， QIU B. CloudU-Netv2： a cloud segmentation method for ground-based cloud images based on deep learning［J］. Neural Processing Letters， 2021， 53：2715-2728. 10.1007/s11063-021-10457-2
20	SHELHAMER E， LONG J， DARRELL T. Fully convolutional networks for semantic segmentation［J］. IEEE Transactions on Pattern Analysis and Machine Intelligence， 2017， 39（4）：640-651. 10.1109/tpami.2016.2572683
21	RONNEBERGER O， FISCHER P， BROX T. U-Net： convolutional networks for biomedical image segmentation［C］// Proceedings of the 2015 International Conference on Medical Image Computing and Computer-Assisted Intervention，LNCS 9351. Cham： Springer， 2015： 234-241.
22	BADRINARAYANAN V， KENDALL A， CIPOLLA R. SegNet： a deep convolutional encoder-decoder architecture for image segmentation［J］. IEEE Transactions on Pattern Analysis and Machine Intelligence， 2017， 39（12）：2481-2495. 10.1109/tpami.2016.2644615
23	ZHAO H， SHI J， QI X， et al. Pyramid scene parsing network ［C］// Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2017：6230-6239. 10.1109/cvpr.2017.660
24	CHEN L-C， ZHU Y， PAPANDREOU G， et al. Encoder-decoder with atrous separable convolution for semantic image segmentation［C］// Proceedings of the 2018 European Conference on Computer Vision， LNCS 11211. Cham： Springer， 2018：833-851.
25	LIU Z， LIN Y， CAO Y， et al. Swin Transformer： hierarchical vision Transformer using shifted windows［EB/OL］. ［2023-05-10］. . 10.1109/iccv48922.2021.00986
26	LIU Z， MAO H， WU C-Y， et al. A ConvNet for the 2020s［EB/OL］. ［2023-05-10］.. 10.1109/cvpr52688.2022.01167
27	HE K， CHEN X， XIE S， et al. Masked autoencoders are scalable vision learners［EB/OL］. ［2023-05-10］. . 10.1109/cvpr52688.2022.01553
28	CARON M， TOUVRON H， MISRA I， et al. Emerging properties in self-supervised vision Transformers［EB/OL］. ［2023-05-10］. . 10.1109/iccv48922.2021.00951
29	STEPHAN M， SANTRA A. Radar-based human target detection using deep residual U-Net for smart home applications［C］// Proceedings of the 2019 18th IEEE International Conference on Machine Learning and Applications. Piscataway： IEEE， 2019： 175-182. 10.1109/icmla.2019.00035
30	冯兴杰，张天泽. 基于分组卷积进行特征融合的全景分割算法［J］. 计算机应用， 2021， 41（7）：2054-2061. 10.11772/j.issn.1001-9081.2020091523
	FENG X J， ZHANG T Z. Panoptic segmentation algorithm based on grouped convolution for feature fusion［J］. Journal of Computer Applications， 2021， 41（7）： 2054-2061. 10.11772/j.issn.1001-9081.2020091523
31	谷静，吴怡宁，孟鑫昊. 基于膨胀卷积的多尺度焊缝缺陷检测算法［J］. 光电子·激光， 2022， 33（1）：61-66.
	GU J， WU Y N， MENG X H. Weld defect detection based on expansion convolution multi-scale fusion［J］. Journal of Optoelectronics·Laser， 2022， 33（1）：61-66.
32	YU F， KOLYUN V. Multi-scale context aggregation by dilated convolutions ［EB/OL］. ［2023-03-23］. . 10.1109/cvpr.2017.75
33	ZHANG X， ZHOU X， LIN M， et al. ShuffleNet： an extremely efficient convolutional neural network for mobile devices ［C］// Proceeding of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2018：6548-6856. 10.1109/cvpr.2018.00716
34	HU J， SHEN L， ALBANIE S， et al. Squeeze-and-excitation networks［J］. IEEE Transactions on Pattern Analysis and Machine Intelligence， 2020， 42（8）：2011-2023. 10.1109/tpami.2019.2913372
35	SHI C， ZHOU Y， QIU B， et al. CloudU-Net： a deep convolutional neural network architecture for daytime and nighttime cloud images’segmentation［J］. IEEE Geoscience and Remote Sensing Letters， 2021， 18（10）：1688-1692. 10.1109/lgrs.2020.3009227

[1]	李顺勇, 李师毅, 胥瑞, 赵兴旺. 基于自注意力融合的不完整多视图聚类算法[J]. 《计算机应用》唯一官方网站, 2024, 44(9): 2696-2703.
[2]	秦璟, 秦志光, 李发礼, 彭悦恒. 基于概率稀疏自注意力神经网络的重性抑郁疾患诊断[J]. 《计算机应用》唯一官方网站, 2024, 44(9): 2970-2974.
[3]	王熙源, 张战成, 徐少康, 张宝成, 罗晓清, 胡伏原. 面向手术导航3D/2D配准的无监督跨域迁移网络[J]. 《计算机应用》唯一官方网站, 2024, 44(9): 2911-2918.
[4]	潘烨新, 杨哲. 基于多级特征双向融合的小目标检测优化模型[J]. 《计算机应用》唯一官方网站, 2024, 44(9): 2871-2877.
[5]	黄云川, 江永全, 黄骏涛, 杨燕. 基于元图同构网络的分子毒性预测[J]. 《计算机应用》唯一官方网站, 2024, 44(9): 2964-2969.
[6]	刘禹含, 吉根林, 张红苹. 基于骨架图与混合注意力的视频行人异常检测方法[J]. 《计算机应用》唯一官方网站, 2024, 44(8): 2551-2557.
[7]	顾焰杰, 张英俊, 刘晓倩, 周围, 孙威. 基于时空多图融合的交通流量预测[J]. 《计算机应用》唯一官方网站, 2024, 44(8): 2618-2625.
[8]	石乾宏, 杨燕, 江永全, 欧阳小草, 范武波, 陈强, 姜涛, 李媛. 面向空气质量预测的多粒度突变拟合网络[J]. 《计算机应用》唯一官方网站, 2024, 44(8): 2643-2650.
[9]	吴筝, 程志友, 汪真天, 汪传建, 王胜, 许辉. 基于深度学习的患者麻醉复苏过程中的头部运动幅度分类方法[J]. 《计算机应用》唯一官方网站, 2024, 44(7): 2258-2263.
[10]	李欢欢, 黄添强, 丁雪梅, 罗海峰, 黄丽清. 基于多尺度时空图卷积网络的交通出行需求预测[J]. 《计算机应用》唯一官方网站, 2024, 44(7): 2065-2072.
[11]	张郅, 李欣, 叶乃夫, 胡凯茜. 基于暗知识保护的模型窃取防御技术DKP[J]. 《计算机应用》唯一官方网站, 2024, 44(7): 2080-2086.
[12]	赵亦群, 张志禹, 董雪. 基于密集残差物理信息神经网络的各向异性旅行时计算方法[J]. 《计算机应用》唯一官方网站, 2024, 44(7): 2310-2318.
[13]	徐松, 张文博, 王一帆. 基于时空信息的轻量视频显著性目标检测网络[J]. 《计算机应用》唯一官方网站, 2024, 44(7): 2192-2199.
[14]	孙逊, 冯睿锋, 陈彦如. 基于深度与实例分割融合的单目3D目标检测方法[J]. 《计算机应用》唯一官方网站, 2024, 44(7): 2208-2215.
[15]	刘源泂, 何茂征, 黄益斌, 钱程. 基于ResNet50和改进注意力机制的船舶识别模型[J]. 《计算机应用》唯一官方网站, 2024, 44(6): 1935-1941.

基于改进Res-UNet的昼夜地基云图分割网络

Segmentation network for day and night ground-based cloud images based on improved Res-UNet

RichHTML

PDF

可视化

摘要/Abstract

引用本文

使用本文

图/表 15

参考文献 35

相关文章 15

编辑推荐

Metrics