Structure segmentation model for 3D kidney images based on asymmetric multi-decoder and attention module

Journal of Computer Applications, 2024, Vol. 44, Issue (7): 2216-2224. DOI: 10.11772/j.issn.1001-9081.2023060773

Section: Multimedia computing and computer simulation

Zhe KONG¹, Han LI¹, Shaowei GAN¹, Mingru KONG¹, Bingtao HE¹, Ziyu GUO¹, Ducheng JIN², Zhaowen QIU¹

1. College of Computer and Control Engineering, Northeast Forestry University, Harbin Heilongjiang 150040, China
2. School of Electronics and Information Engineering, Harbin Institute of Technology, Harbin Heilongjiang 150001, China

Received: 2023-06-19 Revised: 2023-08-17 Accepted: 2023-08-21 Online: 2023-09-18 Published: 2024-07-10
Corresponding author: Zhaowen QIU
About the authors:
KONG Zhe, born in 1997 in Tai'an, Shandong, M. S. candidate. His research interests include medical image analysis.
LI Han, born in 1999 in Tai'an, Shandong, M. S. candidate. Her research interests include medical image analysis.
GAN Shaowei, born in 1996 in Anqing, Anhui, M. S. candidate, CCF member. His research interests include computer vision.
KONG Mingru, born in 1989 in Mudanjiang, Heilongjiang, Ph. D., associate professor. Her research interests include medical image reconstruction.
HE Bingtao, born in 1997 in Lingbao, Henan, M. S. candidate, CCF member. His research interests include medical image analysis.
GUO Ziyu, born in 1999 in Gaoping, Shanxi, M. S. candidate. His research interests include medical image analysis.
JIN Ducheng, born in 2002 in Harbin, Heilongjiang, M. S. candidate. His research interests include remote sensing data analysis and processing.
Primary contact: QIU Zhaowen, born in 1974 in Harbin, Heilongjiang, Ph. D., professor, CCF member. His research interests include medical image analysis.
Supported by:
Key Research and Development Plan in Heilongjiang Province (SC2022ZX01 A0201)

Abstract

Accurate segmentation of kidney structures is difficult because the structures differ greatly from one another, arteries and veins are small and thin, and Computed Tomography Angiography (CTA) images suffer from uneven grayscale distribution and artifacts. To address these problems, a 3D kidney image structure segmentation model based on an asymmetric multi-decoder and attention module, MDAUnet (MultiDecoder-Attention-Unet), was proposed. Firstly, because large differences between structures prevent the network from sharing weights, a multi-decoder structure was used so that features with different semantic structures are matched to different decoder branches. Secondly, because small and thin blood vessels are hard to segment, an asymmetric spatial-channel joint attention module was introduced to make the model focus more on tubular structures and to calibrate the learned features in the spatial dimension and the channel dimension simultaneously. Finally, to ensure that the model paid enough attention to vessel structures during back propagation, an improved Weighted Hard Region Adaptation (WHRA) loss was proposed as the loss function; it dynamically maintains the inter-class balance of the vessel structures during training while preserving the features of the background information. In addition, to improve the grayscale contrast of the feature maps, an edge detection operator from traditional image processing was embedded into the pre-processing stage of the model; enhancing the boundary features of the region of interest to be segmented makes the model focus more on boundary information and suppresses artifact information.

The experimental results show that the Dice Similarity Coefficient (DSC), 95th-percentile Hausdorff Distance (HD95) and Average Surface Distance (ASD) of the proposed MDAUnet model on the kidney structure segmentation task are 89.1%, 1.76 mm and 1.04 mm, respectively. Compared with the second-best MGANet (Meta Greyscale Adaptive Network), MDAUnet improves DSC by 1.2 percentage points; compared with the second-best UNETR (UNEt TRansformers), MDAUnet reduces HD95 and ASD by 0.87 mm and 0.45 mm, respectively. MDAUnet can therefore effectively improve the segmentation accuracy of the three-dimensional structure of the kidney and help doctors evaluate a patient's condition objectively and effectively in clinical operations.

Key words: kidney Three-Dimensional (3D) structural segmentation, attention module, Computed Tomography Angiography (CTA), loss function, edge detection
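
The abstract reports overlap quality as the Dice Similarity Coefficient. As a minimal sketch of that metric only (not the authors' evaluation code), the function below computes DSC for two binary masks given as flat 0/1 sequences. The function name `dice_similarity` and the convention that two empty masks score 1.0 are assumptions made for this illustration.

```python
def dice_similarity(pred, target):
    """Dice similarity coefficient between two binary masks (flat 0/1 sequences)."""
    if len(pred) != len(target):
        raise ValueError("masks must have the same number of voxels")
    # Voxels that are foreground in both masks.
    intersection = sum(1 for p, t in zip(pred, target) if p and t)
    # Total foreground voxels in prediction plus ground truth.
    total = sum(1 for p in pred if p) + sum(1 for t in target if t)
    # Assumed convention: two empty masks count as a perfect match.
    return 1.0 if total == 0 else 2.0 * intersection / total


# Toy example: 3 predicted foreground voxels, 4 true ones, 3 overlapping.
pred = [0, 1, 1, 1, 0, 0, 0, 0]
target = [0, 1, 1, 1, 1, 0, 0, 0]
dsc = dice_similarity(pred, target)  # 2 * 3 / (3 + 4) = 6/7
```

For a 3D volume, the same function applies after flattening the predicted and ground-truth label maps for one structure (for example, the vein class) into 0/1 sequences.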

CLC number:

TP391.41

Zhe KONG, Han LI, Shaowei GAN, Mingru KONG, Bingtao HE, Ziyu GUO, Ducheng JIN, Zhaowen QIU. Structure segmentation model for 3D kidney images based on asymmetric multi-decoder and attention module[J]. Journal of Computer Applications, 2024, 44(7): 2216-2224.

Figures/Tables: 14

References: 32

1	SHAO P， QIN C， YIN C， et al. Laparoscopic partial nephrectomy with segmental renal artery clamping： technique and clinical outcomes ［J］. European Urology， 2011， 59（5）： 849-855.
2	SHAO P， TANG L， LI P， et al. Precise segmental renal artery clamping under the guidance of dual-source computed tomography angiography during laparoscopic partial nephrectomy ［J］. European Urology， 2012， 62（6）： 1001-1008.
3	LEVINE M D， SHAHEEN S I. A modular computer vision system for picture segmentation and interpretation ［J］. IEEE Transactions on Pattern Analysis and Machine Intelligence， 1981， 3（5）： 540-556.
4	BALASUBRAMANIAN C， SARAVANAN S， SRINIVASAGAN K G， et al. Automatic segmentation of brain tumor from MR image using region growing technique ［J］. Life Science Journal， 2013， 10（2）： 2878-2883.
5	OTSU N. A threshold selection method from gray-level histograms ［J］. IEEE Transactions on Systems， Man， and Cybernetics， 1979， 9（1）： 62-66.
6	LEE H S， HONG H， KIM J. Detection and segmentation of small renal masses in contrast-enhanced CT images using texture and context feature classification ［C］// Proceedings of the 2017 IEEE 4th International Symposium on Biomedical Imaging. Piscataway： IEEE， 2017： 583-586.
7	DUAN R L， LI Q X， LI Y H. Summary of image edge detection ［J］. Optical Technique， 2005， 31（3）： 415-419.
8	BUKHARI S T， MOHY-UD-DIN H. E1D3 U-Net for brain tumor segmentation： submission to the RSNA-ASNR-MICCAI BraTS 2021 challenge ［C］// Proceedings of the 2021 International MICCAI Brainlesion Workshop. Cham： Springer， 2021： 276-288.
9	QIN Y， ZHENG H， GU Y， et al. Learning tubule-sensitive CNNs for pulmonary airway and artery-vein segmentation in CT ［J］. IEEE Transactions on Medical Imaging， 2021， 40（6）： 1603-1617.
10	SHI G， XIAO L， CHEN Y， et al. Marginal loss and exclusion loss for partially supervised multi-organ segmentation ［J］. Medical Image Analysis， 2021， 70： 101979.
11	ZHOU C， DING C， WANG X， et al. One-pass multi-task networks with cross-task guided attention for brain tumor segmentation ［J］. IEEE Transactions on Image Processing， 2020， 29： 4516-4529.
12	SHAMSHAD F， KHAN S， ZAMIR S W， et al. Transformers in medical imaging： a survey ［J］. Medical Image Analysis， 2023， 88： 102802.
13	HELLER N， ISENSEE F， MAIER-HEIN K H， et al. The state of the art in kidney and kidney tumor segmentation in contrast-enhanced CT imaging： results of the KiTS19 challenge ［J］. Medical Image Analysis， 2021， 67： 101821.
14	ZHANG Y， WANG Y， HOU F， et al. Cascaded volumetric convolutional network for kidney tumor segmentation from CT volumes ［EB/OL］. （2020-05-04）［2023-06-19］..
15	WANG C， ODA M， HAYASHI Y， et al. Tensor-cut： a tensor-based graph-cut blood vessel segmentation method and its application to renal artery segmentation ［J］. Medical Image Analysis， 2020， 60： 101623.
16	TAHA A， LO P， LI J， et al. Kid-Net： convolution networks for kidney vessels segmentation from CT-volumes ［C］// Proceedings of the 2018 International Conference on Medical Image Computing and Computer-Assisted Intervention. Cham： Springer， 2018： 463-471.
17	HE Y， YANG G， YANG J， et al. Dense biased networks with deep priori anatomy and hard region adaptation： semi-supervised learning for fine renal artery segmentation ［J］. Medical Image Analysis， 2020， 63： 101722.
18	HE Y， YANG G， YANG J， et al. Meta grayscale adaptive network for 3D integrated renal structures segmentation ［J］. Medical Image Analysis， 2021， 71： 102055.
19	TAKIKAWA T， ACUNA D， JAMPANI V， et al. Gated-SCNN： gated shape CNNs for semantic segmentation ［C］// Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision. Piscataway： IEEE， 2019： 5229-5238.
20	ZHOU L， MENG X， HUANG Y， et al. An interpretable deep learning workflow for discovering subvisual abnormalities in CT scans of COVID-19 inpatients and survivors ［J］. Nature Machine Intelligence， 2022， 4： 494-503.
21	ZHOU L， LI Z， ZHOU J， et al. A rapid， accurate and machine-agnostic segmentation and quantification method for CT-based COVID-19 diagnosis ［J］. IEEE Transactions on Medical Imaging， 2020， 39（8）： 2638-2652.
22	ROY A G， NAVAB N， WACHINGER C. Concurrent spatial and channel ‘squeeze & excitation’ in fully convolutional networks ［C］// Proceedings of the 2018 International Conference on Medical Image Computing and Computer-Assisted Intervention. Cham： Springer， 2018： 421-429.
23	RONNEBERGER O， FISCHER P， BROX T. U-net： convolutional networks for biomedical image segmentation ［C］// Proceedings of the 2015 International Conference on Medical Image Computing and Computer-Assisted Intervention. Cham： Springer， 2015： 234-241.
24	WOO S， PARK J， LEE J-Y， et al. CBAM： convolutional block attention module ［C］// Proceedings of the 15th European Conference on Computer Vision. Berlin： Springer， 2018： 3-19.
25	PARK J， WOO S， LEE J-Y， et al. BAM： bottleneck attention module ［EB/OL］. （2018-07-20）［2023-06-19］..
26	LOSHCHILOV I， HUTTER F. Decoupled weight decay regularization ［EB/OL］. （2019-01-04）［2023-06-19］..
27	LOSHCHILOV I， HUTTER F. SGDR： stochastic gradient descent with warm restarts ［EB/OL］. （2017-05-03）［2023-06-19］..
28	ABDOLLAHI A， PRADHAN B， ALAMRI A. VNet： an end-to-end fully convolutional neural network for road extraction from high-resolution remote sensing data ［J］. IEEE Access， 2020， 8： 179424-179436.
29	ISENSEE F， JAEGER P F， KOHL S A A， et al. nnU-Net： a self-configuring method for deep learning-based biomedical image segmentation ［J］. Nature Methods， 2021， 18： 203-211.
30	DIAKOGIANNIS F I， WALDNER F， CACCETTA P， et al. ResUNet-a： a deep learning framework for semantic segmentation of remotely sensed data ［J］. ISPRS Journal of Photogrammetry and Remote Sensing， 2020， 162： 94-114.
31	HATAMIZADEH A， TANG Y， NATH V， et al. UNETR： Transformers for 3D medical image segmentation ［C］// Proceedings of the 2022 IEEE/CVF Winter Conference on Applications of Computer Vision. Piscataway： IEEE， 2022： 1748-1758.
32	KONG M， QIN Z， ZHANG P， et al. Study on modified poplar wood powder/polylactic acid high toughness green 3D printing composites ［J］. International Journal of Biological Macromolecules， 2023， 228： 311-322.

Model	Kidney			Tumor			Artery			Vein			Average
Model	DSC/%	HD95/mm	ASD/mm	DSC/%	HD95/mm	ASD/mm	DSC/%	HD95/mm	ASD/mm	DSC/%	HD95/mm	ASD/mm	DSC/%	HD95/mm	ASD/mm
Kid-Net^［16］	94.3	13.10	0.66	82.7	12.04	2.42	78.0	15.15	2.91	75.4	1.82	1.19	82.6	10.50	1.79
MGANet^［18］	95.1	*	*	86.4	*	*	89.0	*	*	81.0	*	*	87.9	*	*
Unet^［23］	94.9	12.24	0.33	82.0	5.29	3.75	80.2	2.56	2.25	73.7	1.86	0.78	82.7	5.50	1.80
Vnet^［28］	94.3	8.17	0.06	81.5	4.05	3.31	84.3	4.17	3.58	76.4	3.13	1.45	84.1	4.90	2.10
nnUnet^［29］	95.3	7.50	2.12	72.2	8.17	21.50	81.7	3.90	3.71	89.5	1.96	1.59	84.6	5.40	7.90
ResUnet^［30］	94.4	1.29	0.05	82.0	2.64	1.46	84.5	6.14	4.40	76.8	2.00	0.81	84.0	3.00	1.70
UNETR^［31］	95.1	1.94	0.87	83.8	4.74	1.99	82.2	2.05	1.98	77.3	1.81	1.13	84.6	2.63	1.49
MDAUnet	96.3	1.44	0.04	89.6	2.48	1.31	83.2	2.37	2.36	87.3	0.78	0.46	89.1	1.76	1.04
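
The average column and the margins quoted in the abstract can be checked directly from the table values. The short sketch below only repeats that arithmetic; the variable names are chosen for this illustration, and all numbers are copied from the rows above.

```python
# Per-structure DSC (%) for MDAUnet from the table: kidney, tumor, artery, vein.
mdaunet_dsc = [96.3, 89.6, 83.2, 87.3]
mean_dsc = round(sum(mdaunet_dsc) / len(mdaunet_dsc), 1)

# Average-column values of the second-best methods, also from the table.
mganet_mean_dsc = 87.9
unetr_mean_hd95, unetr_mean_asd = 2.63, 1.49
mdaunet_mean_hd95, mdaunet_mean_asd = 1.76, 1.04

dsc_gain = round(mean_dsc - mganet_mean_dsc, 1)            # percentage points
hd95_drop = round(unetr_mean_hd95 - mdaunet_mean_hd95, 2)  # mm
asd_drop = round(unetr_mean_asd - mdaunet_mean_asd, 2)     # mm

print(mean_dsc, dsc_gain, hd95_drop, asd_drop)
```

The results reproduce the reported mean DSC of 89.1% and the stated margins of 1.2 percentage points, 0.87 mm and 0.45 mm.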

scSE	Scharr filter	WHRA	Kidney		Tumor		Vein		Artery		Average
scSE	Scharr filter	WHRA	DSC/%	HD95/mm	DSC/%	HD95/mm	DSC/%	HD95/mm	DSC/%	HD95/mm	DSC/%	HD95/mm
			96.0	1.98	87.1	4.03	81.3	2.08	78.4	4.24	85.7	3.08
√			96.4	1.84	90.3	2.55	83.6	1.86	80.7	3.67	87.8	2.48
	√		96.2	2.25	88.8	3.04	83.6	1.46	83.5	2.95	87.5	2.42
		√	96.3	2.23	86.3	3.19	86.5	1.05	82.8	3.05	87.9	2.38
√	√		96.3	1.59	86.8	2.62	87.0	1.12	82.7	2.88	88.2	2.05
√		√	96.0	1.98	85.1	2.83	87.7	1.07	82.4	2.85	87.8	2.18
√	√	√	96.3	1.44	89.6	2.48	87.3	0.78	83.2	2.37	89.1	1.76
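
One of the ablated components is Scharr filtering used for edge enhancement in pre-processing. As a rough illustration of what a Scharr operator computes, the sketch below applies the standard 3x3 Scharr kernels to a single 2D slice and returns the gradient magnitude. This is an assumption-laden toy: it works slice-wise in 2D with plain Python lists, the helper names `correlate3x3` and `scharr_magnitude` are hypothetical, and how the paper fuses the edge response with the CTA volume is not stated here.

```python
import math

# Standard 3x3 Scharr kernels (unnormalized).
SCHARR_X = [[-3, 0, 3], [-10, 0, 10], [-3, 0, 3]]
SCHARR_Y = [[-3, -10, -3], [0, 0, 0], [3, 10, 3]]


def correlate3x3(image, kernel):
    """Valid-mode 3x3 cross-correlation of a 2D list-of-lists image."""
    rows, cols = len(image), len(image[0])
    out = []
    for r in range(1, rows - 1):
        row = []
        for c in range(1, cols - 1):
            acc = 0
            for dr in range(3):
                for dc in range(3):
                    acc += kernel[dr][dc] * image[r + dr - 1][c + dc - 1]
            row.append(acc)
        out.append(row)
    return out


def scharr_magnitude(image):
    """Gradient magnitude from horizontal and vertical Scharr responses."""
    gx = correlate3x3(image, SCHARR_X)
    gy = correlate3x3(image, SCHARR_Y)
    return [[math.hypot(x, y) for x, y in zip(rx, ry)] for rx, ry in zip(gx, gy)]


# Toy slice with a vertical step edge between columns 1 and 2.
slice_2d = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
]
edges = scharr_magnitude(slice_2d)  # 2x2 interior; every window straddles the edge
```

On this toy slice every interior window contains the step, so each response is 3 + 10 + 3 = 16 horizontally and 0 vertically; a uniform region would give 0.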

T	DSC/%
0.100	88.8
0.050	88.8
0.010	89.1
0.005	88.9

Model	Parameters/10⁶	Model size/MB
Without scSE	30.93	122
Encoder + scSE	31.07	123
Decoder + symmetric scSE	31.45	124
Decoder + asymmetric scSE	31.27	123
