《计算机应用》唯一官方网站

• •    下一篇

基于非对称多解码器和注意力模块的三维肾脏影像结构分割模型

孔哲1,李寒1,甘少伟1,孔明茹1,何冰涛1,郭子钰1,金督程2,邱兆文3   

  1. 1. 东北林业大学计算机与控制工程学院
    2. 哈尔滨工业大学电子与信息工程学院
    3. 东北林业大学
  • 收稿日期:2023-06-19 修回日期:2023-08-17 发布日期:2023-09-18 出版日期:2023-09-18
  • 通讯作者: 孔哲
  • 基金资助:
    危重症患者远程医疗救治支持、质控与智能决策系统

Structure segmentation model for 3D kidney images based on asymmetric multi-decoder and attention module

  • Received:2023-06-19 Revised:2023-08-17 Online:2023-09-18 Published:2023-09-18

摘要: 针对肾脏结构中,因不同结构间差异大,动静脉体积小、结构薄及计算机断层扫描血管造影(CTA)图像灰度分布不均和伪影带来的精确分割困难的问题,提出基于非对称多解码器和注意力模块的三维肾脏影像结构分割模型MDAUnet(MultiDecoder-Attention-Unet)。首先,针对不同结构间差异大导致网络无法共享权重的问题,采用多解码器结构,为语义结构不同的特征结构匹配不同的解码器分支;其次,针对血管体积小,结构薄难分割的问题,引入非对称的空间通道联合注意力模块使模型更加关注管状结构,并对学习到的特征信息同时进行空间维度和通道维度的校准;最后,为了保证模型在反向传播中对血管结构有足够的关注,提出了改进的加权硬区域适应损失(WHRA)作为损失函数来动态的保持训练过程中血管结构的类间平衡以及保留背景信息的特征;此外,为了提高特征图灰度值的对比度,将传统图像处理边缘检测算子嵌入模型的预处理阶段,对待分割的感兴趣区域边界进行特征增强使得模型更关注边界信息并抑制伪影信息。实验结果表明,所提出的MDAUnet模型在肾脏结构分割任务上的Dice相似系数(DSC),豪斯多夫距离95(HD95)和平均表面距离(AVD)分别为89.1%,1.76mm和1.04mm,与MGANet相比,MDAUnet在DSC指标上提升了1.2个百分点;与UNETR相比,MDAUnet在HD95和ASD指标上分别降低了0.84毫米和0.45毫米。可见MDAUnet能有效提高肾脏三维结构分割精度,有助于医生在临床手术中客观有效的评估病情。

关键词: 肾脏三维结构分割, 注意力模块, 计算机断层血管造影, 损失函数, 边缘检测

Abstract: To address the problems of accurate segmentation difficulties caused by large differences between different structures, small size of arteries and veins, thin structures and uneven grayscale distribution and artifacts in computed tomography angiography(CTA) images in kidney structures, a kidney 3D structure segmentation model MDAUnet (MultiDecoder-Attention-Unet) based on multi-decoder and attention mechanism with CTA was proposed. Firstly, to address the problem that the network cannot share weights due to large differences between different structures, a multi-decoder structure was used to match different decoder branches for feature structures with different semantic structures; secondly, to address the problem that it was difficult to segment blood vessels with small size and thin structures, an asymmetric spatial channel joint attention module was introduced to make the model more focused on tubular structures, and the learned feature information was simultaneously spatially dimensional and channel Finally, in order to ensure that the model paid enough attention to the vessel structure in backpropagation, an improved WHRA(weighted hard region adaptation) loss was proposed as a loss function to dynamically maintain the inter-class balance of the vessel structure during training as well as to preserve the characteristics of the background information; In addition, in order to improve the contrast of the grayscale values of the feature map, the traditional image processing edge detection operator was embedded into the pre-processing stage of the model, and the feature enhancement of the boundary of the region of interest to be segmented made the model paid more attention to the boundary information and suppressed the artifact information. The results show that the Dice similarity coefficient (DSC), hausdorff distance 95(HD95) and average surface distance (AVD) of the proposed MDAUnet model on the kidney structure segmentation task are 89.1%, 1.76mm and 1.04mm, respectively. Compared with MGANet, MDAUnet improves the DSC index by 1.2 percentage points; compared with UNETR, MDAUnet reduces HD95 and ASD indexes by 0.84 mm and 0.45 mm, respectively. It can be seen that MDAUnet can effectively improve the segmentation accuracy of the three-dimensional structure of the kidney, and help doctors to evaluate the condition objectively and effectively in clinical operations.

Key words: kidney Three-Dimensional(3D) structural segmentation, attention module, computed tomography angiography (CTA), loss function, edge detection

中图分类号: