《计算机应用》唯一官方网站

• •    下一篇

从U-Net到Transformer: 深度模型在医学图像分割中的应用综述

张玮智1,于谦2,苏金善3,乎西旦·居马洪4,林玲1   

  1. 1. 伊犁师范大学
    2. 山东女子学院
    3. 伊犁师范学院
    4. 伊犁师范大学 网络安全与信息技术学院
  • 收稿日期:2023-08-04 修回日期:2023-10-17 发布日期:2023-10-26 出版日期:2023-10-26
  • 通讯作者: 张玮智
  • 基金资助:
    自治区级研究生创新项目;伊犁师范大学校级重点项目;国家自然科学基金项目

Application review of deep models in medical image segmentation: from U-Net to Transformer

  • Received:2023-08-04 Revised:2023-10-17 Online:2023-10-26 Published:2023-10-26

摘要: 精准分割医学图像中的病灶对医生探寻病因和制定诊疗方案起关键作用,计算机视觉技术的发展促使深度学习在医学图像分割领域衍生出多种模型架构,U-Net架构以其巧妙的跳跃连接、易于优化的模块设计成为这一领域的基准模型。然而,U-Net以卷积神经网络(CNN)为主干,在长期建模依赖关系方面只擅长获取局部特征,基于CNN的各项方法在执行分割任务中缺乏对图像长期相关性的解释,无法提取全局特征。为帮助本领域学者了解U型网络的发展历程及研究现状,文中以问题为导向对近七年U型网络改进工作进行综述,首先,从改进结构位置的角度对U-Net及其各项改进模型进行叙述,探讨各工作的研究目的和创新设计及不足之处。其次,对Transformer与U型网络的结合方式进行分析,从中获取改进工作的研究动向。最后,在Synapse和ACDC数据集上进行对比实验,通过实验分析和可视化结果表明,Transformer方法在分割精度方面有显著优势,特别是混合网络子块的结合方式,在确保模型性能的同时兼顾效率,表明该类工作有着广阔的发展前景和研究价值。

关键词: 医学图像分割, U-Net, 结构改进, Transformer, 深度神经网络, 技术综述

Abstract: The accurate segmentation of lesions in medical images plays a key role in the physician's search for the cause of disease and the formulation of treatment plans. The development of computer vision technology has led to the derivation of various model architectures for deep learning in medical image segmentation. The U-Net architecture is widely used for its clever skip connection, easy-to-optimize module design has become a benchmark model in this field. However, U-Net network, with Convolutional Neural Network (CNN) as its backbone, is only good at acquiring local features in terms of long-term modeling dependencies, and various CNN-based methods lack the interpretation of long-term image correlations in performing segmentation tasks to extract global features. In order to help scholars in this field to understand the development history and research status of U-Net, in the text provides a problem-oriented overview of the U-Net improvement work in the last seven years. First, the U-Net and its various improved models are narrated from the perspective of improving the structural position, and the research purpose, innovative design and shortcomings of each work are discussed. Secondly, the combination method of Transformer and U-shaped network is analyzed to obtain research trends for improvement work. Finally, comparative experiments are conducted on Synapse and ACDC datasets, and the experimental analysis and visualization results show that the Transformer method has a significant advantage in segmentation accuracy, especially the way of combining hybrid network sub-blocks, which ensures the model performance while taking into account the efficiency, and proves that this class has a broad development prospect and research value.

Key words: medical image segmentation, U-Net, structural improvement, transformer, deep neural network, technology overview

中图分类号: