Spatial frequency divided attention network for ultrasound image segmentation


Journal of Computer Applications, 2021, Vol. 41, Issue 6: 1828-1835. DOI: 10.11772/j.issn.1001-9081.2020091470

Special Issue: Frontier and comprehensive applications

Spatial frequency divided attention network for ultrasound image segmentation

SHEN Xuewen^1,2, WANG Xiaodong^1,2, YAO Yu^1,2

1. Chengdu Institute of Computer Applications, Chinese Academy of Sciences, Chengdu Sichuan 610041, China;
2. University of Chinese Academy of Sciences, Beijing 100049, China

Received: 2020-09-21 Revised: 2020-11-12 Online: 2020-12-01 Published: 2021-06-10
Supported by:
This work is partially supported by the STS Regional Key Project of Chinese Academy of Sciences (KFJ-STS-QYZD-179).


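Corresponding author: SHEN Xuewen
About the authors: SHEN Xuewen (born 1995), female, from Longli, Guizhou, master's student; research interests: machine learning, image processing. WANG Xiaodong (born 1973), male, from Leshan, Sichuan, research fellow; research interests: network engineering. YAO Yu (born 1980), male, from Yibin, Sichuan, research fellow, Ph.D.; research interests: machine learning, pattern recognition.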

Abstract

Abstract: Medical ultrasound images are noisy and have blurred boundaries, which makes cardiac contours difficult to delineate. To address these problems, a Spatial Frequency Divided Attention Network for ultrasound image segmentation (SFDA-Net) was proposed. First, Octave convolution was used to process the high-frequency and low-frequency components of the image in parallel throughout the network, yielding more diverse information. Then, the Convolutional Block Attention Module (CBAM) was added so that the network focuses on informative features during feature recovery, reducing the loss of parts of the segmented target region. Finally, Focal Tversky Loss was used as the objective function to down-weight easy samples and focus on hard samples, and to reduce the errors introduced by pixel misclassification between classes. Multiple sets of comparative experiments show that, with fewer parameters than the original UNet++, SFDA-Net increases segmentation accuracy by 6.2 percentage points and the Dice score by 8.76 percentage points, raises mean Pixel Accuracy (mPA) to 84.09%, and raises mean Intersection over Union (mIoU) to 75.79%. SFDA-Net thus steadily improves network performance while reducing the number of parameters, and makes echocardiographic segmentation more accurate.
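The abstract names Focal Tversky Loss as the objective function but does not give its form. The following is a minimal NumPy sketch of one common formulation: a per-class Tversky index with a focal exponent of 1/gamma. It is not the authors' implementation. The function name `focal_tversky_loss`, the array layout, and the default values of `alpha`, `beta`, and `gamma` are assumptions chosen for illustration; the abstract does not state the hyperparameters used in SFDA-Net.

```python
import numpy as np


def focal_tversky_loss(probs, target, alpha=0.7, beta=0.3, gamma=4.0 / 3.0, eps=1e-7):
    """Focal Tversky loss summed over classes (illustrative sketch).

    probs  : predicted class probabilities, shape (num_classes, H, W)
    target : one-hot ground truth with the same shape
    alpha  : weight on false negatives (assumed default)
    beta   : weight on false positives (assumed default)
    gamma  : focal parameter; the exponent applied is 1 / gamma (assumed default)
    """
    probs = np.asarray(probs, dtype=np.float64)
    target = np.asarray(target, dtype=np.float64)
    spatial_axes = tuple(range(1, probs.ndim))

    # Soft counts per class.
    tp = (probs * target).sum(axis=spatial_axes)
    fn = ((1.0 - probs) * target).sum(axis=spatial_axes)
    fp = (probs * (1.0 - target)).sum(axis=spatial_axes)

    # Tversky index: 1 for a perfect match, near 0 for no overlap.
    tversky = (tp + eps) / (tp + alpha * fn + beta * fp + eps)

    # Focal exponent reshapes the per-class penalty.
    return float(((1.0 - tversky) ** (1.0 / gamma)).sum())


if __name__ == "__main__":
    truth = np.zeros((2, 2, 2))
    truth[0] = 1.0                       # every pixel belongs to class 0
    print(focal_tversky_loss(truth, truth))        # perfect prediction
    print(focal_tversky_loss(1.0 - truth, truth))  # completely wrong prediction
```

With `alpha` larger than `beta`, a missed target pixel is penalised more than a spurious one, which is one way such a loss can favour recall on small or hard-to-segment regions. Some implementations apply the exponent as `gamma` directly with `gamma < 1`; the two conventions are equivalent up to reparameterisation.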

Key words: echocardiogram, deep learning, image segmentation, spatial frequency division, attention mechanism
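The reported metrics (Dice score, mPA, mIoU) can all be derived from a per-class confusion matrix. The sketch below shows one conventional way to compute them from integer label maps. The function names and the choice to average Dice over classes are assumptions for illustration; the abstract does not specify how the authors averaged each metric.

```python
import numpy as np


def confusion_matrix(pred, truth, num_classes):
    """Rows index the true class, columns index the predicted class."""
    pred = np.asarray(pred).ravel()
    truth = np.asarray(truth).ravel()
    flat = truth * num_classes + pred
    counts = np.bincount(flat, minlength=num_classes ** 2)
    return counts.reshape(num_classes, num_classes)


def segmentation_metrics(pred, truth, num_classes):
    """Return class-averaged pixel accuracy, IoU, and Dice (illustrative sketch)."""
    cm = confusion_matrix(pred, truth, num_classes).astype(np.float64)
    tp = np.diag(cm)
    truth_px = cm.sum(axis=1)  # pixels that truly belong to each class
    pred_px = cm.sum(axis=0)   # pixels predicted as each class

    # Classes absent from both maps give 0/0 = NaN and are skipped by nanmean.
    with np.errstate(divide="ignore", invalid="ignore"):
        pixel_acc = tp / truth_px
        iou = tp / (truth_px + pred_px - tp)
        dice = 2.0 * tp / (truth_px + pred_px)

    return {
        "mPA": float(np.nanmean(pixel_acc)),
        "mIoU": float(np.nanmean(iou)),
        "Dice": float(np.nanmean(dice)),
    }


if __name__ == "__main__":
    truth = [[0, 0], [1, 1]]
    pred = [[0, 1], [1, 1]]
    print(segmentation_metrics(pred, truth, num_classes=2))
```

Because Dice equals 2·IoU / (1 + IoU) for each class, the two metrics rank a single class identically, but their class averages can differ, which is why papers often report both.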


CLC Number: TP391.41

SHEN Xuewen, WANG Xiaodong, YAO Yu. Spatial frequency divided attention network for ultrasound image segmentation[J]. Journal of Computer Applications, 2021, 41(6): 1828-1835.

