Journal of Computer Applications ›› 2026, Vol. 46 ›› Issue (1): 280-288.DOI: 10.11772/j.issn.1001-9081.2024121868

• Multimedia computing and computer simulation • Previous Articles     Next Articles

Multi-scale and spatial frequency feature-based image segmentation network for pheochromocytoma

Chaoyun MAI1, Hongyi ZHANG1, Chuanbo QIN1(), Junying ZENG1, Dong WANG2   

  1. 1.School of Electronics and Information Engineering,Wuyi University,Jiangmen Guangdong 529020,China
    2.Peking Union Medical College Hospital,Chinese Academy of Medical Sciences,Beijing 100730,China
  • Received:2025-01-03 Revised:2025-03-21 Accepted:2025-03-25 Online:2026-01-10 Published:2026-01-10
  • Contact: Chuanbo QIN
  • About author:MAI Chaoyun, born in 1989, Ph. D., associate professor. His research interests include intelligent information processing, digital signal processing.
    ZHANG Hongyi, born in 1999, M. S. candidate. His research interests include biometric identification, medical image analysis.
    ZENG Junying, born in 1977, Ph. D., professor. His research interests include computer image processing, biometric identification.
    WANG Dong, born in 1981, Ph. D. His research interests include clinical diagnosis and treatment of complex pheochromocytoma.
  • Supported by:
    Key Research Platform Project for Ordinary Universities in Guangdong Province(2024ZDZX1008);Guangdong Provincial Medical Research Fund Project(B2023100);Science and Technology Program in Healthcare Sector of Jiangmen City(2022YL01029)

基于多尺度与空间频率特征的嗜铬细胞瘤图像分割网络

麦超云1, 张洪燚1, 秦传波1(), 曾军英1, 王栋2   

  1. 1.五邑大学 电子与信息工程学院,广东 江门 529020
    2.中国医学科学院 北京协和医院,北京 100730
  • 通讯作者: 秦传波
  • 作者简介:麦超云(1989—),男,广东江门人,副教授,博士,CCF会员,主要研究方向:智能信息处理、数字信号处理
    张洪燚(1999—),男,河南洛阳人,硕士研究生,CCF会员,主要研究方向:生物特征识别、医学图像分析
    曾军英(1977—),男,江西赣州人,教授,博士,主要研究方向:计算机图像处理、生物特征识别
    王栋(1981—),男,山东日照人,博士,主要研究方向:复杂嗜铬细胞瘤临床诊疗。
  • 基金资助:
    广东省普通高校重点研究平台项目(2024ZDZX1008);广东省医学科研基金资助项目(B2023100);江门市医疗卫生领域科技计划项目(2022YL01029)

Abstract:

To address the issue of insufficient learning of features between different organs in the segmentation of pheochromocytoma images from partially annotated abdominal datasets, which leads to difficulties in distinguishing tumor and surrounding organ boundaries accurately, a Multi-scale and spatial Frequency feature-based image segmentation Network for pheochromocytoma (MF-Net) was proposed. Firstly, a Multi-scale Spatial Frequency Channel Attention module (MSFCA) was constructed, which enhanced the capture of inter-organ texture and boundary features by weighting and fusing the frequency domain information of image and the multi-scale feature maps from adjacent encoders, thereby highlighting the feature representation ability of tumor regions. Then, an Upsampling Multi-Scale Feature Fusion module (UMFF) was introduced, which combined upsampled feature maps of different scales to enhance the model's ability to recognize objects of varying sizes in image. Finally, an Adaptive Objective loss function (AOb) was utilized, which calculated the loss for the annotated abdominal organ labels and adjusted the loss weights according to the annotated organ categories, thereby optimizing learning process of the segmentation network. Experimental results show that on the abdominal organ and pheochromocytoma datasets, MF-Net has the segmentation accuracy improved by 3.33 and 3.18 percentage points, respectively, compared to the separately trained nnU-Net (no new U-Net), with Dice similarity coefficient (Dice) and Normalized Surface Dice (NSD) reached 89.07% and 92.85%, respectively. On the external datasets, MF-Net has the Dice and NSD of 84.66% and 90.55%, respectively. In addition visualization results indicate that MF-Net can handle complex backgrounds and blurred boundaries in pheochromocytoma images better, providing better technical support for accurate diagnosis and treatment of pheochromocytoma.

Key words: abdominal organ segmentation, multi-scale feature, spatial frequency, partially annotated dataset, pheochromocytoma medical image

摘要:

针对腹部部分注释数据集的嗜铬细胞瘤图像分割缺乏不同器官间的特征学习,导致分割中难以准确区分肿瘤及周边器官边缘的问题,提出一种基于多尺度与空间频率特征的嗜铬细胞瘤图像分割网络(MF-Net)。首先,构建多尺度空间频率通道注意力模块(MSFCA)对图像频域信息和相邻编码器的多尺度特征图进行加权融合,以强化器官间纹理和边界特征的捕捉,从而突出肿瘤区域的特征表示能力;其次,引入上采样多尺度特征融合模块(UMFF)通过结合上采样得到的不同尺度特征图,增强模型对图像中不同大小对象的识别能力;最后,利用自适应目标损失函数(AOb)对有注释腹部器官标签进行损失计算,并根据注释器官类别调整损失权重大小,从而优化分割网络的学习过程。实验结果表明,在腹部器官和嗜铬细胞瘤数据集上, MF-Net的分割准确率相较于单独训练的nnU-Net (no new U-Net)分别提升了3.33和3.18个百分点,而Dice系数(Dice)和归一化表面距离(NSD)分别为89.07%和92.85%;在域外数据集上, MF-Net的Dice和NSD分别为84.66%和90.55%。此外,可视化结果表明, MF-Net能更好地处理嗜铬细胞瘤图像中的复杂背景和模糊边界,为嗜铬细胞瘤的精确诊断和治疗提供了更好的技术支持。

关键词: 腹部器官分割, 多尺度特征, 空间频率, 部分注释数据集, 嗜铬细胞瘤医学图像

CLC Number: