Journal of Computer Applications ›› 2026, Vol. 46 ›› Issue (1): 270-279.DOI: 10.11772/j.issn.1001-9081.2024121765

• Multimedia computing and computer simulation • Previous Articles     Next Articles

Trident generative adversarial network for low-dose CT image denoising

Lifang WANG1, Wenjing REN1, Xiaodong GUO2(), Rongguo ZHANG1, Lihua HU1   

  1. 1.College of Computer Science and Technology,Taiyuan University of Science and Technology,Taiyuan Shanxi 030024,China
    2.School of Electronic Information Engineering,Taiyuan University of Science and Technology,Taiyuan Shanxi 030024,China
  • Received:2024-12-16 Revised:2025-03-17 Accepted:2025-03-18 Online:2026-01-10 Published:2026-01-10
  • Contact: Xiaodong GUO
  • About author:WANG Lifang, born in 1975, Ph. D., associate professor. Her research interests include image and graphics processing, intelligent optimization.
    REN Wenjing, born in 2000, M. S. candidate. Her research interests include computer vision, image denoising.
    ZHANG Rongguo, born in 1964, Ph. D., professor. His research interests include computer graphics and aided design, image and graphics processing, pattern recognition.
    HU Lihua, born in 1982, Ph. D., professor. Her research interests include computer vision, pattern recognition.
  • Supported by:
    National Natural Science Foundation of China(62273248);Natural Science Foundation of Shanxi Province(202203021211206);Postgraduate Educational Reform Project of Shanxi Province(2022YJJG191);Postgraduate Innovation Project of Shanxi Province(2023KY657)

用于低剂量CT图像降噪的多路特征生成对抗网络

王丽芳1, 任文婧1, 郭晓东2(), 张荣国1, 胡立华1   

  1. 1.太原科技大学 计算机科学与技术学院,太原 030024
    2.太原科技大学 电子信息工程学院,太原 030024
  • 通讯作者: 郭晓东
  • 作者简介:王丽芳(1975—),女,山西和顺人,副教授,博士, CCF会员,主要研究方向:图形图像处理、智能优化
    任文婧(2000—),女,山西长治人,硕士研究生,主要研究方向:计算机视觉、图像降噪
    张荣国(1964—),男,山西太原人,教授,博士, CCF会员,主要研究方向:计算机图形学与辅助设计、图形图像处理、模式识别
    胡立华(1982—),女,山西忻州人,教授,博士, CCF会员,主要研究方向:计算机视觉、模式识别。
  • 基金资助:
    国家自然科学基金资助项目(62273248);山西省自然科学基金资助项目(202203021211206);山西省研究生教育改革项目(2022YJJG191);山西省研究生科研创新项目(2023KY657)

Abstract:

In recent years, significant progress has been made in applying Generative Adversarial Network (GAN) in Low-Dose Computed Tomography (LDCT) image denoising. However, the existing methods face challenges such as insufficient modeling capability for complex noise distribution and limited ability to preserve structural details. Therefore, a multi-path GAN for LDCT image denoising — Trident GAN was proposed. Firstly, a feature guided generator Trident Uformer was designed. In this generator, a Feature Polymerization Attention (FPA) module was added at the bottleneck layer of the U-Net structure, thereby solving the problem of low spatial resolution in a U-shaped structure. Secondly, a multi-path feature extraction submodule Trident Block was designed, and in each of the three blocks, a Local Detail Enhancement Block (LDEB) was introduced to extract detailed features, a Lightweight Channel Attention Block (LCAB) was incorporated to enhance channel features, and a Spatial Interaction Attention Block (SIAB) was utilized to capture important spatial features, respectively. Within the SIAB, a multi-level interactive attention function and evaluation mechanism were employed to design a Spatial Context Attention Mechanism (SCAM), which addresses the limitations of single-attention mechanisms. Finally, a Multi-Feature Fusion (MFF) module was designed to realize feature aggregation at the end of the three blocks, and model both local detail information and global semantic information, and solve the problem of discontinuous details across different levels. Furthermore, the Multi-Scale Pyramid Discriminator (MSPD) was used to check the quality of the generated results at different dimensions, and guide the generation of globally consistent images. Experimental results show that Trident GAN achieves the average Peak Signal-to-Noise Ratio (PSNR) and Structural Similarity Index Measure (SSIM) of 31.519 3 dB/0.883 0 and 33.633 1 dB/0.947 8, respectively, on Mayo and Piglet datasets. Compared with High-Frequency Sensitive GAN (HFSGAN), this method has the number of parameters reduced by 75.58% and the test time reduced by 36.36%. It can be seen that compared with the existing methods such as HFSGAN, Trident GAN improves image quality with less computational load.

Key words: Low-Dose Computed Tomography (LDCT), image denoising, attention mechanism, Transformer, Generative Adversarial Network (GAN)

摘要:

近些年,把生成对抗网络(GAN)应用于低剂量计算机断层扫描(LDCT)图像降噪取得了显著进展。然而,现有方法存在对复杂噪声分布建模能力不足以及结构细节保留能力有限等问题。因此,提出一种用于LDCT图像降噪的多路特征GAN ——Trident GAN。首先,设计特征引导生成器Trident Uformer,通过在U-Net结构的瓶颈层增加特征聚合注意力(FPA)模块解决U型结构空间分辨率较低的问题;其次,设计多路特征提取子模块Trident Block,并在3个分支中分别引入局部细节增强模块(LDEB)提取细节特征,轻量通道注意力模块(LCAB)增强通道特征,以及空间交互注意力模块(SIAB)获得重要空间特征;在SIAB中采用多级交互式注意力函数和评估机制设计空间上下文注意力机制(SCAM),解决单一注意力受限的问题;最后,设计多特征融合(MFF)模块来在三分支末端进行特征聚合,并对局部细节信息和全局语义信息进行建模,解决不同层次之间细节不连续的问题。此外,利用多尺度金字塔判别器(MSPD)在不同维度下检查生成结果的质量,指导具有全局一致性图像的生成。实验结果表明,在Mayo和Piglet数据集上,Trident GAN的平均峰值信噪比(PSNR)和结构相似性(SSIM)分别达到了31.519 3 dB/0.883 0和33.633 1 dB/0.947 8,与高频敏感GAN (HFSGAN)相比,参数量降低75.58%,测试时间缩短36.36%。可见,与HFSGAN等方法相比, Trident GAN可在较少的计算负荷下提高了图像质量。

关键词: 低剂量计算机断层扫描, 图像降噪, 注意力机制, Transformer, 生成对抗网络

CLC Number: