Lightweight optic cup and disc segmentation method based on multi-scale feature enhancement

doi:10.11772/j.issn.1001-9081.2025091155

Journal of Computer Applications

GAI Rongli¹, WANG Junkai¹, WANG Zumin¹, DUAN Xiaoming²

1. College of Information Engineering, Dalian University 2. Beijing Key Laboratory of Ophthalmology and Visual Sciences, Beijing Tongren Hospital, Capital Medical University

Received: 2025-09-29 Revised: 2025-12-03 Online: 2026-03-16 Published: 2026-03-16
About the authors: GAI Rongli, born in 1980, Ph.D., professor. Her research interests include artificial intelligence, intelligent control, and smart healthcare. WANG Junkai, born in 2000, M.S. candidate. His research interests include smart healthcare. WANG Zumin, born in 1975, Ph.D., professor. His research interests include smart healthcare and Internet of Things technology. DUAN Xiaoming, born in 1974, Ph.D., associate chief physician. Her research interests include glaucoma, cataracts, and the diagnosis and treatment of common ocular surface diseases.
Supported by: Interdisciplinary Project of Dalian University (DLUXK-2025-FX-006)

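Corresponding author: DUAN Xiaoming
Author biographies: GAI Rongli (born 1980), female, from Dalian, Liaoning, professor, Ph.D., CCF member; main research interests: artificial intelligence, intelligent control, and smart healthcare. WANG Junkai (born 2000), male, from Chaoyang, Liaoning, M.S. candidate; main research interest: smart healthcare. WANG Zumin (born 1975), male, from Xinyang, Henan, professor, Ph.D., CCF member (12911D); main research interests: smart healthcare and Internet of Things technology. DUAN Xiaoming (born 1974), female, from Mudanjiang, Heilongjiang, associate chief physician, Ph.D.; main research interests: glaucoma, cataracts, and the diagnosis and treatment of common ocular surface diseases.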

Abstract

Abstract: Blurred boundaries and varied morphologies make accurate joint segmentation of the Optic Cup (OC) and Optic Disc (OD) in fundus images difficult in early glaucoma diagnosis. To address this challenge, a lightweight multi-scale feature enhancement network (LFM-Net) was proposed. The network adopts an encoder-decoder architecture and aims to improve segmentation accuracy by strengthening multi-scale feature representation and cross-layer feature fusion. In the encoder, global contextual features were extracted layer by layer using depthwise separable convolutions and inverted bottleneck structures. A Multi-Scale Feature enhancement Aggregation (MSFA) module was introduced, which uses multi-branch convolutions and a channel attention mechanism to adaptively capture and aggregate global contextual and local detail features at low computational cost, thereby handling the large size difference between the optic cup and the optic disc. In the decoder, a Convolutional Attention Feature Fusion Module (CAFM Fusion) was designed, which combines 3D convolutional attention and pixel attention to optimize feature transfer in skip connections, suppress background noise, sharpen edge responses, and fuse cross-layer features efficiently. Experimental results on three public fundus image datasets (REFUGE, DRISHTI-GS, and RIM-ONE-r3) show that LFM-Net outperforms comparison methods such as U-Net and TransUNet on key metrics including Dice coefficient, Intersection over Union (IoU), and accuracy. While remaining lightweight, LFM-Net accurately extracts OC and OD features and achieves high-precision segmentation. It also generalizes well across datasets, providing effective technical support for computer-aided diagnosis of glaucoma.

Key words: glaucoma diagnosis, deep learning, optic cup and disc segmentation, multi-scale feature fusion, attention mechanism
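
For readers unfamiliar with the two overlap metrics named in the abstract, the following is a minimal sketch of how the Dice coefficient and IoU are commonly computed for a pair of binary masks. It is not the authors' evaluation code. The function names, the nested-list mask representation, and the convention that two empty masks score 1.0 are assumptions made for illustration; a real pipeline would typically use an array library instead of nested lists.

```python
from typing import Sequence, Tuple

# Hypothetical representation: a 2D binary mask, 1 = structure (e.g. OC or OD), 0 = background.
Mask = Sequence[Sequence[int]]


def overlap_counts(pred: Mask, target: Mask) -> Tuple[int, int, int]:
    """Return (intersection, predicted pixels, target pixels) for two binary masks."""
    if len(pred) != len(target) or any(len(p) != len(t) for p, t in zip(pred, target)):
        raise ValueError("masks must have the same shape")
    inter = pred_sum = target_sum = 0
    for p_row, t_row in zip(pred, target):
        for p, t in zip(p_row, t_row):
            inter += 1 if (p and t) else 0
            pred_sum += 1 if p else 0
            target_sum += 1 if t else 0
    return inter, pred_sum, target_sum


def dice_coefficient(pred: Mask, target: Mask) -> float:
    """Dice = 2 * |P ∩ T| / (|P| + |T|)."""
    inter, pred_sum, target_sum = overlap_counts(pred, target)
    if pred_sum + target_sum == 0:
        return 1.0  # assumption: two empty masks count as a perfect match
    return 2 * inter / (pred_sum + target_sum)


def iou(pred: Mask, target: Mask) -> float:
    """IoU = |P ∩ T| / |P ∪ T|."""
    inter, pred_sum, target_sum = overlap_counts(pred, target)
    union = pred_sum + target_sum - inter
    if union == 0:
        return 1.0  # assumption: same convention as for Dice
    return inter / union


# Toy 3x3 example: 4 predicted pixels, 4 target pixels, 3 of them shared.
pred_mask = [[0, 1, 1],
             [0, 1, 1],
             [0, 0, 0]]
target_mask = [[0, 1, 1],
               [0, 1, 0],
               [0, 1, 0]]

dice_value = dice_coefficient(pred_mask, target_mask)  # 2*3 / (4+4) = 0.75
iou_value = iou(pred_mask, target_mask)                # 3 / (4+4-3) = 0.6
```

For a single pair of masks the two scores are related by Dice = 2·IoU / (1 + IoU), so they rank individual predictions identically, although averages over a dataset can differ.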

CLC Number: TP183

GAI Rongli, WANG Junkai, WANG Zumin, DUAN Xiaoming. Lightweight optic cup and disc segmentation method based on multi-scale feature enhancement[J]. Journal of Computer Applications, DOI: 10.11772/j.issn.1001-9081.2025091155.
