Journal of Computer Applications ›› 2026, Vol. 46 ›› Issue (5): 1667-1676.DOI: 10.11772/j.issn.1001-9081.2025050613

• Frontier and comprehensive applications • Previous Articles    

SAM Meibomian gland unified dense segmentation method with introduction of automatic prompt encoder

Ying JING, Ran LI(), Zhuo JIANG, Ziyang FU, Jingyi DU, Qi LIU, Jihang LIU   

  1. School of Information Engineering,Dalian Ocean University,Dalian Liaoning 116023,China
  • Received:2025-06-13 Revised:2025-09-16 Accepted:2025-09-25 Online:2025-10-17 Published:2026-05-10
  • Contact: Ran LI
  • About author:JING Ying, born in 2001, M. S. candidate. Her research interests include deep learning, image processing.
    JIANG Zhuo, born in 2000, M.S. candidate. Her research interests include textual semantic comprehension enhancement, structured knowledge fusion, interpretable reasoning.
    FU Ziyang, born in 2002, M. S. candidate. Her research interests include deep learning, image hierarchical processing.
    DU Jingyi, born in 1999, M. S. candidate. Her research interests include artificial intelligence, fault diagnosis.
    LIU Qi, born in 1999, M. S. candidate. His research interests include artificial intelligence, fault diagnosis.
    LIU Jihang, born in 2000, M. S. candidate. His research interests include fish abnormal behavior recognition and segmentation based on computer vision.
  • Supported by:
    Key Topics of Major Scientific Challenges and Pharmaceutical Technology Issues in 2022 by Chinese Medicine Education Association(2022KTM036)

引入自动提示编码器的SAM睑板腺统一密集分割方法

荆莹, 李然(), 蒋卓, 付子扬, 杜晶颐, 刘琪, 刘吉航   

  1. 大连海洋大学 信息工程学院,辽宁 大连 116023
  • 通讯作者: 李然
  • 作者简介:荆莹(2001—),女,山东临清人,硕士研究生,主要研究方向:深度学习、图像处理
    蒋卓(2000—),女,辽宁沈阳人,硕士研究生,主要研究方向:文本语义理解增强、结构化知识融合、可解释推理
    付子扬(2002—),女,辽宁沈阳人,硕士研究生,主要研究方向:深度学习、图像分级处理
    杜晶颐(1999—),女,辽宁凌源人,硕士研究生,主要研究方向:人工智能、故障诊断
    刘琪(1999—),男,江苏扬州人,硕士研究生,主要研究方向:人工智能、故障诊断
    刘吉航(2000—),男,辽宁庄河人,硕士研究生,主要研究方向:基于计算机视觉的鱼类异常行为识别与分割。
  • 基金资助:
    中国医药教育协会2022重大科学攻关问题和医药技术难题重点课题(2022KTM036)

Abstract:

The traditional Segment Anything Model (SAM) relies on manual prompts during segmentation of Meibomian gland images, making it difficult to handle issues such as dense glands, irregular shapes, and blurred boundaries. To address this, an improved model, namely ResSAM, was proposed. ResSAM eliminated the reliance on manual intervention by introducing an automatic prompt encoder. The backbone network was pruned and optimized to further enhance the model's segmentation efficiency. Focal Loss and Smooth IoU Loss were used for training optimization, and the SE (Squeeze-and-Excitation) and cross-attention mechanisms were integrated to reduce the impact of individual differences and blurred boundaries, thereby improving the model's segmentation accuracy. Experimental results on two self-built datasets, Lower Lid and Upper Lid, showed that ResSAM achieved the best performance in terms of the number of parameters and Giga FLoating-point OPerations (GFLOPs); its segmentation results obtained the highest Dice scores (88.69% and 87.75%, respectively) and the highest Intersection-over-Union (IoU) values (79.69% and 78.58%, respectively). The research results indicate that the ResSAM optimizes both efficiency and accuracy, supporting early prevention and clinical diagnosis of Meibomian Gland Dysfunction (MGD).

Key words: Segment Anything Model (SAM), image segmentation, automatic prompt encoder, backbone network pruning, Squeeze-and-Excitation (SE), cross-attention mechanism, Meibomian gland

摘要:

针对传统SAM(Segment Anything Model)在睑板腺图像分割中依赖人工提示,难以应对腺体密集、形态不规则及边界模糊的问题,提出改进模型ResSAM。该模型引入自动提示编码器消除人工干预的依赖;针对骨干网络进行剪枝优化,进一步提升模型分割效率;采用Focal Loss和Smooth IoU Loss优化训练,并融合SE(Squeeze-and-Excitation)与交叉注意力机制降低个体差异和边界模糊的影响,提升模型分割精度。在2个自建数据集Lower Lid和Upper Lid上的实验结果显示,ResSAM的参数量和十亿次浮点运算次数(GFLOPs)指标表现最优;分割结果具有最高Dice值,分别为88.69%和87.75%,以及最高的交并比(IoU)值,分别为79.69%和78.58%。研究结果表明,ResSAM在效率与精度方面均实现了优化,可为睑板腺功能障碍(MGD)的早期预防和临床诊断提供支持。

关键词: SAM, 图像分割, 自动提示编码器, 骨干网络剪枝, SE, 交叉注意力机制, 睑板腺

CLC Number: