Journal of Computer Applications

    Next Articles

Unsupervised low-light image enhancement method with diffusion priors and detection-oriented bridging

CHENG Jian1,2, XU Bingxin1,2, PAN Weiguo1,2, LIU Hongzhe1,2, DAI Songyin1,2, XU Cheng1,2   

  1. 1.Beijing Key Laboratory of Information Service Engineering (Beijing Union University) 2. College of Robotics, Beijing Union University
  • Received:2026-03-19 Revised:2026-04-24 Online:2026-05-20 Published:2026-05-20
  • About author:CHENG Jian, born in 2001, M.S. candidate. His research interests include computer vision, low-light image enhancement. XU Bingxin, born in 1985, Ph. D., professor. Her research interests include computer vision, image processing. PAN Weiguo, born in 1984, Ph. D., associate professor. His research interests include machine learning, object detection. LIU Hongzhe, born in 1971, Ph. D., professor. Her research interests include visual intelligence, cognitive computing. DAI Songyin, born in 1980, Ph. D., associate professor. Her research interests include computer vision, image processing. XU Cheng, born in 1988, Ph. D., associate professor. His research interests include digital simulation technology, data security.
  • Supported by:
    National Natural Science Foundation of China (62572057); Natural Science Foundation of Beijing (4242020,4232026)

基于扩散先验与检测桥接的无监督低光图像增强方法

程剑1,2,徐冰心1,2,潘卫国1,2,刘宏哲1,2,代松银1,2,徐成1,2   

  1. 1. 北京市信息服务工程重点实验室(北京联合大学) 2. 北京联合大学 机器人学院
  • 通讯作者: 徐冰心
  • 作者简介:程剑(2001—),男,北京人,硕士研究生,主要研究方向:计算机视觉、低光图像增强;徐冰心(1985—),女,吉林省吉林市人,教授,博士,主要研究方向:计算机视觉、图像处理;潘卫国(1984—),男,河北邯郸人,副教授,博士,主要研究方向:机器学习、目标检测;刘宏哲(1971—),女,北京人,教授,博士,CCF高级会员(27562S),主要研究方向:视觉智能、认知计算;代松银(1980—),女,湖北荆州人,副教授,博士,主要研究方向:计算机视觉、图像处理;徐成(1988—),男,重庆人,副教授,博士,CCF会员,主要研究方向:数字仿真技术、数据安全。
  • 基金资助:
    国家自然科学基金资助项目(62572057);北京市自然科学基金资助项目(4242020,4232026)

Abstract: To address the problem that existing low-light image enhancement methods primarily focus on perceptual optimization while struggling to simultaneously support downstream high-level vision tasks such as object detection, an unsupervised two-stage method, termed DBDiff, was proposed by integrating diffusion priors with a detection-bridging network so as to improve both perceptual quality and task utility without requiring paired training data or manual detection annotations. In the first stage, exposure-guided pretrained diffusion sampling was employed to progressively restore a natural illumination distribution, while an Adaptive Homomorphic Filtering (AHF) module was introduced to perform frequency modulation during the reverse denoising process, thereby alleviating the attenuation of high-frequency texture details. In the second stage, an unsupervised bridging network composed of an Adaptive Weighted Fusion (AWF) module and a Task Alignment Module (TAM) was constructed. Specifically, cycle-consistency constraints were imposed to preserve content structural consistency, and mixup augmentation, pseudo-label-based self-training, and downstream detection loss were further combined to align the enhanced results with the input distribution preferred by the detector. Experimental results showed that, on the public ExDark and DarkFace datasets, DBDiff achieved Natural Image Quality Evaluator (NIQE) scores of 3.991 and 3.314, Blind/Referenceless Image Spatial Quality Evaluator (BRISQUE) scores of 21.796 and 12.053, and Perception-based Image Quality Evaluator (PIQE) scores of 36.150 and 21.262, respectively, yielding the best overall performance in no-reference quality evaluation. In downstream object detection experiments, the mAP50 was improved by 1.8 and 1.7 percentage points over the best competing methods, namely FourierDiff on ExDark and CLIP-LIT on DarkFace, respectively. These results demonstrated that the proposed method could effectively support downstream low-light vision tasks such as object detection while achieving natural exposure and faithful detail preservation.

Key words: low-light image enhancement, diffusion model, unsupervised learning, detection-oriented, domain alignment

摘要: 针对现有低光图像增强方法偏重视觉感知优化、难以兼顾目标检测等下游高级视觉任务性能的问题,提出一种将扩散先验与检测桥接网络相结合的无监督两阶段方法DBDiff,在无需配对训练数据和人工检测标注的条件下,提升感知质量和任务可用性。第一阶段,利用曝光引导的预训练扩散采样逐步恢复自然照明分布,并引入自适应同态滤波(AHF)模块,在反向去噪过程中进行频率调制,以缓解高频纹理细节衰减;第二阶段,构建由自适应加权融合(AWF)模块与任务对齐模块(TAM)组成的无监督桥接网络,通过循环一致性约束保持内容结构一致性,并结合mixup增强、伪标签自训练与下游检测损失,将增强结果对齐至检测器偏好的输入分布。实验结果表明,DBDiff在公开数据集ExDark和DarkFace上的NIQE(Natural Image Quality Evaluator)分别达到3.991和3.314,BRISQUE(Blind/Referenceless Image Spatial Quality Evaluator)分别达到21.796和12.053,PIQE(Perception-based Image Quality Evaluator)分别达到36.150和21.262,无参考评价指标综合表现最优。在下游目标检测实验中,平均精度均值mAP50较ExDark上的最优对比方法FourierDiff和DarkFace上的最优对比方法CLIP-LIT分别提高1.8和1.7个百分点,表明所提方法在实现自然曝光和真实细节的同时,能够有效支撑目标检测等下游低光视觉任务。

关键词: 低光图像增强, 扩散模型, 无监督学习, 检测导向, 域对齐

CLC Number: