Journal of Computer Applications ›› 2022, Vol. 42 ›› Issue (10): 3184-3190.DOI: 10.11772/j.issn.1001-9081.2021081475

• Multimedia computing and computer simulation • Previous Articles    

Hardware reconstruction acceleration method of convolutional neural network-based single image defogging model

Guanjun WANG1, Chunlian JIAN2, Qiang XIANG1   

  1. 1.College of Electronic and Information,Southwest Minzu University,Chengdu Sichuan 610041,China
    2.Chengdu Powerview Science and Technology Company Limited,Chengdu Sichuan 610046,China
  • Received:2021-08-18 Revised:2021-11-22 Accepted:2021-11-24 Online:2022-01-07 Published:2022-10-10
  • Contact: Qiang XIANG
  • About author:WANG Guanjun, born in 1990, M. S. candidate. His research interests include image processing, computer vision, embedded system.
    JIAN Chunlian, born in 1988, engineer. Her research interests include image processing, computer vision.
    XIANG Qiang, born in 1973, Ph. D. , associate professor. His research interests include signal and information processing, embedded system.
  • Supported by:
    Key Research and Development of Science and Technology Department of Sichuan Province(22ZDYF3125);Innovative Scientific Research Project for Graduates of Southwest Minzu University(CX2021SZ46)

基于卷积神经网络的单图像去雾模型硬件重构加速方法

王官军1, 简春莲2, 向强1   

  1. 1.西南民族大学 电子信息学院,成都 610041
    2.成都动力视讯科技股份有限公司,成都 610046
  • 通讯作者: 向强
  • 作者简介:第一联系人:王官军(1990—),男,四川内江人,硕士研究生,主要研究方向:图像处理、计算机视觉、嵌入式系统
    简春莲(1988—),女,四川达州人,工程师,主要研究方向:图像处理、计算机视觉
    向强(1973—),男,四川营山人,副教授,博士,主要研究方向:信号与信息处理、嵌入式系统。xiangqiang@swun.edu.cn
  • 基金资助:
    四川省科技厅重点研发项目(22ZDYF3125);西南民族大学研究生创新型科研项目(CX2021SZ46)

Abstract:

Single image defogging model based on Convolutional Neural Network (CNN) was difficult to deploy on mobile/embedded system and used for real-time video defogging. To solve this problem, a method of hardware reconstruction and acceleration was proposed, based on Zynq System-on-Chip (SoC). First, a quantization-dequantization algorithm was proposed to perform quantization on two representative defogging models; second, a quantized defogging model was reconstructed and a hardware IP core with Advanced eXtensible Interface 4 (AXI4) was generated, based on video stream memory architecture, hardware/software co-design, pipeline technology and High-Level Synthesis (HLS) tool. Experimental results show that the model parameters can be quantified from float32 to int5(5 bit) under premise of defogging performance, saving about 84.4% of storage space; the highest pixel clock frequency of the generated hardware IP core is 182 Mpixel/s, which can achieve 1080P@60 frame/s video defogging; the hardware IP core processes a single hazy image with the resolution of 640 pixel × 480 pixel only in 2.4 ms, and the on-chip power consumption is only 2.25 W. This hardware IP core with AXI4 is also convenient for cross-platform migration and deployment, which can expand application scope of CNN-based single image defogging model.

Key words: defogging, Video Direct Memory Access (VDMA), model quantization, model reconstruction, hardware IP core, High-Level Synthesis (HLS)

摘要:

针对基于卷积神经网络(CNN)的单图像去雾模型在移动/嵌入式端部署难,不易用做实时视频去雾的问题,提出一种基于Zynq片上系统(SoC)的去雾模型硬件重构加速方法。首先,提出量化?反量化算法,对两个代表去雾模型进行量化;其次,基于视频流存储器架构和软硬件协同、流水线等技术以及高级综合(HLS)工具,对量化后的去雾模型硬件重构并生成具有高性能扩展总线接口(AXI4)的硬件IP核。实验结果表明,在保证去雾效果的前提下,可以实现模型参数从float32到int5(5 bit)的量化,从而节省约84.4%的存储空间;所生成硬件IP核的最高像素时钟频率为182 Mpixel/s,能够实现1080P@60 frame/s的视频去雾;单帧640×480的雾图去雾仅需2.4 ms,而片上功耗仅为2.25 W。这种生成带有标准总线接口的硬件IP核也便于跨平台移植和部署,从而可以扩大这类去雾模型的应用范围。

关键词: 去雾, 视频直接存储器访问, 模型量化, 模型重构, 硬件IP核, 高级综合

CLC Number: