《计算机应用》唯一官方网站 ›› 2023, Vol. 43 ›› Issue (12): 3933-3940.DOI: 10.11772/j.issn.1001-9081.2022111687

• 多媒体计算与计算机仿真 • 上一篇    下一篇

自监督学习HOG预测辅助任务下的车位检测方法

刘磊1, 伍鹏1(), 谢凯1,2, 程贝芝1, 盛冠群3   

  1. 1.长江大学 电子信息学院, 湖北 荆州 434023
    2.长江大学 西部研究院, 新疆 克拉玛依 834000
    3.三峡大学 计算机与信息学院, 湖北 宜昌 443002
  • 收稿日期:2022-11-10 修回日期:2023-05-23 接受日期:2023-05-29 发布日期:2023-07-26 出版日期:2023-12-10
  • 通讯作者: 伍鹏
  • 作者简介:刘磊(2002—),男,山东青岛人,主要研究方向:图像处理、人工智能
    谢凯(1974—),男,湖北荆州人,教授,博士,主要研究方向:信号与信息处理、图像处理、人工智能
    程贝芝(2002—),女,湖北黄冈人,主要研究方向:图像处理、人工智能
    盛冠群(1987—),男,山东东营人,副教授,博士,主要研究方向:人工智能、信号与信息处理。
  • 基金资助:
    国家自然科学基金资助项目(42204111)

Parking space detection method based on self-supervised learning HOG prediction auxiliary task

Lei LIU1, Peng WU1(), Kai XIE1,2, Beizhi CHENG1, Guanqun SHENG3   

  1. 1.School of Electronic Information,Yangtze University,Jingzhou Hubei 434023,China
    2.Western Research Institute,Yangtze University,Karamay Xinjiang 834000,China
    3.College of Computer and Information Technology,China Three Gorges University,Yichang Hubei 443002,China
  • Received:2022-11-10 Revised:2023-05-23 Accepted:2023-05-29 Online:2023-07-26 Published:2023-12-10
  • Contact: Peng WU
  • About author:LIU Lei, born in 2002. His research interests include image processing,artificial intelligence.
    XIE Kai, born in 1974, Ph. D., professor. His research interests include signal and information processing, image processing, artificial intelligence.
    CHENG Beizhi, born in 2002. Her research interests include image processing, artificial intelligence.
    SHENG Guanqun, born in 1987, Ph. D., associate professor. His research interests include artificial intelligence, signal and information processing.
  • Supported by:
    National Natural Science Foundation of China(42204111)

摘要:

针对智能车位管理系统中,光照变化、车位遮挡等因素导致车位预测的精度下降、有效性变差的问题,提出一种自监督学习方向梯度直方图(HOG)预测辅助任务下的车位检测方法。首先,设计预测图像遮挡部分HOG特征的自监督学习辅助任务,利用MobileViTBlock(light-weight, general-purpose, and Mobile-friendly Vision Transformer Block)综合图像全局信息,使模型更充分地学习图像的视觉表征,并提高模型的特征提取能力;其次,改进SE(Squeeze-and-Excitation)注意力机制,使模型在更低的计算开销上达到甚至高于原始SE注意力机制的效果;最后,将辅助任务训练的特征提取部分应用于下游的分类任务进行车位状态预测,在PKLot和CNRPark的混合数据集上进行实验。实验结果表明,所提模型在测试集上的准确率达到了97.49%,相较于RepVGG,遮挡预测准确率提高了5.46个百分点,与其他的车位检测算法相比进步较大。

关键词: 智能停车系统, 自监督学习, 方向梯度直方图, 辅助任务, 车位状态预测

Abstract:

In the intelligent parking space management system, a decrease in accuracy and effectiveness of parking space prediction can be caused by factors such as illumination changes and parking space occlusion. To overcome this problem, a parking space detection method based on self-supervised learning HOG (Histogram of Oriented Gradient) prediction auxiliary task was proposed. Firstly, a self-supervised learning auxiliary task to predict the HOG feature in occluded part of image was designed, the visual representation of the image was learned more fully and the feature extraction ability of the model was improved by using the MobileViTBlock (light-weight, general-purpose, and Mobile-friendly Vision Transformer Block) to synthesize the global information of the image. Then, an improvement was made to the SE (Squeeze-and-Excitation) attention mechanism, thereby enabling the model to achieve or even exceed the effect of the original SE attention mechanism at a lower computational cost. Finally, the feature extraction part trained by the auxiliary task was applied to the downstream classification task for parking space status prediction. Experiments were carried out on the mixed dataset of PKLot and CNRPark. The experimental results show that the proposed model has the accuracy reached 97.49% on the test set; compared to RepVGG, the accuracy of occlusion prediction improves by 5.46 percentage points, which represents a great improvement compared with other parking space detection algorithms.

Key words: intelligent parking system, self-supervised learning, Histogram of Oriented Gradient (HOG), auxiliary task, parking space status prediction

中图分类号: