《计算机应用》唯一官方网站

• •    下一篇

基于自监督图像对的弱监督语义分割算法

  

  1. 1.中国科学院 成都计算机应用研究所

    2.中国科学院大学 计算机科学与技术学院

    3.哈尔滨工业大学(深圳) 国际人工智能研究院

    4.哈尔滨工业大学 重庆研究院

  • 收稿日期:2022-03-14 修回日期:2022-06-28 接受日期:2022-06-30 发布日期:2022-09-02 出版日期:2022-09-02
  • 通讯作者: 陈 斌
  • 作者简介:侯孝振(1995—),男,山东临沂人,硕士研究生,主要研究方向:弱监督的语义分割、视频内容理解;陈斌(1970—),男,四川广汉人,研究员,博士,CCF会员,主要研究方向:工业检测、深度学习。

Weakly supervised semantic segmentation algorithm based on self-supervised image pairs

#br#   

  1. 1.Chengdu Institute of Computer Applications, Chinese Academy of Sciences

    2.University of Chinese Academy of Sciences

    3.International Institute for Artificial Intelligence, Harbin Institute of Technology(Shenzhen)

    4.Chongqing Research Institute, Harbin Institute of Technology

  • Received:2022-03-14 Revised:2022-06-28 Accepted:2022-06-30 Online:2022-09-02 Published:2022-09-02

摘要: 为了减少人们在语义分割任务中的标注成本,提出了一种新的基于自监督图像对的弱监督语义分割算法Co-Net。首先,将一对图像分别输入到骨干网络中提取图片对特征;然后,将特征展开加入位置信息送入编码层中进行编码;接着,将编码特征送入到协同注意力模块(CoAM)以及自注意力模块(BiAM)中进行信息相互表征;最终,将图像区域掩码(MRM)以及图像对匹配(IPM)两种自监督任务用于网络训练,学习图像对中的全局关联以及局部关联,以此得到更加精确初始化种子。仅使用图像级标签进行弱监督语义分割,在Pascal VOC 2012验证和测试集上分别实现了69.8%和70.3%的mIoU,相较于同样为图像对输入的算法GroupWSSS(Group-Wise Semantic Mining for Weakly Supervised Semantic Segmentation),验证集和测试集上分别提高了1.6个百分点和1.8个百分点的mIoU。实验结果表明,所提算法可以获得更加完整的目标激活区域。

关键词: 语义分割, 弱监督学习, 自监督学习, 弱监督的语义分割, 深度学习

Abstract: In order to reduce people’s annotation cost in semantic segmentation tasks,a new weakly supervised semantic segmentation algorithm Co-Net based on self-supervised image pairs was proposed. Fistly,a pair of images were respectively input into  backbone network to extract image pair features. Secondly, expanded features were added to location information and sent to the encoding layer,and then the encoded features were fed into Collaborative Attention Module (CoAM) and Self-Attention Module(BiAM) for information mutual representation. Finally, two self-supervised tasks, image Region Masking (MRM) and Image Pair Matching (IPM) were used for network training to learn global and local associations in image pairs, so as to obtain more accurate initialization seeds. Weakly supervised semantic segmentation using only image-level labels achieves mIoU of 69.8% and 70.3% on Pascal VOC 2012 validation and test sets. respectively. Compared with the algorithm Group-Wise Semantic Mining for Weakly Supervised Semantic Segmentation(GroupWSSS), which is also input for image pairs, the mIoU is increased by 1.6 and 1.8 percentage points on the validation set and test set. The experimental results show that the proposed algorithm can obtain a more complete target activation area.

Key words: semantic segmentation, weakly supervised learning, self-supervised learning, weakly supervised semantic segmentation, deep learning

中图分类号: