• •    

基于自注意力网络的深度图匹配模型

徐周波,陈浦青,刘华东,杨欣   

  1. 桂林电子科技大学
  • 收稿日期:2022-03-21 修回日期:2022-05-31 发布日期:2022-06-29
  • 通讯作者: 陈浦青
  • 基金资助:
    国家自然科学基金资助项目;广西自然科学基金资助项目

A deep graph matching model via self-attention network

  • Received:2022-03-21 Revised:2022-05-31 Online:2022-06-29

摘要: 摘 要: 现有深度图匹配模型在结点特征提取阶段常利用图卷积网络来学习结点的特征向量。然而,图卷积网络对结点特征的学习能力有限,影响了结点特征的可区分性,造成结点的相似性度量不佳,从而导致模型的匹配精度受损。为解决这一问题,本文设计了一种基于自注意力网络的深度图匹配模型(GSAN-GM),该模型在结点特征提取阶段用新设计的自注意力网络(GSAN)来完成对结点特征的学习,其原理是通过空间编码器和自注意力机制来学习结点的空间结构和所有结点之间的联系,以改善结点的特征描述。此外,为了减少对图匹配问题放松所带来的精度损失,文中将图匹配问题建模为整数线性规划问题,在结点匹配的基础上增加了结构匹配约束,并引入高效的组合优化求解器来计算图匹配问题的局部最优解。本文在Willow Object和Pascal VOC数据集上与多个现有方法对比模型的匹配精度。实验结果表明,GSAN-GM模型具有较好的性能体现,并在多种图像的匹配任务上达到了目前最佳的效果。

关键词: 深度图匹配, 图匹配问题, 计算机视觉, 组合优化, 深度学习

Abstract: Abstract: Deep graph matching models always use graph convolution network to learn the feature of nodes in the stage of node feature extraction. However, graph convolution network has limited learning ability for node features, so it affects the distinguishability of node features which makes a bad result of similarity measure and leads to loss accuracy. To solve this problem, this paper designed a deep graph matching model via self-attention network (GSAN-GM), which used self-attention network (GSAN) based on spatial encoder to learn the features of nodes in the model phase of nodes feature extraction. The principle of GSAN was through a spatial encoder and a self-attention module to learn spatial features of nodes and the relationship between all nodes to enhance the node features. In order to reduce the accuracy loss caused by relaxing graph matching problem, in this paper, the problem was modelled as an integer linear programming problem that considers constraints of structure matching and utilized an efficient combinatorial optimization solver to compute the local optimal solution of graph matching problem. The accuracy of our model was compared with the state-of-the-art methods on Willow Object and Pascal VOC datasets. Experimental results show that GSAN-GM has better performance and achieves the best accuracy on matching tasks of multiple categories of images.

Key words: Keywords: deep graph matching, graph matching problem, computer vision, combinatorial optimization, deep learning

中图分类号: