基于自注意力网络的深度图匹配模型

• •

基于自注意力网络的深度图匹配模型

徐周波,陈浦青,刘华东,杨欣

桂林电子科技大学

收稿日期:2022-03-21 修回日期:2022-05-31 发布日期:2022-06-29
通讯作者: 陈浦青
基金资助:
国家自然科学基金资助项目;广西自然科学基金资助项目

A deep graph matching model via self-attention network

Received:2022-03-21 Revised:2022-05-31 Online:2022-06-29

摘要/Abstract

摘要： 摘要: 现有深度图匹配模型在结点特征提取阶段常利用图卷积网络来学习结点的特征向量。然而，图卷积网络对结点特征的学习能力有限，影响了结点特征的可区分性，造成结点的相似性度量不佳，从而导致模型的匹配精度受损。为解决这一问题，本文设计了一种基于自注意力网络的深度图匹配模型(GSAN-GM)，该模型在结点特征提取阶段用新设计的自注意力网络(GSAN)来完成对结点特征的学习，其原理是通过空间编码器和自注意力机制来学习结点的空间结构和所有结点之间的联系，以改善结点的特征描述。此外，为了减少对图匹配问题放松所带来的精度损失，文中将图匹配问题建模为整数线性规划问题，在结点匹配的基础上增加了结构匹配约束，并引入高效的组合优化求解器来计算图匹配问题的局部最优解。本文在Willow Object和Pascal VOC数据集上与多个现有方法对比模型的匹配精度。实验结果表明，GSAN-GM模型具有较好的性能体现，并在多种图像的匹配任务上达到了目前最佳的效果。

关键词: 深度图匹配, 图匹配问题, 计算机视觉, 组合优化, 深度学习

Abstract: Abstract: Deep graph matching models always use graph convolution network to learn the feature of nodes in the stage of node feature extraction. However, graph convolution network has limited learning ability for node features, so it affects the distinguishability of node features which makes a bad result of similarity measure and leads to loss accuracy. To solve this problem, this paper designed a deep graph matching model via self-attention network (GSAN-GM), which used self-attention network (GSAN) based on spatial encoder to learn the features of nodes in the model phase of nodes feature extraction. The principle of GSAN was through a spatial encoder and a self-attention module to learn spatial features of nodes and the relationship between all nodes to enhance the node features. In order to reduce the accuracy loss caused by relaxing graph matching problem, in this paper, the problem was modelled as an integer linear programming problem that considers constraints of structure matching and utilized an efficient combinatorial optimization solver to compute the local optimal solution of graph matching problem. The accuracy of our model was compared with the state-of-the-art methods on Willow Object and Pascal VOC datasets. Experimental results show that GSAN-GM has better performance and achieves the best accuracy on matching tasks of multiple categories of images.

Key words: Keywords: deep graph matching, graph matching problem, computer vision, combinatorial optimization, deep learning

中图分类号:

TP391

徐周波陈浦青刘华东杨欣. 基于自注意力网络的深度图匹配模型[J]. 计算机应用.

[1]	杨先凤, 汤依磊, 李自强. 基于交替注意力机制和图卷积网络的方面级情感分析模型[J]. 《计算机应用》唯一官方网站, 2024, 44(4): 1058-1064.
[2]	王铂越, 李英祥, 钟剑丹. 基于改进Res-UNet的昼夜地基云图分割网络[J]. 《计算机应用》唯一官方网站, 2024, 44(4): 1310-1316.
[3]	万泽轩, 谢春丽, 吕泉润, 梁瑶. 基于依赖增强的分层抽象语法树的代码克隆检测[J]. 《计算机应用》唯一官方网站, 2024, 44(4): 1259-1268.
[4]	唐睿, 岳士博, 张睿智, 刘川, 庞川林. UAV协助下非正交多址接入使能的数据采集系统中能效优化机制[J]. 《计算机应用》唯一官方网站, 2024, 44(4): 1209-1218.
[5]	孙祥杰, 魏强, 王奕森, 杜江. 代码相似性检测技术综述[J]. 《计算机应用》唯一官方网站, 2024, 44(4): 1248-1258.
[6]	张鹏飞, 韩李涛, 冯恒健, 李洪梅. 基于注意力机制和全局特征优化的点云语义分割[J]. 《计算机应用》唯一官方网站, 2024, 44(4): 1086-1092.
[7]	董炜娜, 刘佳, 潘晓中, 陈立峰, 孙文权. 基于编码-解码网络的大容量鲁棒图像隐写方案[J]. 《计算机应用》唯一官方网站, 2024, 44(3): 772-779.
[8]	赵奎, 仇慧琪, 李旭, 徐知非. 结合注意力和多路径融合的实时肺结节检测算法[J]. 《计算机应用》唯一官方网站, 2024, 44(3): 945-952.
[9]	李雨秋, 侯利萍, 薛健, 吕科, 王泳. 基于内容解译的遥感图像推荐方法[J]. 《计算机应用》唯一官方网站, 2024, 44(3): 722-731.
[10]	蔡美玉, 朱润哲, 吴飞, 张开昱, 李家乐. 基于注意力机制和多粒度特征融合的跨视角匹配模型[J]. 《计算机应用》唯一官方网站, 2024, 44(3): 901-908.
[11]	唐瑶瑶, 朱叶晨, 刘仰川, 高欣. CT图像环形伪影去除方法研究现状及展望[J]. 《计算机应用》唯一官方网站, 2024, 44(3): 890-900.
[12]	徐大鹏, 侯新民. 基于网络结构设计的图神经网络特征选择方法[J]. 《计算机应用》唯一官方网站, 2024, 44(3): 663-670.
[13]	张家伟, 高冠东, 肖珂, 宋胜尊. 基于改进分层注意网络和TextCNN联合建模的暴力犯罪分级算法[J]. 《计算机应用》唯一官方网站, 2024, 44(2): 403-410.
[14]	刘祥, 华蓓, 林飞, 魏宏原. 面向深度学习应用的组件式开发框架的设计实现[J]. 《计算机应用》唯一官方网站, 2024, 44(2): 526-535.
[15]	宋钰丹, 王晶, 王雪徽, 马朝阳, 林友芳. 基于自适应多任务学习的睡眠生理时序分类方法[J]. 《计算机应用》唯一官方网站, 2024, 44(2): 654-662.