基于多特征融合的自监督图像配准算法

doi:10.11772/j.issn.1001-9081.2023050692

《计算机应用》唯一官方网站 ›› 2024, Vol. 44 ›› Issue (5): 1597-1604.DOI: 10.11772/j.issn.1001-9081.2023050692

• 多媒体计算与计算机仿真 • 上一篇

基于多特征融合的自监督图像配准算法

韩贵金¹, 张馨渊¹(), 张文涛², 黄娅¹

^1.西安邮电大学自动化学院，西安 710121
^2.中国建筑第八工程局有限公司西南分公司，成都 610041

收稿日期:2023-06-01 修回日期:2023-08-18 接受日期:2023-08-21 发布日期:2023-08-28 出版日期:2024-05-10
通讯作者: 张馨渊
作者简介:韩贵金（1978—），男，河南濮阳人，副教授，博士，CCF会员，主要研究方向：图像处理、计算机视觉
张文涛（1984—），男，四川成都人，高级工程师，主要研究方向：深度学习、图像处理
黄娅（1998—），女，河南濮阳人，硕士研究生，主要研究方向：计算机视觉、图像分割。
第一联系人：张馨渊（1996—），男，河北沧州人，硕士研究生，主要研究方向：图像配准、图像融合
基金资助:
陕西省科技厅重点研发计划项目(2023?YBGY?032)

Self-supervised image registration algorithm based on multi-feature fusion

Guijin HAN¹, Xinyuan ZHANG¹(), Wentao ZHANG², Ya HUANG¹

^1.School of Automation，Xi’an University of Posts & Telecommunications，Xi’an Shaanxi 710121，China
^2.Southwest Branch of China Construction Eighth Engineering Bureau Company Limited，Chengdu Sichuan 610041，China

Received:2023-06-01 Revised:2023-08-18 Accepted:2023-08-21 Online:2023-08-28 Published:2024-05-10
Contact: Xinyuan ZHANG
About author:HAN Guijin， born in 1978， Ph. D.， associate professor. His research interests include image processing， computer vision.
ZHANG Wentao， born in 1984， senior engineer. His research interests include deep learning， image processing.
HUANG Ya， born in 1998， M. S. candidate. Her research interests include computer vision， image segmentation.
Supported by:
Key Research and Development Plan of Shaanxi Provincial Science and Technology Department(2023-YBGY-032)

摘要/Abstract

摘要：

为保证提取特征的信息量丰富，当前基于深度学习的图像配准算法通常采用深层卷积神经网络，模型的计算复杂度高，而且还存在相似特征点区分度低的问题。针对上述问题，提出一种基于多特征融合的自监督图像配准算法（SIRA-MFF）。首先，使用浅层卷积神经网络提取图像特征，降低计算复杂度，并且通过在特征提取层添加特征点方向描述符，弥补浅层网络特征信息量单一的问题；其次，在特征提取层后添加用于扩大特征点感受野的嵌入与交互层，融合特征点局部和全局信息以提升相似特征点区分度；最终，最佳匹配方案由改进的特征匹配层计算得到，并同步设计了一种基于交叉熵的损失函数用于模型训练。在ILSVRC2012数据集生成的2个测试集中，SIRA-MFF的平均匹配准确率（AMA）分别为95.18%和93.26%，优于对比算法；在IMC-PT-SparseGM-50测试集中，SIRA-MFF的AMA为89.69%，也优于对比算法，且与ResMtch算法相比，单张图像运算时间降低了49.45%。实验结果表明，SIRA-MFF具有较高精度和较强的鲁棒性。

关键词: 图像配准, 自监督学习, 特征融合, 特征描述符, 特征嵌入

Abstract:

To ensure that extracted features contain rich information， current deep learning-based image registration algorithms usually employ deep convolutional neural networks， which have high computational complexity and low discrimination of similar feature points. To address the above issues， a Self-supervised Image Registration Algorithm based on Multi-Feature Fusion （SIRA-MFF） was proposed. First， shallow convolutional neural networks were used to extract image features and reduce the computational complexity. Moreover， the problem of single feature information in shallow networks was remedied by adding feature point direction descriptors to the feature extraction layer. Second， an embedding and interaction layer was added after the feature extraction layer to enlarge the receptive field of feature points， by which local and global information of feature points was fused to improve the discrimination of similar feature points. Finally， the feature matching layer was optimized to obtain the best matching scheme. A cross-entropy based loss function was also designed for model training. The SIRA-MFF achieved the Average Matching Accuracy （AMA） of 95.18% and 93.26% on the two test sets generated from the ILSVRC2012 dataset， which was better than comparison algorithms. In the IMC-PT-SparseGM-50 test set， the SIRA-MFF achieved the AMA of 89.69%， which was also better than comparison algorithms； and compared to ResMtch algorithm， SIRA-MFF decreased the operation time of a single image by 49.45%. Experimental results show that SIRA-MFF has higher accurate and stronger robust.

Key words: image registration, self-supervised learning, feature fusion, feature descriptor, feature embedding

中图分类号:

TP391.41

韩贵金, 张馨渊, 张文涛, 黄娅. 基于多特征融合的自监督图像配准算法[J]. 计算机应用, 2024, 44(5): 1597-1604.

Guijin HAN, Xinyuan ZHANG, Wentao ZHANG, Ya HUANG. Self-supervised image registration algorithm based on multi-feature fusion[J]. Journal of Computer Applications, 2024, 44(5): 1597-1604.

图/表 10

参考文献 27

1	LOWE G D. Distinctive image features from scale-invariant key-points［J］. International Journal of Computer Vision， 2004， 60（2）： 91-110. 10.1023/b:visi.0000029664.99615.94
2	MUR-ARTAL R， MOTEIL J M M， TARDÓS J D. ORB-SLAM： a versatile and accurate monocular SLAM system［J］. IEEE Transactions on Robotics， 2015， 31（5）：1147-1163. 10.1109/tro.2015.2463671
3	FISCHLER M A， BOLLES R C. Random sample consensus： a paradigm for model fitting with applications to image analysis and automated cartography［M］// Readings in Computer Vision： Issues， Problems， Principles， and Paradigms. San Francisco： Morgan Kaufmann Publishers Inc.， 1987： 726-740. 10.1016/b978-0-08-051581-6.50070-2
4	RAGURAM R， J-M FRAHN， POLLEFEYS M. A comparative analysis of RANSAC techniques leading to adaptive real-time random sample consensus［C］// Proceedings of the 10th European Conference on Computer Vision. Berlin： Springer， 2008： 500-513. 10.1007/978-3-540-88688-4_37
5	GOLD S， RANGARAJAN A. A graduated assignment algorithm for graph matching［J］. IEEE Transactions on Pattern Analysis and Machine Intelligence， 1996， 18（4）： 377-388. 10.1109/34.491619
6	TIAN Y， YAN J， ZHANG H， et al. On the convergence of graph matching： graduated assignment revisited［C］// Proceedings of the 12th European Conference on Computer Vision. Berlin： Springer， 2012： 821-835. 10.1007/978-3-642-33712-3_59
7	贾雯晓，张贵仓，汪亮亮，等.基于SIFT和改进的RANSAC图像配准算法［J］.计算机工程与应用，2018，54（2）：203-207. 10.3778/j.issn.1002-8331.1707-0264
	JIA W X， ZHANG G C， WANG L L， et al. Image registration algorithm based on SIFT and improved RANSAC［J］. Computer Engineering and Applications， 2018， 54（2）： 203-207. 10.3778/j.issn.1002-8331.1707-0264
8	LEORDEANU M， HEBERT M. A spectral technique for correspondence problems using pairwise constraints［C］// Proceedings of the Tenth IEEE International Conference on Computer Vision. Piscataway： IEEE， 2005： 1482-1489. 10.1109/iccv.2005.20
9	樊玮，王慧敏，邢艳.基于自编码器的多视图属性网络表示学习模型［J］.计算机应用，2021，41（4）：1064-1070. 10.11772/j.issn.1001-9081.2020061006
	FAN W， WANG H M， XING Y. Auto-encoder based multi-view attributed network representation learning model［J］. Journal of Computer Applications， 2021， 41（4）： 1064-1070. 10.11772/j.issn.1001-9081.2020061006
10	ZANFIR A， SMINCHISESCU C. Deep learning of graph matching［C］// Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2018： 2684-2693. 10.1109/cvpr.2018.00284
11	SIMONYAN K， ZISSERMAN A. Very deep convolutional networks for large-scale image recognition ［EB/OL］. ［2023-05-01］. .
12	TYSZKIEWICZ M J， FUA P， TRULLS E. DISK： learning local features with policy gradient ［EB/OL］. （2022-06-24）［2023-05-01］. . 10.1109/iccv51070.2023.00203
13	LEE J， JEONG Y， KIM S， et al. Learning to distill convolutional features into compact local descriptors［C］// Proceedings of the 2021 IEEE Winter Conference on Applications of Computer Vision. Piscataway： IEEE， 2021： 897-907. 10.1109/wacv48630.2021.00094
14	徐少康，张战成，姚浩男，等.基于姿态编码器的2D/3D脊椎医学图像实时配准方法［J］.计算机应用，2023，43（2）：589-594. 10.11772/j.issn.1001-9081.2021122147
	XU S K， ZHANG Z C， YAO H N， et al. 2D/3D spine medical image real-time registration method based on pose encoder［J］. Journal of Computer Applications， 2023， 43（2）：589-594. 10.11772/j.issn.1001-9081.2021122147
15	WANG R， YAN J， YANG X. Combinatorial learning of robust deep graph matching： an embedding based approach［J］. IEEE Transactions on Pattern Analysis and Machine Intelligence， 2023， 45（6）：6984-7000. 10.1109/tpami.2020.3005590
16	SHEN X， WANG C， LI X， et al. RF-Net： an end-to-end image matching network based on receptive field［C］// Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2019： 8124-8132. 10.1109/cvpr.2019.00832
17	WANG F-D， XUE N， ZHANG Y， et al. A functional representation for graph matching［J］. IEEE Transactions on Pattern Analysis and Machine Intelligence， 2020， 42（11）： 2737-2754.
18	VINIAVSKYI O， DOBKO M， MISHKIN D， et al. OpenGlue： open-source graph neural net based pipeline for image matching ［EB/OL］. ［2023-05-01］. .
19	LIU H， WANG T， LI Y， et al. Deep probabilistic graph matching ［EB/OL］. （2022-01-05）［2023-05-01］. . 10.1158/0008-5472.sabcs12-p4-05-01
20	JIANG W， TRULLS E， HOSANG J， et al. COTR： correspondence transformer for matching across images［C］// Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision. Piscataway： IEEE， 2021： 6187-6197. 10.1109/iccv48922.2021.00615
21	LIN Y， YANG M， JUN Y， et. al. Graph matching with bi-level noisy correspondence ［EB/OL］. （2022-12-08）［2023-05-01］. . 10.1109/iccv51070.2023.02135
22	DENG Y， MA J. ResMatch： residual attention learning for local feature matching ［EB/OL］. ［2023-05-01］. . 10.1609/aaai.v38i2.27915
23	HE K， ZHANG X， REN S， et al. Deep residual learning for image recognition［C］// Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2016： 770-778. 10.1109/cvpr.2016.90
24	LAGUNA A B， RIBA E， PONSA D， et al. Key.Net： key-point detection by handcrafted and learned CNN filters［C］// Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision. Piscataway： IEEE， 2019： 5835-5843. 10.1109/iccv.2019.00593
25	ZHOU J， CUI G， HU S， et al. Graph neural networks： a review of methods and applications ［EB/OL］. ［2021-10-06］. .
26	LIN T-Y， GOYAL P， GIRSHICK R， et al. Focal loss for dense object detection［C］// Proceedings of the 2017 IEEE International Conference on Computer Vision. Piscataway： IEEE， 2017： 2999-3001. 10.1109/iccv.2017.324
27	WANG R， GUO Z， JIANG S， et.al. Deep learning of partial graph matching via differentiable top-k ［C］// Proceedings of the 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway： IEEE， 2023： 6272-6281. 10.1109/cvpr52729.2023.00607

算法	ILSVRC2012测试集l				ILSVRC2012测试集2
算法	AMA/%	AEPE	MS/%	计算时间/s	AMA/%	AEPE	MS/%	计算时间/s
SIFT	31.94	125.47	7.79	1.182	29.89	146.36	7.79	1.056
GMN	54.08	32.46	18.36	1.016	42.58	46.52	18.36	0.981
PCA-GM	62.15	29.37	21.46	1.134	53.42	32.26	21.46	1.079
OpenGlue	89.55	16.32	24.89	0.703	83.97	22.47	25.89	0.686
COTR	91.64	8.85	25.46	17.251	87.91	9.46	24.46	18.027
COMMON	89.43	7.99	26.54	1.388	82.44	10.27	25.81	1.752
ResMatch	90.27	6.32	24.11	0.329	88.87	7.95	22.76	0.316
SIRA-MFF	95.18	6.37	25.87	0.206	93.26	7.26	24.91	0.197

算法	ILSVRC2012测试集l				ILSVRC2012测试集2
算法	AMA/%	AEPE	MS/%	计算时间/s	AMA/%	AEPE	MS/%	计算时间/s
SIFT	31.94	125.47	7.79	1.182	29.89	146.36	7.79	1.056
GMN	54.08	32.46	18.36	1.016	42.58	46.52	18.36	0.981
PCA-GM	62.15	29.37	21.46	1.134	53.42	32.26	21.46	1.079
OpenGlue	89.55	16.32	24.89	0.703	83.97	22.47	25.89	0.686
COTR	91.64	8.85	25.46	17.251	87.91	9.46	24.46	18.027
COMMON	89.43	7.99	26.54	1.388	82.44	10.27	25.81	1.752
ResMatch	90.27	6.32	24.11	0.329	88.87	7.95	22.76	0.316
SIRA-MFF	95.18	6.37	25.87	0.206	93.26	7.26	24.91	0.197

算法	AMA/%	AEPE/%	MS/%	计算时间/s
SIFT	30.27	137.42	7.79	1.160
GMN	53.26	42.36	18.36	0.926
PCA-GM	76.82	37.42	21.46	1.125
OpenGlue	88.57	23.58	24.89	0.625
COTR	91.25	7.98	24.91	18.367
COMMON	84.62	8.87	26.31	1.427
ResMatch	82.03	38.91	16.84	0.364
SIRA-MFF	89.69	7.86	25.16	0.184

算法	AMA/%	AEPE/%	MS/%	计算时间/s
SIFT	30.27	137.42	7.79	1.160
GMN	53.26	42.36	18.36	0.926
PCA-GM	76.82	37.42	21.46	1.125
OpenGlue	88.57	23.58	24.89	0.625
COTR	91.25	7.98	24.91	18.367
COMMON	84.62	8.87	26.31	1.427
ResMatch	82.03	38.91	16.84	0.364
SIRA-MFF	89.69	7.86	25.16	0.184

Key.Net	DFE			特征匹配		EIL	AMA/%
Key.Net	32	128	256	ED	PML	EIL	AMA/%
√				√			14.42
√					√		23.72
	√				√		49.26
		√			√		58.84
			√		√		66.39
			√		√	√	89.69

基于多特征融合的自监督图像配准算法

Self-supervised image registration algorithm based on multi-feature fusion

RichHTML

PDF

可视化

摘要/Abstract

引用本文

使用本文

图/表 10

参考文献 27

相关文章 15

编辑推荐

Metrics

[1]	李鸿天, 史鑫昊, 潘卫国, 徐成, 徐冰心, 袁家政. 融合多尺度和注意力机制的小样本目标检测[J]. 《计算机应用》唯一官方网站, 2024, 44(5): 1437-1444.
[2]	汪炅, 唐韬韬, 贾彩燕. 无负采样的正样本增强图对比学习推荐方法PAGCL[J]. 《计算机应用》唯一官方网站, 2024, 44(5): 1485-1492.
[3]	黄荣, 宋俊杰, 周树波, 刘浩. 基于自监督视觉Transformer的图像美学质量评价方法[J]. 《计算机应用》唯一官方网站, 2024, 44(4): 1269-1276.
[4]	郑宇亮, 陈云华, 白伟杰, 陈平华. 融合事件数据和图像帧的车辆目标检测[J]. 《计算机应用》唯一官方网站, 2024, 44(3): 931-937.
[5]	吴宁, 罗杨洋, 许华杰. 基于多尺度特征融合的遥感图像语义分割方法[J]. 《计算机应用》唯一官方网站, 2024, 44(3): 737-744.
[6]	贾宗泽, 高鹏飞, 马应龙, 刘晓峰, 夏海鑫. 基于注意力机制的多特征融合对话行为层次化分类方法[J]. 《计算机应用》唯一官方网站, 2024, 44(3): 715-721.
[7]	蒋占军, 吴佰靖, 马龙, 廉敬. 多尺度特征和极化自注意力的Faster-RCNN水漂垃圾识别[J]. 《计算机应用》唯一官方网站, 2024, 44(3): 938-944.
[8]	李新叶, 侯晔凝, 孔英会, 燕志旗. 结合特征融合与增强注意力的少样本目标检测[J]. 《计算机应用》唯一官方网站, 2024, 44(3): 745-751.
[9]	黄巧玲, 郑伯川, 丁梓成, 吴泽东. 融合监督注意力模块和跨阶段特征融合的图像修复改进网络[J]. 《计算机应用》唯一官方网站, 2024, 44(2): 572-579.
[10]	黄子麒, 胡建鹏. 实体类别增强的汽车领域嵌套命名实体识别[J]. 《计算机应用》唯一官方网站, 2024, 44(2): 377-384.
[11]	石志良, 廖诗旗, 甘梓博, 祝少博. 三维桡骨成角楔形截骨术前自动规划算法[J]. 《计算机应用》唯一官方网站, 2024, 44(2): 588-594.
[12]	张雨宁, 阿布都克力木·阿布力孜, 梅悌胜, 徐春, 麦尔达娜·买买提热依木, 哈里旦木·阿布都克里木, 侯钰涛. 基于自监督特征提取的骨骼X线影像异常检测方法[J]. 《计算机应用》唯一官方网站, 2024, 44(1): 175-181.
[13]	朱志平, 杨燕, 王杰. 基于场景图感知的跨模态图像描述模型[J]. 《计算机应用》唯一官方网站, 2024, 44(1): 58-64.
[14]	杨昊, 张轶. 基于上下文信息和多尺度融合重要性感知的特征金字塔网络算法[J]. 《计算机应用》唯一官方网站, 2023, 43(9): 2727-2734.
[15]	刘欢, 吴亮红, 张侣, 陈亮, 周博文, 张红强. 基于特征双融合CenterNet的白细胞检测方法[J]. 《计算机应用》唯一官方网站, 2023, 43(8): 2602-2610.

算法	参数量/10³
PCA-GM	1 471.0
OpenGlue	940.0
COTR	166.0
COMMON	39.0
SIRA-MFF	6.7

算法	参数量/10³
PCA-GM	1 471.0
OpenGlue	940.0
COTR	166.0
COMMON	39.0
SIRA-MFF	6.7