Journal of Computer Applications ›› 2026, Vol. 46 ›› Issue (2): 536-545.DOI: 10.11772/j.issn.1001-9081.2024121782

• Multimedia computing and computer simulation • Previous Articles    

Low-overlap point cloud registration network integrating position encoding and overlap masks

Xiaowei LA1, Lihua HU1(), Jianhua HU2, Xiaoling YAO1, Xinbo WANG2   

  1. 1.College of Computer Science and Technology,Taiyuan University of Science and Technology,Taiyuan Shanxi 030024,China
    2.Institute of Automation,Chinese Academy of Sciences,Beijing 100190,China
  • Received:2024-12-18 Revised:2025-05-14 Accepted:2025-05-15 Online:2025-06-05 Published:2026-02-10
  • Contact: Lihua HU
  • About author:LA Xiaowei, born in 2000, M. S. candidate. His research interests include computer vision, point cloud registration.
    HU Lihua, born in 1982, Ph. D., professor. Her research interests include computer vision, artificial intelligence, pattern recognition. Email:hlh@tyust.edu.cn
    HU Jianhua, born in 1987, Ph. D., associate research fellow. His research interests include intelligent robots,3D vision.
    YAO Xiaoling, born in 1993, Ph. D. candidate. Her research interests include computer vision, data mining.
    WANG Xinbo, born in 1987, Ph. D., associate research fellow. His research interests include 3D vision, robots.
  • Supported by:
    National Natural Science Foundation of China(62273248);Science and Technology Service Network Program of the Chinese Academy of Sciences(STS-HP-202202);Graduate Education Innovation Project of Taiyuan University of Science and Technology(BY2022017)

融合位置编码和重叠掩模的低重叠点云配准网络

喇孝伟1, 胡立华1(), 胡建华2, 姚晓玲1, 王欣波2   

  1. 1.太原科技大学 计算机科学与技术学院,太原 030024
    2.中国科学院 自动化研究所,北京 100190
  • 通讯作者: 胡立华
  • 作者简介:喇孝伟(2000—),男,河北张家口人,硕士研究生,主要研究方向:计算机视觉、点云配准
    胡立华(1982—),女,山西忻州人,教授,博士,CCF会员,主要研究方向:计算机视觉、人工智能、模式识别 Email:hlh@tyust.edu.cn
    胡建华(1987—),男,山西忻州人,副研究员,博士,主要研究方向:智能机器人、3D视觉
    姚晓玲(1993—),女,山西忻州人,博士研究生,主要研究方向:计算机视觉、数据挖掘
    王欣波(1987—),男,浙江嵊州人,副研究员,博士,主要研究方向:3D视觉、机器人。
  • 基金资助:
    国家自然科学基金资助项目(62273248);中国科学院科技服务网络计划项目(STS-HP-202202);太原科技大学研究生教育创新项目(BY2022017)

Abstract:

For the issues of low registration accuracy and high mismatch rate in low-overlap point cloud registration due to insufficient descriptive information of keypoint features and minimal overlapping regions, this paper proposed a low-overlap point cloud registration network that integrates position encoding and overlap masks was proposed to reduce mismatch rate and improve registration accuracy. First, a PointNet-based point-wise feature encoder was employed to extract keypoints, which were then enhanced by fusing their feature information, coordinate data, and position encoding to generate more discriminative keypoint descriptors. Second, the fused features were processed through self-attention and cross-attention modules to strengthen the descriptive power of point cloud features and enhance contextual interaction, thereby addressing the problem of insufficient keypoint descriptive information. Third, an overlap mask module was introduced after the attention modules to filter out keypoints from non-overlapping regions through learned masks, further reducing mismatch rate. Finally, optimal matching was achieved using the Sinkhorn algorithm, followed by refinement with the Iterative Closest Point (ICP) algorithm to enhance registration accuracy. Experimental results on the CODD and KITTI datasets, compared with various existing low-overlap point cloud registration methods, demonstrate that the network with ICP refinement performs superiorly. Specifically, on the CODD dataset, it reduces the Relative Translation Error (RTE) and Relative Rotation Error (RRE) by 53.29% and 42.72%, respectively, compared to the state-of-the-art method CoFiI2P (Coarse-to-Fine correspondences for Image-to-Point cloud registration), while improving the Registration Recall (RR) by 0.2 percentage points. The results indicate that the proposed network effectively extracts descriptive information from keypoint features and significantly improves point cloud registration accuracy in low-overlap scenarios.

Key words: low-overlap scenario, point cloud registration, position encoding, overlap mask, self-attention, cross-attention

摘要:

针对低重叠场景下点云配准方法存在的关键点特征描述信息不足和重叠点云区域较少,进而导致点云的误匹配率高以及配准精度低的问题,设计一种融合位置编码和重叠掩模的低重叠点云配准网络,以降低点云的误匹配率,并提高配准的精度。首先,采用PointNet逐点特征编码器提取点云关键点,并融合关键点的特征信息、坐标信息和位置编码,生成更具判别力的关键点特征;其次,将融合后的特征输入自注意力和交叉注意力模块,以增强点云特征的描述能力,加强点云的上下文信息交互,从而解决关键点描述信息不足的问题;再次,在注意力模块之后引入重叠掩模模块,通过学习重叠掩模去除非重叠区域的关键点,以降低误匹配率;最后,结合Sinkhorn算法进行最优匹配,并采用迭代最近邻点(ICP)算法进行细化,提高点云配准精度。在CODD数据集和KITTI数据集上与多种现有的低重叠点云配准方法进行对比的实验结果表明,经过ICP细化后的网络性能更优,特别是在CODD数据集上,它比当前先进的低重叠点云配准方法CoFiI2P (Coarse-to-Fine correspondences for Image-to-Point cloud registration)的相对平移误差(RTE)和相对旋转误差(RRE)分别降低了53.29%和42.72%,配准召回率(RR)提升了0.2个百分点。可见,该网络能充分提取关键点特征的描述信息,并有效提升低重叠场景下的点云配准精度。

关键词: 低重叠场景, 点云配准, 位置编码, 重叠掩模, 自注意力, 交叉注意力

CLC Number: