《计算机应用》唯一官方网站

• •    下一篇

面向点云分类分割的层次化旋转不变几何结构的表征学习方法

刘威1,李维刚2,田志强2   

  1. 1. 武汉科技大学电子信息学院
    2. 武汉科技大学
  • 收稿日期:2025-08-07 修回日期:2025-10-15 发布日期:2025-11-05 出版日期:2025-11-05
  • 通讯作者: 刘威

Representation learning method with hierarchical rotation-invariant geometric structure for point cloud classification and segmentation

  • Received:2025-08-07 Revised:2025-10-15 Online:2025-11-05 Published:2025-11-05

摘要: 现有点云深度学习方法能够有效处理固定视角下的点云数据,但在实际应用中,物体方向的变化会使点云描述受到旋转变换的影响,从而影响深度学习网络的识别精度。针对这一问题,提出一种层次化旋转不变几何结构的表征学习方法。首先,通过三角化的局部几何结构对点云样本进行建模,在每个点的邻域内构建三角表面,提取描述欧氏空间与切平面几何关系的旋转不变特征,然后将提取到的旋转不变特征通过卷积算子表达,并通过自注意力增强卷积聚合局部邻域结构,实现局部和全局信息的自适应融合,进一步提取精细的旋转不变特征并增强表达力和全局一致性。最后,引入层次化逆瓶颈残差模块,通过多级非线性映射和渐进式通道扩展,实现从浅层几何特征到深层语义特征的层次化融合,增强旋转不变特征的高阶表达能力和判断力,提升对复杂空间结构和多样旋转情况下的表达和区分能力。所提方法在ModelNet40数据集上实现了93.9%的整体分类准确率(OA),在ScanObjectNN数据集上实现了87.8%的整体分类准确率(OA),在ShapeNet数据集的分割任务中取得了82.3%的平均交并比(mIoU)。实验结果表明,所提方法具有良好的分类分割能力,同时兼具旋转不变性,表现出优异的鲁棒性和泛化能力。

Abstract: Existing point cloud deep learning methods were able to process point cloud data from fixed viewpoints. However, in practical applications, changes in object orientation affected the point cloud description by rotational transformations, thereby reducing recognition accuracy. To address this issue, a hierarchical rotation-invariant geometric structure representation learning method was proposed. First, point cloud samples were modeled using triangulated local geometric structures. A triangular surface was constructed within each neighborhood, and rotation-invariant features describing the geometric relationship between Euclidean space and the tangent plane were extracted. These extracted rotation-invariant features were then expressed through convolutional operators. Self-attention-enhanced convolutions were employed to aggregate local neighborhood structures, achieving adaptive fusion of local and global information. This further refined the rotation-invariant features and improved their global consistency. Finally, a hierarchical inverted residual multilayer perceptron module was introduced. Through multi-level nonlinear mapping and progressive channel expansion, shallow geometric features were hierarchically fused with deep semantic features, enhancing the high-level expressiveness and discriminative power of rotation-invariant features, and improving the ability to express and distinguish complex spatial structures and diverse rotations. The proposed method achieved an Overall Accuracy (OA) of 93.9% on the ModelNet40 dataset, an Overall Accuracy (OA) of 87.8% on the ScanObjectNN dataset, and achieved a mean Intersection over Union(mIoU) of 82.3% in the segmentation task of the ShapeNet dataset. The experimental results show that the proposed method demonstrates strong classification and segmentation performance, maintains rotation-invariance, and exhibits excellent robustness and generalization capabilities.

中图分类号: