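Representation learning method with hierarchical rotation-invariant geometric structure for point cloud classification and segmentation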

doi:10.11772/j.issn.1001-9081.2025070897

Journal of Computer Applications (《计算机应用》), sole official website

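Liu Wei (刘威)¹, Li Weigang (李维刚)², Tian Zhiqiang (田志强)²

1. School of Electronic Information, Wuhan University of Science and Technology
2. Wuhan University of Science and Technology

Received: 2025-08-07 Revised: 2025-10-15 Online: 2025-11-05 Published: 2025-11-05
Corresponding author: Liu Wei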

Abstract

Existing deep learning methods for point clouds handle data captured from a fixed viewpoint effectively. In practice, however, object orientation varies, so the point cloud description is subject to rotational transformations, which degrades the recognition accuracy of deep networks. To address this problem, a representation learning method based on hierarchical rotation-invariant geometric structure is proposed. First, point cloud samples are modeled with triangulated local geometric structures: triangular surfaces are constructed within the neighborhood of each point, and rotation-invariant features describing the geometric relationship between Euclidean space and the tangent plane are extracted. Next, the extracted features are expressed through convolution operators, and self-attention-enhanced convolution aggregates local neighborhood structures, adaptively fusing local and global information to refine the rotation-invariant features and strengthen their expressiveness and global consistency. Finally, a hierarchical inverted-bottleneck residual multilayer perceptron (MLP) module is introduced. Through multi-level nonlinear mapping and progressive channel expansion, it hierarchically fuses shallow geometric features with deep semantic features, enhancing the high-order expressiveness and discriminative power of the rotation-invariant features and improving the ability to represent and distinguish complex spatial structures under diverse rotations. The proposed method achieves an overall accuracy (OA) of 93.9% on the ModelNet40 dataset and 87.8% on the ScanObjectNN dataset, and a mean intersection over union (mIoU) of 82.3% on the ShapeNet segmentation task. The experimental results show that the proposed method delivers strong classification and segmentation performance while remaining rotation-invariant, and that it is robust and generalizes well.
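The abstract does not list the exact rotation-invariant features used, so the following is only a minimal illustrative sketch of the general idea, not the authors' method. It assumes each point forms a triangle with its two nearest neighbors and describes that triangle by scalars that cannot change under rotation (edge lengths, the cosine of the angle at the point, and the triangle area). The function names `triangle_features` and `random_rotation` are hypothetical, and the tangent-plane relationships mentioned in the abstract are not modeled here.

```python
import numpy as np


def triangle_features(points):
    """Per-point rotation-invariant scalars from a local triangle.

    Assumption (not from the paper): the triangle is formed by the point
    and its two nearest neighbours. Returns an (N, 5) array holding the
    three edge lengths, the cosine of the angle at the point, and the area.
    """
    # Pairwise Euclidean distances, shape (N, N).
    diff = points[:, None, :] - points[None, :, :]
    dist = np.linalg.norm(diff, axis=-1)
    # Column 0 of the argsort is the point itself (distance 0); take the next two.
    idx = np.argsort(dist, axis=1)[:, 1:3]
    a = points[idx[:, 0]] - points  # edge from the point to its 1st neighbour
    b = points[idx[:, 1]] - points  # edge from the point to its 2nd neighbour
    la = np.linalg.norm(a, axis=1)
    lb = np.linalg.norm(b, axis=1)
    lc = np.linalg.norm(a - b, axis=1)  # edge between the two neighbours
    cos_angle = np.sum(a * b, axis=1) / (la * lb + 1e-12)
    area = 0.5 * np.linalg.norm(np.cross(a, b), axis=1)
    return np.stack([la, lb, lc, cos_angle, area], axis=1)


def random_rotation(rng):
    """Draw a random 3x3 rotation matrix via QR decomposition."""
    q, r = np.linalg.qr(rng.normal(size=(3, 3)))
    q = q * np.sign(np.diag(r))  # fix column signs so the factorisation is unique
    if np.linalg.det(q) < 0:     # flip one column to turn a reflection into a rotation
        q[:, 0] = -q[:, 0]
    return q


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    cloud = rng.normal(size=(64, 3))        # synthetic stand-in for a point cloud
    rotated = cloud @ random_rotation(rng).T
    gap = np.abs(triangle_features(cloud) - triangle_features(rotated)).max()
    print("max feature difference after rotation:", gap)
```

Because every feature is built from lengths and inner products of difference vectors, rotating the whole cloud leaves the feature array unchanged up to floating-point error, which is the property a rotation-invariant network input needs. The dense pairwise distance matrix is chosen for brevity; a spatial index would be the more practical choice for large clouds.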

CLC number: TP391.4

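Cite this article: Liu Wei, Li Weigang, Tian Zhiqiang. Representation learning method with hierarchical rotation-invariant geometric structure for point cloud classification and segmentation[J]. Journal of Computer Applications, DOI: 10.11772/j.issn.1001-9081.2025070897.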
