基于双注意力机制和多尺度融合的点云分类与分割网络

doi:10.11772/j.issn.1001-9081.2024091254

《计算机应用》唯一官方网站

• • 下一篇

基于双注意力机制和多尺度融合的点云分类与分割网络

李维刚^1,2,邵佳乐¹,田志强²

1.武汉科技大学信息科学与工程学院 2.武汉科技大学冶金自动化与检测技术教育部工程研究中心

收稿日期:2024-09-04 修回日期:2024-10-16 发布日期:2024-10-31 出版日期:2024-10-31
通讯作者: 邵佳乐
作者简介:李维刚(1977—)，男，湖北咸宁人，教授，博士，主要研究方向：工业过程控制、人工智能、机器学习；邵佳乐(2000—)，男，河南南阳人，硕士研究生，主要研究方向：深度学习、点云数据处理；田志强(1996—)，男，湖北武汉人，博士研究生，主要研究方向：计算机视觉。
基金资助:
湖北省科技人才服务企业项目(202400288)

Point cloud classification and segmentation network based on dual attention mechanism and multi-scale fusion

LI Weigang^1,2, SHAO Jiale¹, TIAN Zhiqiang²

1. School of Information Science and Engineering, Wuhan University of Science and Technology 2. Engineering Research Center for Metallurgical Automation and Measurement Technology of Ministry of Education,
Wuhan University of Science and Technology

Received:2024-09-04 Revised:2024-10-16 Online:2024-10-31 Published:2024-10-31
About author:LI Weigang, born in 1977, Ph. D., professor. His research interests include industrial process control, artificial intelligence, machine learning. SHAO Jiale, born in 2000, M. S. candidate. His research interests include deep learning, point cloud data processing. TIAN Zhiqiang, born in 1996, Ph. D. candidate. His research interests include computer vision.
Supported by:
Hubei Provincial Science and Technology Talent Service Enterprise Project (202400288)

摘要/Abstract

摘要： 现有的网络难以有效学习点云局部几何形状信息，存在无法有效关注重要特征结构及融合不充分等问题。为此，提出一种基于双注意力机制和多尺度融合的点云分类与分割网络。首先，在数据特征提取阶段利用几何自适应卷积(GAC)动态地调整卷积核的几何位置和权重，使它能够动态适应点云数据的局部几何结构，从而更有效地捕捉局部特征；其次，为进一步提升特征表达能力，引入双注意力机制(DAM)自动学习并调整特征通道和空间信息的权重，从而增强关键点的特征表示；最后，连接不同尺度的特征信息进行有效融合，增强特征学习效果，使得最终的特征表示更加丰富，提高网络的分类分割精度。在ModelNet40、ShapeNet和S3DIS数据集上的实验结果表明，所提网络与PointNet++和DGCNN(Dynamic Graph Convolutional Neural Network)对比提高了总体分类精度(OA)和平均交并比(mIoU)，有效提升了点云分类与分割的性能。

关键词: 点云, 分类分割, 深度学习, 注意力机制, 特征融合

Abstract: Existing networks are difficult to effectively learn the local geometric shape information of point clouds, and there are problems such as being unable to effectively focus on important feature structures and insufficient fusion. Therefore, a point cloud classification and segmentation network based on dual attention mechanism and multi-scale fusion was proposed. Firstly, in the data feature extraction stage, the geometric position and weight of the convolution kernel are dynamically adjusted using Geometric Adaptive Convolution (GAC), so that it can dynamically adapt to the local geometric structure of point cloud data, thereby more effectively capturing local features. Secondly, in order to further improve the feature expression ability, the Dual Attention Mechanism (DAM) was introduced to automatically learn and adjust the weights of feature channels and spatial information, thereby enhancing the feature representation of key points. Finally, feature information of different scales was connected for effective fusion to enhance the feature learning effect, making the final feature representation richer and improving the classification and segmentation accuracy of the network. Experimental results on ModelNet40, ShapeNet and S3DIS datasets show that the proposed network improves the Overall Accuracy (OA) and mean Intersection over Union (mIoU) compared with PointNet++ and Dynamic Graph Convolutional Neural Network (DGCNN), and effectively improves the performance of point cloud classification and segmentation.

Key words: point cloud, classification and segmentation, deep learning, attention mechanism, feature fusion

中图分类号:

TP391.4

李维刚邵佳乐田志强. 基于双注意力机制和多尺度融合的点云分类与分割网络[J]. 计算机应用, DOI: 10.11772/j.issn.1001-9081.2024091254.

LI Weigang, SHAO Jiale, TIAN Zhiqiang. Point cloud classification and segmentation network based on dual attention mechanism and multi-scale fusion[J]. Journal of Computer Applications, DOI: 10.11772/j.issn.1001-9081.2024091254.

[1]	王利琴, 耿智雷, 李英双, 董永峰, 边萌. 基于路径和增强三元组文本的开放世界知识推理模型[J]. 《计算机应用》唯一官方网站, 2025, 45(4): 1177-1183.
[2]	郭诗月, 党建武, 王阳萍, 雍玖. 结合注意力机制和多尺度特征融合的三维手部姿态估计[J]. 《计算机应用》唯一官方网站, 2025, 45(4): 1293-1299.
[3]	潘理虎, 彭守信, 张睿, 薛之洋, 毛旭珍. 面向运动前景区域的视频异常检测[J]. 《计算机应用》唯一官方网站, 2025, 45(4): 1300-1309.
[4]	王一丁, 王泽浩, 李耀利, 蔡少青, 袁媛. 多尺度2D-Adaboost的中药材粉末显微图像识别算法[J]. 《计算机应用》唯一官方网站, 2025, 45(4): 1325-1332.
[5]	周阳, 李辉. 基于语义和细节特征双促进的遥感影像建筑物提取网络[J]. 《计算机应用》唯一官方网站, 2025, 45(4): 1310-1316.
[6]	胡婕, 郑启扬, 孙军, 张龑. 基于多标签关系图和局部动态重构学习的多标签分类模型[J]. 《计算机应用》唯一官方网站, 2025, 45(4): 1104-1112.
[7]	张李伟, 梁泉, 胡禹涛, 朱乔乐. 基于分组卷积的通道重洗注意力机制[J]. 《计算机应用》唯一官方网站, 2025, 45(4): 1069-1076.
[8]	姜坤元, 李小霞, 王利, 曹耀丹, 张晓强, 丁楠, 周颖玥. 引入解耦残差自注意力的边界交叉监督语义分割网络[J]. 《计算机应用》唯一官方网站, 2025, 45(4): 1120-1129.
[9]	徐春, 吉双焱, 马欢, 孙恩威, 王萌萌, 苏明钰. 基于知识图谱和对话结构的问诊推荐方法[J]. 《计算机应用》唯一官方网站, 2025, 45(4): 1157-1168.
[10]	周浩, 王超, 崔国恒, 罗廷金. 基于多语义关联与融合的视觉问答模型[J]. 《计算机应用》唯一官方网站, 2025, 45(3): 739-745.
[11]	耿海军, 董赟, 胡治国, 池浩田, 杨静, 尹霞. 基于Attention-1DCNN-CE的加密流量分类方法[J]. 《计算机应用》唯一官方网站, 2025, 45(3): 872-882.
[12]	薛振华, 李强, 黄超. 视觉基础模型驱动的像素级图像异常检测方法[J]. 《计算机应用》唯一官方网站, 2025, 45(3): 823-831.
[13]	陈瑞龙, 胡涛, 卜佑军, 伊鹏, 胡先君, 乔伟. 面向加密恶意流量检测模型的堆叠集成对抗防御方法[J]. 《计算机应用》唯一官方网站, 2025, 45(3): 864-871.
[14]	王地欣, 王佳昊, 李敏, 陈浩, 胡光耀, 龚宇. 面向水声通信网络的异常攻击检测[J]. 《计算机应用》唯一官方网站, 2025, 45(2): 526-533.
[15]	洪梓榕, 包广清. 基于集成学习的雷达自动目标识别综述[J]. 《计算机应用》唯一官方网站, 2025, 45(2): 371-382.

基于双注意力机制和多尺度融合的点云分类与分割网络

Point cloud classification and segmentation network based on dual attention mechanism and multi-scale fusion

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics