Journal of Computer Applications ›› 2026, Vol. 46 ›› Issue (5): 1534-1544.DOI: 10.11772/j.issn.1001-9081.2025050575

• Multimedia computing and computer simulation • Previous Articles    

VU-RED-F: improved CAD model replacement for U-RED single-view point clouds

Gengxin FAN1,2,3, Huiyan HAN1,2,3(), Liqun KUANG1,2,3, Ziyang JIN1,2,3, Huafeng ZHAO1,2,3   

  1. 1.School of Computer Science and Technology,North University of China,Taiyuan Shanxi 030051,China
    2.Shanxi Key Laboratory of Machine Vision and Virtual Reality (North University of China),Taiyuan Shanxi 030051,China
    3.Shanxi Provincial Vision Information Processing and Intelligent Robot Engineering Research Center,Taiyuan Shanxi 030051,China
  • Received:2025-05-27 Revised:2025-09-26 Accepted:2025-10-10 Online:2025-10-17 Published:2026-05-10
  • Contact: Huiyan HAN
  • About author:FAN Gengxin, born in 1999, M. S. candidate. Her research interests include point cloud reconstruction.
    KUANG Liqun, born in 1976, Ph. D., professor. His research interests include artificial intelligence, computer vision.
    JIN Ziyang, born in 2000, M. S. candidate. Her research interests include 3D reconstruction.
    ZHAO Huafeng, born in 2001, M. S. candidate. His research interests include computer vision.
  • Supported by:
    Shanxi Province Natural Science Foundation(202303021211153);Shanxi Province Major Science and Technology Plan “Unveiling the List and Assigning the Leader” Project(202201150401021);Shanxi Province Graduate Research Innovation Project(2024SJ279)

VU-RED-F:改进U-RED的单视角点云CAD模型替换

樊耿鑫1,2,3, 韩慧妍1,2,3(), 况立群1,2,3, 晋紫阳1,2,3, 赵华峰1,2,3   

  1. 1.中北大学 计算机科学与技术学院,太原 030051
    2.机器视觉与虚拟现实山西省重点实验室(中北大学),太原 030051
    3.山西省视觉信息处理及智能机器人工程研究中心,太原 030051
  • 通讯作者: 韩慧妍
  • 作者简介:樊耿鑫(1999—),女,山西运城人,硕士研究生,CCF会员,主要研究方向:点云重建
    况立群(1976—),男,江西高安人,教授,博士,CCF会员,主要研究方向:人工智能、计算机视觉
    晋紫阳(2000—),女,山西霍州人,硕士研究生,CCF会员,主要研究方向:三维重建
    赵华峰(2001—),男,山西运城人,硕士研究生,CCF会员,主要研究方向:计算机视觉。
  • 基金资助:
    山西省自然科学基金资助项目(202303021211153);山西省科技重大专项计划“揭榜挂帅”项目(202201150401021);山西省研究生科研创新项目(2024SJ279)

Abstract:

In robotic environmental perception tasks, single-view point clouds suffer from severe geometric information loss due to sensor viewpoint limitations. Point cloud reconstruction methods based on Computer-Aided Design (CAD) model replacement can avoid the risks of structural instability associated with reconstruction directly from point clouds by retrieving similar models and applying deformation. Unsupervised 3D shape REtrieval and Deformation (U-RED) algorithm achieves topologically consistent CAD model replacement while maintaining the editability of the reconstruction results. However, when dealing with objects with complex topology, it still faces problems such as insufficient rotation and translation invariance in point cloud representations, difficulty in distinguishing neighboring components due to geometric similarity among homologous components, and parameter update failures caused by scattered attention weights and gradient vanishing or explosion. To address these challenges, a Vector neuron enhanced Unsupervised REtrieval and Deformation algorithm with Feature affine residual (VU-RED-F) was proposed based on U-RED. Firstly, a Vector Neuron Encoder (VNE) was constructed to improve the robustness of the feature extraction module in representing rotation and translation invariance of point clouds. Secondly, learnable affine transformation residuals were introduced to reconstruct the feature mapping process, adaptively adjust the feature distribution, and enhance the network's ability to discriminate local geometric structures between components. Finally, by integrating soft-threshold gating and residual correction, the stability of gradient propagation was enhanced while constraining the sparsity of the attention distribution, thereby boosting network convergence efficiency and reducing loss during retrieval and deformation. Experimental results on the synthetic PartNet and ComplementMe datasets, as well as the real Scan2CAD dataset, show that the VU-RED-F algorithm has the lowest average chamfer distance (cd) loss, improving the fidelity of local geometric details in CAD models.

Key words: point cloud reconstruction, model retrieval and deformation, vector neuron, Feature Affine Residual (FA?Res), residual network

摘要:

在机器人环境感知任务中,单视角点云因传感器视角受限导致几何信息严重缺失。基于计算机辅助设计(CAD)模型替换的点云重建方法通过检索相似模型并实施变形,可有效规避直接从点云中重建的结构失控风险。无监督三维形状检索与变形(U-RED)算法在保持重建结果可编辑性的同时可实现拓扑一致的CAD模型替换,但面对复杂拓扑结构物体时仍存在点云旋转平移不变性表征不足、同源部件间的几何相似性导致近邻部件区分困难、注意力权重分散以及梯度消失与爆炸引起的参数更新失效问题。针对上述挑战,在U-RED基础上提出基于向量神经元与特征仿射残差增强的无监督检索变形算法(VU-RED-F)。首先,构建向量神经元编码器(VNE),提升特征提取模块在点云旋转平移不变性表征的鲁棒性;其次,引入可学习的仿射变换残差以重构特征映射过程,自适应调整特征分布,增强网络对部件间局部几何结构的判别能力;最后,融合软阈值门控与残差校正,在约束注意力分布稀疏性的同时增强梯度传播稳定性,提升网络收敛效率,降低检索变形过程中的损失。在PartNet和ComplementMe合成数据集以及Scan2CAD真实数据集上的实验结果表明,VU-RED-F算法的平均倒角距离(cd)损失最小,提高了CAD模型的局部几何细节保真度。

关键词: 点云重建, 模型检索与变形, 向量神经元, 特征仿射残差, 残差网络

CLC Number: