Journal of Computer Applications ›› 2023, Vol. 43 ›› Issue (4): 1220-1225.DOI: 10.11772/j.issn.1001-9081.2022020261

• Multimedia computing and computer simulation • Previous Articles    

Robust RGB-D SLAM system incorporating instance segmentation and clustering in dynamic environment

Tianzouzi XIAO1(), Xiaobo ZHOU2, Xin LUO1, Qipeng TANG1   

  1. 1.School of Artificial Intelligence and Automation,Huazhong University of Science and Technology,Wuhan Hubei 430074,China
    2.COMAC Shanghai Aircraft Manufacturing Company Limited,Shanghai 201324,China
  • Received:2022-03-09 Revised:2022-05-25 Accepted:2022-05-31 Online:2022-08-16 Published:2023-04-10
  • Contact: Tianzouzi XIAO
  • About author:ZHOU Xiaobo, born in 1992, assistant engineer. His research interests include design of civil aircraft process equipment.
    LUO Xin, born in 1986, Ph. D., lecturer. His research interests include computer vision based fault detection.
    TANG Qipeng, born in 1991, Ph. D., lecturer. His research interests include digital integrated control.
  • Supported by:
    National Natural Science Foundation of China(51907174)

动态环境下结合实例分割与聚类的鲁棒RGB-D SLAM系统

肖田邹子1(), 周小博2, 罗欣1, 唐其鹏1   

  1. 1.华中科技大学 人工智能与自动化学院,武汉 430074
    2.中国商飞上海飞机制造有限公司,上海 201324
  • 通讯作者: 肖田邹子
  • 作者简介:周小博(1992—),男,江苏启东人,助理工程师,主要研究方向:民用航空器工艺装备设计;
    罗欣(1986—),男,河南信阳人,讲师,博士,主要研究方向:基于计算机视觉的故障检测;
    唐其鹏(1991—),男,江西南昌人,讲师,博士,主要研究方向:数字集成化控制。
  • 基金资助:
    国家自然科学基金资助项目(51907174)

Abstract:

Visual Simultaneous Location And Mapping (VSLAM) technology is commonly used for indoor robot navigation and perception. However, the pose estimation method of VSLAM aims at static environment, and might lead to the location and mapping failure when moving objects exist in the scene. To solve this problem, an Instance Segmentation and Clustering SLAM (ISC-SLAM) system was proposed. In this system, the instance segmentation network was used to generate the possibility masks of dynamic objects in the scene, and the dynamic points in the scene were detected by using the multi-view geometry method during the segmentation. After matching the obtained possibility masks and the detected dynamic points, the accurate dynamic masks of moving objects were determined. The feature points of the dynamic objects were able to be deleted by using the dynamic masks and then the position of camera was estimated accurately by using the remained static feature points. To solve the under-segmentation problem of the instance segmentation network, the depth filling algorithm and clustering algorithm were applied to ensure the completed deletion elimination of dynamic feature points. Finally, the moving objects obscured background was reconstructed, and the static point cloud map was built with the correct camera pose. Experimental results on Technical University of Munich (TUM) dataset demonstrate that the proposed system can achieve robust positioning and mapping while ensuring real-time performance in dynamic environment.

Key words: Simultaneous Location And Mapping (SLAM), instance segmentation, clustering, dynamic environment, static dense point cloud map

摘要:

视觉同步定位与建图(VSLAM)技术常常用于室内机器人的导航与感知,然而VSLAM的位姿估算方法是针对静态环境的,当场景中存在运动对象时,可能会导致定位和建图失败。针对此问题,提出了一个结合实例分割与聚类的VSLAM系统。所提系统使用实例分割网络生成场景中动态对象的概率掩膜,同时利用多视图几何的方法检测场景中的动态点,并将检测到的动态点与获得的概率掩膜匹配之后确定动态物体的精确动态掩膜;利用动态掩膜删除动态物体的特征点,然后利用剩余的静态特征点准确估计摄像机的位置。为了解决实例分割网络欠分割的问题,采用深度填充算法和聚类算法保证动态特征点完全删除。最后,重建图片被动态物体遮挡的背景,在正确的相机位姿下建立静态稠密点云地图。在公开的TUM(Technical University of Munich)数据集上的实验结果表明,在动态环境中,所提系统在保证实时性的同时能实现鲁棒的定位与建图。

关键词: 同步定位与建图, 实例分割, 聚类, 动态环境, 静态稠密点云地图

CLC Number: