基于RGB-D图像的室内机器人同时定位与地图构建

doi:10.11772/j.issn.1001-9081.2020040518

计算机应用 ›› 2020, Vol. 40 ›› Issue (12): 3637-3643.DOI: 10.11772/j.issn.1001-9081.2020040518

• 虚拟现实与多媒体计算 • 上一篇下一篇

基于RGB-D图像的室内机器人同时定位与地图构建

赵宏, 刘向东, 杨永娟

兰州理工大学计算机与通信学院, 兰州 730050

收稿日期:2020-04-23 修回日期:2020-08-10 出版日期:2020-12-10 发布日期:2020-08-21
通讯作者: 刘向东(1994-),男,甘肃陇西人,硕士研究生,CCF会员,主要研究方向:三维重建、视觉SLAM。Liuxd1994@foxmail.com
作者简介:赵宏(1971-),男,甘肃西和人,教授,博士生导师,博士,CCF会员,主要研究方向:并行与分布式处理、三维重建、深度学习;杨永娟(1996-),女,甘肃靖远人,硕士研究生,CCF会员,主要研究方向:三维重建、视觉惯性SLAM
基金资助:
国家自然科学基金资助项目（51668043，61262016）；赛尔网络下一代互联网技术创新项目（NG1120160311，NG1120160112）。

Indoor robot simultaneous localization and mapping based on RGB-D image

ZHAO Hong, LIU Xiangdong, YANG Yongjuan

School of Computer and Communications, Lanzhou University of Technology, Lanzhou Gansu 730050, China

Received:2020-04-23 Revised:2020-08-10 Online:2020-12-10 Published:2020-08-21
Supported by:
This work is partially supported by the National Natural Science Foundation of China （51668043， 61262016）， the Next Generation Internet Technology Innovation Project of CERNET （NG1120160311， NG1120160112）.

摘要/Abstract

摘要： 同时定位与地图构建（SLAM）是机器人在未知环境实现自主导航的关键技术，针对目前常用的RGB-D SLAM系统实时性差和精确度低的问题，提出一种新的RGB-D SLAM系统，以进一步提升实时性和精确度。首先，采用ORB算法检测图像特征点，并对提取的特征点采用基于四叉树的均匀化策略进行处理，并结合词袋模型（BoW）进行特征匹配。然后，在系统相机姿态初始值估计阶段，结合PnP和非线性优化方法为后端优化提供一个更接近最优值的初始值；在后端优化中，使用光束法平差（BA）对相机姿态初始值进行迭代优化，从而得到相机姿态的最优值。最后，根据相机姿态和每帧点云地图的对应关系，将所有的点云数据注册到同一个坐标系中，得到场景的稠密点云地图，并对点云地图利用八叉树进行递归式的压缩以得到一种用于机器人导航的三维地图。在TUM RGB-D数据集上，将构建的RGB-D SLAM同RGB-D SLAMv2、ORB-SLAM2系统进行了对比，实验结果表明所构建的RGB-D SLAM系统在实时性和精确度上的综合表现更优。

关键词: RGB-D传感器, 同时定位与地图构建, 稠密点云地图, 八叉树地图

Abstract: Simultaneous Localization and Mapping (SLAM) is a key technology for robots to realize autonomous navigation in unknown environments. Aiming at the poor real-time performance and low accuracy of the commonly used RGB-Depth (RGB-D) SLAM system, a new RGB-D SLAM system was proposed to further improve the real-time performance and accuracy. Firstly, the Oriented FAST and Rotated BRIEF (ORB) algorithm was used to detect the image feature points, and the extracted feature points were processed by using the quadtree-based homogenization strategy, and the Bag of Words (BoW) was used to perform feature matching. Then, in the stage of system camera pose initial value estimation, an initial value which was closer to the optimal value was provided for back-end optimization by combining the Perspective n Point (PnP) and nonlinear optimization methods. In the back-end optimization, the Bundle Adjustment (BA) was used to optimize the initial value of the camera pose iteratively for obtaining the optimal value of the camera pose. Finally, according to the correspondence between the camera pose and the point cloud map of each frame, all the point cloud data were registered in a coordinate system to obtain the dense point cloud map of the scene, and the octree was used to compress the point cloud map recursively, so as to obtain a 3D map for robot navigation. On the TUM RGB-D dataset, the proposed RGB-D SLAM system, RGB-D SLAMv2 system and ORB-SLAM2 system were compared. Experimental results show that the proposed RGB-D SLAM system has better comprehensive performance on real-time and accuracy.

Key words: RGB-Depth （RGB-D) senor, Simultaneous Localization and Mapping (SLAM), dense point cloud map, octo-map

中图分类号:

TP242.6

赵宏, 刘向东, 杨永娟. 基于RGB-D图像的室内机器人同时定位与地图构建[J]. 计算机应用, 2020, 40(12): 3637-3643.

ZHAO Hong, LIU Xiangdong, YANG Yongjuan. Indoor robot simultaneous localization and mapping based on RGB-D image[J]. Journal of Computer Applications, 2020, 40(12): 3637-3643.

参考文献

[1] 刘浩敏, 章国锋, 鲍虎军. 基于单目视觉的同时定位与地图构建[J]. 计算机辅助设计与图形学学报, 2016, 28(6):855-868.(LIU H M,ZHANG G F,BAO H J. A survey of monocular simultaneous localization and mapping[J]. Journal of Computer Aided Design and Compute Graphics,2016,28(6):855-868.)
[2] 高翔, 张涛, 刘毅, 等. 视觉SLAM十四讲:从理论到实践[M]. 北京:电子工业出版社, 2017:132-180.(GAO X,ZHANG T,LIU Y,et al. The 14 Lectures on Visual SLAM:from Theory to Practice[M]. Beijing:Publishing House of Electronics Industry,2017:132-180.)
[3] CADENA C,CARLONE L,CARRILLO H,et al. Past,present, and future of simultaneous localization and mapping:toward the robust-perception age[J]. IEEE Transactions on Robotics,2016, 32(6):1309-1332.
[4] 胡凌燕, 曹禄, 熊鹏文, 等. 基于RGB-D图像的三维同步定位与建图研究[J]. 系统仿真学报, 2017, 29(11):2840-2846.(HU L Y,CAO L,XIONG P W,et al. 3D simultaneous localization and mapping based on RGB-D Images[J]. Journal of System Simulation,2017,29(11):2840-2846.)
[5] WANG J, HUANG S, ZHAO L, et al. High quality 3D reconstruction of indoor environments using RGB-D sensors[C]//Proceedings of the 12th IEEE Conference on Industrial Electronics and Applications. Piscataway:IEEE,2017:1739-1744.
[6] 权美香, 朴松昊, 李国. 视觉SLAM综述[J]. 智能系统学报, 2016,11(6):768-776. (QUAN M X,PIAO S H,LI G. An overview of visual SLAM[J]. CAAI Transactions on Intelligent Systems,2016,11(6):768-776.)
[7] DAVISON A J,REID I D,MOLTON N D,et al. MonoSLAM:realtime single camera SLAM[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence,2007,29(6):1052-1067.
[8] 陈世浪, 吴俊君. 基于RGB-D相机的SLAM技术研究综述[J]. 计算机工程与应用, 2019, 55(7):30-39, 126.(CHEN S L,WU J J. RGB-D SLAM:a survey[J]. Computer Engineering and Applications,2019,55(7):30-39,126.)
[9] 余涛. Kinect应用开发实战:用最自然的方式与机器对话[M]. 北京:机械工业出版社, 2013:48-62.(YU T. Kinect Application Development Practice:the Most Natural Way to Talk to the Machine[M]. Beijing:China Machine Press,2013:48-62.)
[10] HENRY P,KRAININ M,HERBST E,et al. RGB-D mapping:using Kinect-style depth cameras for dense 3D modeling of indoor environments[J]. International Journal of Robotics Research, 2012,31(5):647-663.
[11] ENDRES F. RGB-D SLAMv2[EB/OL].[2020-01-16]. https://github.com/felixendres/rgbdslam_v2.
[12] RUBLEE E, RABAUD V, KONOLIGE K, et al. ORB:an efficient alternative to SIFT or SURF[C]//Proceedings of the 2011 IEEE International Conference on Computer Vision. Piscataway:IEEE,2011:2564-2571.
[13] LOWE D G. Distinctive image features from scale-invariant keypoints[J]. International Journal of Computer Vision,2004,60(2):91-110.
[14] BAY H,TUVTELARRS T,VAN GOOL L. SURF:speeded up robust features[J]. Computer Vision and Image Understanding, 2008,110(3):404-417.
[15] PITZER B. SiftGPU[EB/OL].[2019-09-18]. https://github.com/pitzer/SiftGPU.
[16] ROS.org. Package summary[EB/OL].[2019-10-19]. http://wiki.ros.org/ROS/
[17] MUR-ARTAL R,MONTIEL J M M,TARDÓS J D. ORB-SLAM:a versatile and accurate monocular SLAM system[J]. IEEE Transactions on Robotics,2015,31(5):1147-1163.
[18] MUR-ARTAL R,TARDÓS J D. ORB-SLAM2:an open-source SLAM system for monocular,stereo and RGB-D camera[J]. IEEE Transactions on Robotics,2017,33(5):1255-1262.
[19] 林辉灿, 吕强, 王国胜, 等. 基于VSLAM的自主移动机器人三维同时定位与地图构建[J]. 计算机应用, 2017, 37(10):2884-2887,2894. (LIN H C, LYU Q, WANG G S, et al. 3D simultaneous localization and mapping for mobile robot based on VSLAM[J]. Journal of Computer Applications,2017,37(10):2884-2887,2894.)
[20] GALVEZ-LÓPEZ D,TARDÓS J D. Bags of binary words for fast place recognition in image sequences[J]. IEEE Transactions on Robotics,2012,28(5):1188-1197.
[21] 陈鹏, 王晨骁. IEPnP:一种基于EPnP的相机位姿迭代估计算法[J]. 光学学报, 2018, 38(4):130-136.(CHEN P,WANG C X. IEPnP:an iterative camera pose estimation algorithm based on EPnP[J]. Acta Optica Sinica,2018,38(4):130-136.)
[22] 樊彦国, 柴江龙, 许明明, 等. 基于ORB与RANSAC融合改进的图像配准[J]. 光学精密工程, 2017, 27(3):702-717.(FAN Y G, CHAI J L,XU M M,et al. Improved fast image registration algorithm based on ORB and RANSAC fusion[J]. Optics and Precision Engineering,2017,27(3):702-717.)
[23] OpenCV Team. OpenCV homepage[EB/OL].[2019-08-19]. https://opencv.org/.
[24] GAO X,HOU X,TANG J,et al. Complete solution classification for the perspective-three-point problem[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2003, 25(8):930-943.
[25] STURM J,ENGELHARD N,ENDRES F. A benchmark for the evaluation of RGB-D SLAM systems[C]//Proceedings of the 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems. Piscataway:IEEE,2012:573-580.

基于RGB-D图像的室内机器人同时定位与地图构建

Indoor robot simultaneous localization and mapping based on RGB-D image

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 7

编辑推荐

Metrics

[1]	席志红, 王洪旭, 韩双全. 基于ORB-SLAM2系统的快速误匹配剔除算法与地图构建[J]. 计算机应用, 2020, 40(11): 3289-3294.
[2]	丁斗建, 赵晓林, 王长根, 高关根, 寇磊. 基于视觉的机器人自主定位与障碍物检测方法[J]. 计算机应用, 2019, 39(6): 1849-1854.
[3]	黄帅, 付光远, 伍明, 岳敏. 未知环境基于单目次优视差的多模滤波目标跟踪算法[J]. 计算机应用, 2019, 39(3): 864-868.
[4]	胡章芳, 鲍合章, 陈旭, 范霆铠, 赵立明. 基于改进闭环检测算法的视觉同时定位与地图构建[J]. 计算机应用, 2018, 38(3): 873-878.
[5]	林辉灿, 吕强, 王国胜, 张洋, 梁冰. 基于VSLAM的自主移动机器人三维同时定位与地图构建[J]. 计算机应用, 2017, 37(10): 2884-2887.
[6]	赵亮陈敏李洪臣. 基于视觉同时定位与地图构建数据关联优化算法[J]. 计算机应用, 2014, 34(2): 576-579.
[7]	曾文静张铁栋徐玉如姜大鹏. Data association method of SLAM based on ant colony algorithm[J]. 计算机应用, 2009, 29(1): 136-138,.