首页 | 本学科首页   官方微博 | 高级检索  
     

动态场景下基于实例分割和三维重建的多物体单目 SLAM
引用本文:冯 洲,续欣莹,郑宇轩,程 兰,李鹏越.动态场景下基于实例分割和三维重建的多物体单目 SLAM[J].仪器仪表学报,2023,44(8):51-62.
作者姓名:冯 洲  续欣莹  郑宇轩  程 兰  李鹏越
作者单位:太原理工大学电气与动力工程学院
基金项目:国家自然科学基金(62073232);
摘    要:针对大多数SLAM系统在动态环境下相机位姿估计不准确与环境语义信息利用不充分的问题,提出一种基于实例分割的关键帧检测和贝叶斯动态特征概率传播的动态物体检测算法,并对环境中存在的静态物体三维重建,以此构建一个动态环境下的多物体单目SLAM系统。该系统对关键帧输入图像进行实例分割与特征提取,获取潜在运动物体特征点集合与静态物体特征点集合;利用非运动物体特征点集合获取帧间位姿变换,普通帧利用贝叶斯对动静态特征点进行概率传播,利用静态特征点集实现对相机位姿的精准估计;在关键帧中对静态物体进行联合数据关联,数据充足后进行多物体三维重建,构建多物体语义地图,最终实现多物体单目SLAM。本文在TUM与Boon公开数据集上的实验结果表明,在动态场景下,相较于ORB-SLAM2算法,绝对位姿误差的均方根误差平均降低54.1%和58.2%。

关 键 词:多物体单目SLAM  动态场景  实例分割  位姿估计  三维重建

Multi-object monocular SLAM based on instance segmentation and 3D reconstruction in dynamic scene
Feng Zhou,Xu Xinying,Zheng Yuxuan,Cheng Lan,Li Pengyue.Multi-object monocular SLAM based on instance segmentation and 3D reconstruction in dynamic scene[J].Chinese Journal of Scientific Instrument,2023,44(8):51-62.
Authors:Feng Zhou  Xu Xinying  Zheng Yuxuan  Cheng Lan  Li Pengyue
Affiliation:1.College of Electrical and Power Engineering, Taiyuan University of Technology
Abstract:To address the problems of inaccurate camera pose estimation and insufficient utilization of environmental semantic information in most SLAM systems in dynamic environments, proposes a dynamic object detection algorithm based on the instance segmentation, keyframe detection, and Bayesian dynamic feature probability propagation, and three-dimensional reconstruction of static objects in the environment. To construct a multi object monocular SLAM system in a dynamic environment, the system performs instance segmentation and feature extraction on key frame input images, which could obtain a set of potential moving object feature points and a set of static object feature points. A set of non-moving object feature points is used to obtain inter frame pose transformation, Bayesian probability propagation of dynamic and static feature points are utilized for ordinary frames, and a set of static feature points is used to achieve accurate estimation of camera pose. Joint data association is performed on static objects in key frames, and after sufficient data is available, multi object 3D reconstruction is performed to construct a multi object semantic map. Finally, multi object monocular SLAM is achieved. The experimental results on TUM and Boon public dataset show that in dynamic scenarios, compared to the ORB-SLAM2 algorithm, the RMSE of APE decreases by 54. 1% and 58. 2% on average.
Keywords:multi-object monocular SLAM  dynamic scene  instance segmentation  posture estimation  three-dimensional reconstruction
点击此处可从《仪器仪表学报》浏览原始摘要信息
点击此处可从《仪器仪表学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号