首页 | 本学科首页   官方微博 | 高级检索  
     

三维模板跟踪的基准合成数据集构建及算法评估
引用本文:何弦,李佳宸,金立,刘力,钟凡,秦学英. 三维模板跟踪的基准合成数据集构建及算法评估[J]. 计算机学报, 2022, 45(3): 585-600. DOI: 10.11897/SP.J.1016.2022.00585
作者姓名:何弦  李佳宸  金立  刘力  钟凡  秦学英
作者单位:山东大学软件学院 济南 250101;数字媒体技术教育部工程研究中心 济南 250101;视辰信息科技(上海)有限公司 上海 201203;山东大学计算机科学与计算学院 山东 青岛 266237
基金项目:国家自然科学基金项目(62172260,61907026);;山东省高等学校科学技术计划项目(J18KA392)资助~~;
摘    要:三维模板跟踪旨在将预先构建的三维CAD模型与输入图像中的相应目标进行精确配准,在增强现实、机器人等领域具有重要的应用,也是计算机视觉领域的关键问题之一.近年来,三维模板跟踪的准确率和稳定性都得到了持续提升,但仅有少量的工作关注三维模板跟踪数据集的构建.随着深度学习的普及,各领域中大规模数据集的构建越来越被重视,为算法的...

关 键 词:三维模板跟踪  数据集构建  算法测评  增强现实  真实感渲染

A Synthetic Dataset and Performance Evaluation for 3D Template Tracking
HE Xian,LI Jia-Chen,JIN Li,LIU Li,ZHONG Fan,QIN Xue-Ying. A Synthetic Dataset and Performance Evaluation for 3D Template Tracking[J]. Chinese Journal of Computers, 2022, 45(3): 585-600. DOI: 10.11897/SP.J.1016.2022.00585
Authors:HE Xian  LI Jia-Chen  JIN Li  LIU Li  ZHONG Fan  QIN Xue-Ying
Affiliation:(Department of Software,Shandong University,Jinan 250101;Engineering Research Center of Digital Media Technology,Ministry of Education,Shandong University,Jinan 250101;Shichen Information Technology(Shanghai)Co.,Ltd,Shanghai 201203;Department of Computer Science and Technology,Shandong University,Qingdao,Shandong 266237)
Abstract:3D template tracking aims to accurately align pre-constructed 3D CAD models with the corresponding targets in the input images,and has important applications in augmented reality and robotics.It is also one of the key problems in the field of computer vision.In recent years,various approaches have been proposed to improve the accuracy and robustness of 3D template tracking,but only a small amount of work has contributed to the construction of 3D template tracking datasets.With the development and wide applications of deep learning,the construction of large-scale datasets in various fields has been paid more and more attention,laying the foundation for the training,testing and evaluation of algorithms,which has greatly promoted the development of related fields.Previous datasets for 3D template tracking are acquired by either video capture or computer rendering.Video-captured datasets are realistic,but since the pose is computed based on hand-crafted markers,the accuracy of the ground-truth pose is not guaranteed and the size of these datasets are also limited due to the time-consuming labelling process.Computer-rendered datasets could be synthesized massively,but the quality of rendered image sequences is limited by the adopted render techniques.Altogether,previous datasets suffer from problems such as limited scale,inaccurate ground-truth poses,unrealistic images and insufficient diversity of model settings,therefore it is meaningful and challenging to construct a high-quality and large-scale dataset for 3D template tracking.In this paper,we propose to construct a large-scale 3D template tracking dataset RDOT(Render Dataset for Object Tracking)based on photorealistic rendering.RDOT is rendered with photorealistic rendering method.The model set contains tens of objects with different physical structures and realistic materials,it also allows the camera and objects to move in pre-defined complex motion modes.Moreover,compared with previous datasets,RDOT takes more accurate control of settings of rendering scenes,it offers various detailed settings of lighting,noise,motion blur and occlusion in different degrees of difficulty.To the best of our knowledge,RDOT is currently the largest 3D template tracking dataset which meets the demands of performance evaluation.Based on RDOT,we evaluated previous 3D template tracking methods in an objective and fair way.Previous approaches have been evaluated on different datasets that suffer the aforementioned problems.In our evaluation,the tracking methods are evaluated with three precision metrics,including ADE(Average Edge Distance),ASD(Average Surface Distance)and RR(Reinitialization Rate).We analyze the evaluation results from multiple aspects considering structures of objects,materials of objects and different settings of rendering scenes.In addition,since RGB-based 3D tracking method usually produce significant errors in the depth direction due to the missing of depth constraint,we propose a statistical model of tracking errors that can be computed based on the accurate ground-truth pose of RDOT.By applying the error model to compensate the resulting object pose parameters,the tracking accuracy can be improved significantly.Finally,we discuss the disadvantages of different tracking approaches,and give an overall conclusion and perspective for future 3D template tracking approaches.
Keywords:3D template tracking  dataset construction  algorithm evaluation  augmented reality  photorealistic rendering
本文献已被 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号