首页 | 本学科首页   官方微博 | 高级检索  
     

基于深度学习的双目立体匹配方法综述
引用本文:尹晨阳,职恒辉,李慧斌.基于深度学习的双目立体匹配方法综述[J].计算机工程,2022,48(10):1-12.
作者姓名:尹晨阳  职恒辉  李慧斌
作者单位:西安交通大学 数学与统计学院, 西安 710049
基金项目:国家自然科学基金面上项目(61976173);教育部-中国移动人工智能建设项目(MCM20190701)。
摘    要:双目立体匹配是计算机视觉领域的经典问题,在自动驾驶、遥感、机器人感知等诸多任务中得到广泛应用。双目立体匹配的主要目标是寻找双目图像对中同名点的对应关系,并利用三角测量原理恢复图像深度信息。近年来,基于深度学习的立体匹配方法在匹配精度和匹配效率上均取得了远超传统方法的性能表现。将现有基于深度学习的立体匹配方法分为非端到端方法和端到端方法。基于深度学习的非端到端方法利用深度神经网络取代传统立体匹配方法中的某一步骤,根据被取代步骤的不同,该类方法被分为基于代价计算网络、基于代价聚合网络和基于视差优化网络的3类方法。基于深度学习的端到端方法根据代价体维度的不同可分为基于3D代价体和基于4D代价体的方法。从匹配精度、时间复杂度、应用场景等多个角度对非端到端和端到端方法中的代表性成果进行分析,并归纳各类方法的优点以及存在的局限性。在此基础上,总结基于深度学习的立体匹配方法当前面临的主要挑战并展望该领域未来的研究方向。

关 键 词:计算机视觉  深度学习  双目图像  立体匹配方法  图像深度  
收稿时间:2022-03-24
修稿时间:2022-06-18

Survey of Binocular Stereo-matching Methods Based on Deep Learning
YIN Chenyang,ZHI Henghui,LI Huibin.Survey of Binocular Stereo-matching Methods Based on Deep Learning[J].Computer Engineering,2022,48(10):1-12.
Authors:YIN Chenyang  ZHI Henghui  LI Huibin
Affiliation:School of Mathematics and Statistics, Xi'an Jiaotong University, Xi'an 710049, China
Abstract:Binocular stereo matching is a classical problem in the field of computer vision and has been widely used in many tasks such as automated driving, remote sensing, and robot perception.The main goal of binocular stereo matching is to identify the corresponding relationship of same-named points in a binocular image pair and to recover image depth information based on the triangulation principle.In recent years, stereo-matching methods based on deep learning have achieved much better performance than traditional methods in terms of matching accuracy and efficiency.Existing stereo-matching methods based on deep learning are divided into non-end-to-end and end-to-end methods.The non-end-to-end methods based on deep learning use deep neural networks to replace steps in traditional stereo-matching methods.Based on these different steps, these methods can be divided into three types of networks:cost-based computing, cost-based aggregation, and disparity-based optimization.The end-to-end methods based on deep learning can be divided into 3D and 4D cost-volume-based methods according to different cost-volume dimensions.The representative methods of non- and end-to-end methods are analyzed in terms of matching accuracy, time complexity, and application scenarios, and the advantages and limitations of various methods are summarized.Accordingly, the main challenges of stereo-matching methods based on deep learning are summarized and future research directions in the field are prospected.
Keywords:computer vision  deep learning  binocular images  stereo-matching method  image depth  
点击此处可从《计算机工程》浏览原始摘要信息
点击此处可从《计算机工程》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号