首页 | 本学科首页   官方微博 | 高级检索  
     

语义导向多尺度多视图深度估计算法
引用本文:贠璟扬,李学华,向维.语义导向多尺度多视图深度估计算法[J].计算机工程与应用,2022,58(2):215-224.
作者姓名:贠璟扬  李学华  向维
作者单位:1.北京信息科技大学 信息与通信工程学院,北京 100101 2.詹姆斯库克大学 科学与工程学院,昆士兰 凯恩斯 4878
基金项目:北京市自然科学基金-海淀原始创新联合基金(重点研究专题)(L182039);北京市自然科学基金(市教委联合资助)(KZ201911232046)。
摘    要:目前利用深度学习进行多视图深度估计的方法可以根据卷积类型可以大致分为两类.其中,基于2D卷积网络的模型预测计算速度快,但预测精度较低;基于3D卷积网络的模型预测精度高,却存在高硬件消耗.同时,多视图中相机外部参数的变化使得模型无法在物体边缘、遮挡或纹理较弱区域生成高精度预测结果.针对上述问题,提出了基于3D卷积的语义导...

关 键 词:多视图立体匹配  深度估计  深度神经网络  监督学习

Semantic-Guidance Multi-scale Network for Multi-view Stereo
YUN Jingyang,LI Xuehua,XIANG Wei.Semantic-Guidance Multi-scale Network for Multi-view Stereo[J].Computer Engineering and Applications,2022,58(2):215-224.
Authors:YUN Jingyang  LI Xuehua  XIANG Wei
Affiliation:1.School of Information and Communication Engineering, Beijing Information Science and Technology University, Beijing 100101, China 2.College of Science and Engineering, James Cook University, Cairns, Queensland 4878, Commonwealth of Australia
Abstract:The current multi-view depth estimation methods based on deep learning can be roughly divided into two categories according to the type of convolution neural network. Among them, the model based on the 2D convolutional network has a fast prediction speed with a lower prediction accuracy while the model based on the 3D convolutional network achieves higher prediction accompanying more hardwares consumption. Also, the transformation of the external parameters of the camera in the multi-view make it impossible for the model to generate high-precision prediction results on the edges of objects, occlusions or textureless areas. In response to the above problems, this paper proposes a multi-scale semantic-oriented multi-view depth estimation algorithm based on 3D convolution which can reduce hardware demand while ensuring prediction accuracy. At the same time, for areas such as occlusion or textureless areas, the image features extracted by the network itself are used as a prior guidance information to enhance the network’s perception of global information and a multi-scale fusion method is combined to enhance the robustness of the network. In the testing comparison of the public datasets, the method proposed in this paper predicts the depth map results more clearly, also can handle sensitive areas such as the object boundaries or occlusion regions in picture.
Keywords:multi-view stereo  depth estimation  deep neural network  supervised learning
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《计算机工程与应用》浏览原始摘要信息
点击此处可从《计算机工程与应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号