Semantic-Guidance Multi-scale Network for Multi-view Stereo (语义导向多尺度多视图深度估计算法)

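Cite as:	YUN Jingyang, LI Xuehua, XIANG Wei. Semantic-Guidance Multi-scale Network for Multi-view Stereo[J]. Computer Engineering and Applications, 2022, 58(2): 215-224.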

Authors:	YUN Jingyang (贠璟扬), LI Xuehua (李学华), XIANG Wei (向维)

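Affiliations:	1. School of Information and Communication Engineering, Beijing Information Science and Technology University, Beijing 100101, China; 2. College of Science and Engineering, James Cook University, Cairns, Queensland 4878, Australia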

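Funding:	Beijing Natural Science Foundation-Haidian Original Innovation Joint Fund (Key Research Topic) (L182039); Beijing Natural Science Foundation (jointly funded by the Beijing Municipal Education Commission) (KZ201911232046).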

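Abstract (translated from the Chinese):	Current deep-learning-based multi-view depth estimation methods can be roughly divided into two categories according to the type of convolution used. Models based on 2D convolutional networks predict quickly but with lower accuracy; models based on 3D convolutional networks predict more accurately but consume more hardware resources. In addition, variations in the cameras' extrinsic parameters across views prevent models from producing high-precision predictions at object edges, in occluded regions, or in weakly textured areas. To address these problems, this paper proposes a 3D-convolution-based semantic-guid...
Keywords (translated from the Chinese):	multi-view stereo matching; depth estimation; deep neural network; supervised learning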
Semantic-Guidance Multi-scale Network for Multi-view Stereo

YUN Jingyang, LI Xuehua, XIANG Wei. Semantic-Guidance Multi-scale Network for Multi-view Stereo[J]. Computer Engineering and Applications, 2022, 58(2): 215-224.

Authors:	YUN Jingyang, LI Xuehua, XIANG Wei

Affiliations:	1. School of Information and Communication Engineering, Beijing Information Science and Technology University, Beijing 100101, China; 2. College of Science and Engineering, James Cook University, Cairns, Queensland 4878, Australia

Abstract:	Current deep-learning-based multi-view depth estimation methods can be roughly divided into two categories according to the type of convolution they use. Models based on 2D convolutional networks predict quickly but with lower accuracy, while models based on 3D convolutional networks achieve higher accuracy at the cost of higher hardware consumption. In addition, variations in the cameras' extrinsic parameters across views prevent a model from generating high-precision predictions at object edges, in occluded regions, or in textureless areas. To address these problems, this paper proposes a semantic-guidance multi-scale multi-view depth estimation algorithm based on 3D convolution that reduces hardware demand while maintaining prediction accuracy. For occluded or textureless areas, the image features extracted by the network itself are used as prior guidance information to enhance the network's perception of global information, and a multi-scale fusion method is incorporated to improve the network's robustness. In comparative tests on public datasets, the proposed method predicts clearer depth maps and can also handle sensitive areas such as object boundaries and occluded regions.

Keywords:	multi-view stereo; depth estimation; deep neural network; supervised learning
This article is indexed in databases including VIP and Wanfang Data.