基于密集卷积网络的单目图像深度估计方法 Method for Estimating Monocular Image Depth Based on Dense Convolutional Network期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于密集卷积网络的单目图像深度估计方法

引用本文：	王亚群,戴华林,王丽,李国燕. 基于密集卷积网络的单目图像深度估计方法[J]. 计算机工程, 2021, 47(11): 262-267,291. DOI: 10.19678/j.issn.1000-3428.0059516

作者姓名：	王亚群戴华林王丽李国燕

作者单位：	天津城建大学计算机与信息工程学院,天津 300384

基金项目：	天津市自然科学基金（17JCQNJC00500）。

摘要：	为解决目前单目图像深度估计方法存在的精度低、网络结构复杂等问题，提出一种密集卷积网络结构，该网络采用端到端的编码器和解码器结构。编码器引入密集卷积网络DenseNet，将前面每一层的输出作为本层的输入，在加强特征重用和前向传播的同时减少参数量和网络计算量，从而避免梯度消失问题发生。解码器结构采用带有空洞卷积的上投影模块和双线性插值模块，以更好地表达由编码器所提取的图像特征，最终得到与输入图像相对应的估计深度图。在NYU Depth V2室内场景深度数据集上进行训练、验证和测试，结果表明，该密集卷积网络结构在δ<1.25时准确率达到0.851，均方根误差低至0.482。
关键词：	密集卷积网络单目图像编码器解码器深度估计
收稿时间：	2020-09-14
修稿时间：	2020-10-22
Method for Estimating Monocular Image Depth Based on Dense Convolutional Network

WANG Yaqun,DAI Hualin,WANG Li,LI Guoyan. Method for Estimating Monocular Image Depth Based on Dense Convolutional Network[J]. Computer Engineering, 2021, 47(11): 262-267,291. DOI: 10.19678/j.issn.1000-3428.0059516

Authors:	WANG Yaqun DAI Hualin WANG Li LI Guoyan

Affiliation:	School of Computer and Information Engineering, Tianjin Chengjian University, Tianjin 300384, China

Abstract:	To address the low accuracy and complex network structure of the existing methods for estimating monocular image depth,a dense convolutional network for estimating monocular image depth is proposed,which adopts an end-to-end encoder and decoder.Dense convolutional Network(DenseNet) is introduced into the encoder,and the output of each previous layer is taken as the input of this layer,which enhances feature reuse and forward propagation while reducing the number of parameters and network computation,thus avoiding the occurrence of gradient disappearance to a certain extent.The decoder adopts the upper projection module with cavity convolution and the bilinear upper sampling module to better express the image features extracted by the encoder,and finally obtain the estimated depth map corresponding to the input image.The proposed network is trained,verified and tested on NYU Depth V2,an indoor scene depth data set.The results show that the proposed dense convolutional network structure achieves an accuracy of 0.851 and a Root Mean Square Error(RMSE) of 0.482 in the case of δ<1.25.

Keywords:	dense convolutional network monocular image encoder decoder depth estimation
本文献已被万方数据等数据库收录！
	点击此处可从《计算机工程》浏览原始摘要信息
	点击此处可从《计算机工程》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏