基于层级特征融合的单目深度估计算法 Monocular depth estimation algorithm based on hierarchical integration期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于层级特征融合的单目深度估计算法

引用本文：	郑秋梅,于涛,王风华,林超.基于层级特征融合的单目深度估计算法[J].光电子．激光,2022,33(9):925-931.

作者姓名：	郑秋梅于涛王风华林超

作者单位：	中国石油大学华东计算机科学与技术学院，山东青岛 266580,中国石油大学华东计算机科学与技术学院，山东青岛 266580,中国石油大学华东计算机科学与技术学院，山东青岛 266580,中国石油大学华东信息化建设处，山东青岛 266580

基金项目：	国家自然科学基金(52074341,51874340)和中央高校基本科研业务费专项资金资助(19CX02030A)资助项目

摘要：	MonoDepth2的提出使自监督单目深度估计取得了重大的进展,但该网络在大的无语义区域和边界处预测效果并不理想, 主要原因是基础的U-Net框架没有充分利用多尺度特征信息,导致来自于大梯度区域的深度估计较差。针对此问题,本文提出了一个改进的DepthNet,层级特征融合网络(hierarchical integration net,HINet)。优化了U-Net网络结构,使编码器端在每一层都能产生不同尺度的特征信息,从而让解码器端在每一层都能够充分融合多尺度特征。由于不同尺度的特征信息对于特定的解码器层都有不同程度的贡献,本文提出的层级特征融合算法还增加了通道注意力模块,提升重要特征尺度的权重。当采用立体图像对进行训练时,本文对数据进行了预处理,并增加了立体对的深度暗示损失函数。在KI TTI 数据集上的实验结果表明,所有指标均获得了不同程度的提升,其中绝对相对误差减少了0.09,平方相对误差减少了0.093。
关键词：	单目深度特征提取 U-Net 自监督层级融合
收稿时间：	2022/1/27 0:00:00
修稿时间：	2022/2/20 0:00:00
Monocular depth estimation algorithm based on hierarchical integration

ZHENG Qiumei,YU Tao,WANG Fenghua and LIN Chao.Monocular depth estimation algorithm based on hierarchical integration[J].Journal of Optoelectronics·laser,2022,33(9):925-931.

Authors:	ZHENG Qiumei YU Tao WANG Fenghua and LIN Chao

Affiliation:	College of Computer Science and Technology,China University of Petroleum Huadon g,Qingdao,Shandong 266580, China,College of Computer Science and Technology,China University of Petroleum Huadon g,Qingdao,Shandong 266580, China,College of Computer Science and Technology,China University of Petroleum Huadon g,Qingdao,Shandong 266580, China and Information Construction Department,China Un iversity of Petroleum Huadong,Qingdao,Shandong 266580, China

Abstract:	The proposal of MonoDepth2 has made significant progress in self-supe rvised monocular depth estimation,but the prediction effect of the network in large non semantic regions and boundarie s is not ideal.The main reason is that the basic U-Net framework does not make full use of multi-scale feature informatio n,resulting in poor depth estimation from large gradient regions.To address this problem,this paper proposed an improved DepthNet,a hierarchical integration net (HINet).The U-Net network structure is optimized so that the encoder side can generate feature information of different scales at each layer,thus allowing the decoder side to fully fuse multi-scale features at each layer.Since the feature information of different scales contributes to a specific decoder layer to diffe rent degrees,the hierarchical integration (HINet) algorithm proposed in this paper also adds a channel attention module to enhance the weight of important feature scales.When stereo pairs are used for training,this paper preprocesses the dat a and adds a depth-implying loss function for stereo pairs.The experimental results on the KITTI dataset show that all indicators are improved to varying degrees,in which the absolute relative error is reduced by 0.09 and the squared relative error is reduced by 0.093.

Keywords:	monocular depth feature extraction U-Net self-supervisied hierarchical in tegration (HINet)

	点击此处可从《光电子．激光》浏览原始摘要信息
	点击此处可从《光电子．激光》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏