首页 | 本学科首页   官方微博 | 高级检索  
     

基于多尺度特征融合的红外单目测距算法
引用本文:刘斌,李港庆,安澄全,王水根,王建生.基于多尺度特征融合的红外单目测距算法[J].计算机应用,2022,42(3):804-809.
作者姓名:刘斌  李港庆  安澄全  王水根  王建生
作者单位:哈尔滨工程大学 信息与通信工程学院,哈尔滨 150001
艾睿光电科技有限公司,山东 烟台 264000
摘    要:由于MonoDepth2的提出,无监督单目测距在可见光领域取得了重大发展;然而在某些场景例如夜间以及一些低能见度的环境,可见光并不适用,而红外热成像可以在夜间和低能见度条件下获得清晰的目标图像,因此对于红外图像的深度估计显得尤为必要.由于可见光和红外图像的特性不同,直接将现有可见光单目深度估计算法迁移到红外图像是不合理...

关 键 词:无监督  单目测距  红外图像  双向特征金字塔网络  跨阶段部分网络
收稿时间:2021-06-02
修稿时间:2021-07-14

Infrared monocular ranging algorithm based on multiscale feature fusion
LIU Bin,LI Gangqing,AN Chengquan,WANG Shuigen,WANG Jiansheng.Infrared monocular ranging algorithm based on multiscale feature fusion[J].journal of Computer Applications,2022,42(3):804-809.
Authors:LIU Bin  LI Gangqing  AN Chengquan  WANG Shuigen  WANG Jiansheng
Affiliation:College of Information and Communication Engineering,Harbin Engineering University,Harbin Heilongjiang 150001,China
IRay Technology Company Limited,Yantai Shandong 264000,China
Abstract:Due to the introduction of MonoDepth2, unsupervised monocular ranging has made great progress in the field of visible light. However, visible light is not applicable in some scenes, such as at night and in some low-visibility environments. Infrared thermal imaging can obtain clear target images at night and under low-visibility conditions, so it is necessary to estimate the depth of infrared image. However, due to the different characteristics of visible and infrared images, it is unreasonable to migrate existing monocular depth estimation algorithms directly to infrared images. An infrared monocular ranging algorithm based on multiscale feature fusion after improving the MonoDepth2 algorithm can solve this problem. A new loss function, edge loss function, was designed for the low texture characteristic of infrared image to reduce pixel mismatch during image reprojection. The previous unsupervised monocular ranging simply upsamples the four-scale depth maps to the original image resolution to calculate projection errors, ignoring the correlation between scales and the contribution differences between different scales. A weighted Bi-directional Feature Pyramid Network (BiFPN) was applied to feature fusion of multiscale depth maps so that the blurring of depth map edge was solved. In addition, Residual Network (ResNet) structure was replaced by Cross Stage Partial Network (CSPNet) to reduce network complexity and increase operation speed. The experimental results show that edge loss is more suitable for infrared image ranging, resulting in better depth map quality. After adding BiFPN structure, the edge of depth image is clearer. After replacing ResNet with CSPNet, the inference speed is improved by about 20 percentage points. The proposed algorithm can accurately estimate the depth of the infrared image, solving the problem of depth estimation in night low-light scenes and some low-visibility scenes, and the application of this algorithm can also reduce the cost of assisted driving to a certain extent.
Keywords:unsupervised  monocular ranging  infrared image  Bi-directional Feature Pyramid Network (BiFPN)  Cross Stage Partial Network (CSPNet)  
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号