首页 | 本学科首页   官方微博 | 高级检索  
     

基于多尺度变形卷积的特征金字塔光流计算方法
引用本文:范兵兵, 葛利跃, 张聪炫, 李兵, 冯诚, 陈震. 基于多尺度变形卷积的特征金字塔光流计算方法. 自动化学报, 2023, 49(1): 197−209 doi: 10.16383/j.aas.c220142
作者姓名:范兵兵  葛利跃  张聪炫  李兵  冯诚  陈震
作者单位:1.南昌航空大学测试与光电工程学院 南昌 330063;;2.南昌航空大学江西省图像处理与模式识别重点实验室 南昌 330063;;3.中国科学院自动化研究所模式识别国家重点实验室 北京 100190;;4.北京航空航天大学仪器科学与光电工程学院 北京 100083
基金项目:科技创新2030“新一代人工智能” 重大项目(2020AAA0105802, 2020AAA0105801, 2020AAA0105800), 国家重点研发计划 (2020 YFC2003800), 国家自然科学基金 (62222206, 62272209, 61866026, 61772255, 61866025), 江西省技术创新引导类计划项目(20212AEI 91005), 江西省教育厅科学技术项目 (GJJ210910), 江西省优势科技创新团队 (20165BCB19007), 江西省自然科学基金重点项目 (20202ACB214007), 江西省图像处理与模式识别重点实验室开放基金 (ET202104413) 资助
摘    要:针对现有深度学习光流计算方法的运动边缘模糊问题, 提出了一种基于多尺度变形卷积的特征金字塔光流计算方法. 首先, 构造基于多尺度变形卷积的特征提取模型, 显著提高图像边缘区域特征提取的准确性; 然后, 将多尺度变形卷积特征提取模型与特征金字塔光流计算网络耦合, 提出一种基于多尺度变形卷积的特征金字塔光流计算模型; 最后, 设计一种结合图像与运动边缘约束的混合损失函数, 通过指导模型学习更加精准的边缘信息, 克服了光流计算运动边缘模糊问题. 分别采用 MPI-Sintel 和 KITTI2015 测试图像集对该方法与代表性的深度学习光流计算方法进行综合对比分析. 实验结果表明, 该方法具有更高的光流计算精度, 有效解决了光流计算的边缘模糊问题.

关 键 词:光流   深度学习   变形卷积   特征金字塔   边缘保护
收稿时间:2022-03-05

A Feature Pyramid Optical Flow Estimation Method Based on Multi-scale Deformable Convolution
Fan Bing-Bing, Ge Li-Yue, Zhang Cong-Xuan, Li Bing, Feng Cheng, Chen Zhen. A feature pyramid optical flow estimation method based on multi-scale deformable convolution. Acta Automatica Sinica, 2023, 49(1): 197−209 doi: 10.16383/j.aas.c220142
Authors:FAN Bing-Bing  GE Li-Yue  ZHANG Cong-Xuan  LI Bing  FENG Cheng  CHEN Zhen
Affiliation:1. School of Measuring and Optical Engineering, Nanchang Hangkong University, Nanchang 330063;;2. Key Laboratory of Jiangxi Province for Image Processing and Pattern Recognition, Nanchang Hangkong University, Nanchang 330063;;3. National Key Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100190;;4. School of Instrumentation and Optoelectronic Engineering, Beihang University, Beijing 100083
Abstract:To cope with the issue of edge-blurring caused by the existing deep-learning based optical flow estimation methods, this paper proposes a feature pyramid optical flow estimation method based on multi-scale deformation convolution. Firstly, a feature extraction module based on multi-scale deformable convolution is constructed to improve the accuracy of feature extraction in the regions of image edges. Secondly, by coupling the multi-scale deformable convolution feature extraction module with the feature pyramid based optical flow estimation network, a feature pyramid optical flow estimation model based on multi-scale deformable convolution is presented. Thirdly, a hybrid loss function combining the constraints of image and motion edges is designed, which addresses the issue of edge-blurring by guiding the optical flow model to learn more accurate edge information. Finally, the MPI-Sintel and KITTI2015 test datasets are used for conducting a comprehensive comparison between the proposed method and some representative deep-learning based optical flow estimation methods. The experimental results indicate that the proposed method achieves higher computational accuracy, and overcomes the issue of edge-blurring in optical flow estimation.
Keywords:Optical flow  deep-learning  deformable convolution  feature pyramid  edge-preserving
点击此处可从《自动化学报》浏览原始摘要信息
点击此处可从《自动化学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号