基于多尺度变形卷积的特征金字塔光流计算方法 A Feature Pyramid Optical Flow Estimation Method Based on Multi-scale Deformable Convolution期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于多尺度变形卷积的特征金字塔光流计算方法

引用本文：	范兵兵, 葛利跃, 张聪炫, 李兵, 冯诚, 陈震. 基于多尺度变形卷积的特征金字塔光流计算方法. 自动化学报, 2023, 49(1): 197−209 doi: 10.16383/j.aas.c220142

作者姓名：	范兵兵葛利跃张聪炫李兵冯诚陈震

作者单位：	1.南昌航空大学测试与光电工程学院南昌 330063;;2.南昌航空大学江西省图像处理与模式识别重点实验室南昌 330063;;3.中国科学院自动化研究所模式识别国家重点实验室北京 100190;;4.北京航空航天大学仪器科学与光电工程学院北京 100083

基金项目：	科技创新2030“新一代人工智能” 重大项目(2020AAA0105802, 2020AAA0105801, 2020AAA0105800), 国家重点研发计划 (2020 YFC2003800), 国家自然科学基金 (62222206, 62272209, 61866026, 61772255, 61866025), 江西省技术创新引导类计划项目(20212AEI 91005), 江西省教育厅科学技术项目 (GJJ210910), 江西省优势科技创新团队 (20165BCB19007), 江西省自然科学基金重点项目 (20202ACB214007), 江西省图像处理与模式识别重点实验室开放基金 (ET202104413) 资助

摘要：	针对现有深度学习光流计算方法的运动边缘模糊问题, 提出了一种基于多尺度变形卷积的特征金字塔光流计算方法. 首先, 构造基于多尺度变形卷积的特征提取模型, 显著提高图像边缘区域特征提取的准确性; 然后, 将多尺度变形卷积特征提取模型与特征金字塔光流计算网络耦合, 提出一种基于多尺度变形卷积的特征金字塔光流计算模型; 最后, 设计一种结合图像与运动边缘约束的混合损失函数, 通过指导模型学习更加精准的边缘信息, 克服了光流计算运动边缘模糊问题. 分别采用 MPI-Sintel 和 KITTI2015 测试图像集对该方法与代表性的深度学习光流计算方法进行综合对比分析. 实验结果表明, 该方法具有更高的光流计算精度, 有效解决了光流计算的边缘模糊问题.
关键词：	光流深度学习变形卷积特征金字塔边缘保护
收稿时间：	2022-03-05
A Feature Pyramid Optical Flow Estimation Method Based on Multi-scale Deformable Convolution

Fan Bing-Bing, Ge Li-Yue, Zhang Cong-Xuan, Li Bing, Feng Cheng, Chen Zhen. A feature pyramid optical flow estimation method based on multi-scale deformable convolution. Acta Automatica Sinica, 2023, 49(1): 197−209 doi: 10.16383/j.aas.c220142

Authors:	FAN Bing-Bing GE Li-Yue ZHANG Cong-Xuan LI Bing FENG Cheng CHEN Zhen

Affiliation:	1. School of Measuring and Optical Engineering, Nanchang Hangkong University, Nanchang 330063;;2. Key Laboratory of Jiangxi Province for Image Processing and Pattern Recognition, Nanchang Hangkong University, Nanchang 330063;;3. National Key Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100190;;4. School of Instrumentation and Optoelectronic Engineering, Beihang University, Beijing 100083

Abstract:	To cope with the issue of edge-blurring caused by the existing deep-learning based optical flow estimation methods, this paper proposes a feature pyramid optical flow estimation method based on multi-scale deformation convolution. Firstly, a feature extraction module based on multi-scale deformable convolution is constructed to improve the accuracy of feature extraction in the regions of image edges. Secondly, by coupling the multi-scale deformable convolution feature extraction module with the feature pyramid based optical flow estimation network, a feature pyramid optical flow estimation model based on multi-scale deformable convolution is presented. Thirdly, a hybrid loss function combining the constraints of image and motion edges is designed, which addresses the issue of edge-blurring by guiding the optical flow model to learn more accurate edge information. Finally, the MPI-Sintel and KITTI2015 test datasets are used for conducting a comprehensive comparison between the proposed method and some representative deep-learning based optical flow estimation methods. The experimental results indicate that the proposed method achieves higher computational accuracy, and overcomes the issue of edge-blurring in optical flow estimation.

Keywords:	Optical flow deep-learning deformable convolution feature pyramid edge-preserving

	点击此处可从《自动化学报》浏览原始摘要信息
	点击此处可从《自动化学报》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏