首页 | 本学科首页   官方微博 | 高级检索  
     

基于3D卷积神经网络的无参考视频质量评价
引用本文:王春峰,苏荔,张维刚,黄庆明.基于3D卷积神经网络的无参考视频质量评价[J].软件学报,2016,27(S2):103-112.
作者姓名:王春峰  苏荔  张维刚  黄庆明
作者单位:中国科学院大学 数据挖掘与知识管理重点实验室, 北京 100049,中国科学院大学 数据挖掘与知识管理重点实验室, 北京 100049;中国科学院 计算技术研究所 智能信息处理重点实验室, 北京 100190,中国科学院大学 数据挖掘与知识管理重点实验室, 北京 100049;哈尔滨工业大学(威海) 计算机与科学与技术学院, 山东 威海 264209,中国科学院大学 数据挖掘与知识管理重点实验室, 北京 100049;中国科学院 计算技术研究所 智能信息处理重点实验室, 北京 100190
基金项目:国家自然科学基金(61025011,61332016,61472389);国家重点基础研究发展计划(973)(2015CB351802)
摘    要:无参考视频质量评价(NR-VQA)在无法获得原始高质量视频参照的前提下,对失真视频的视觉质量进行定量度量.常规NR-VQA方法通常针对特定失真类型设计,或者与人的主观感受存在偏差.首次将3D深度卷积神经网络(3D-CNN)引入到了视频质量评价中,提出了一种基于3D-CNN的无参考视频质量评价方法,可以适用于非特定失真类型的NR-VQA.首先,通过3D块来有效学习和表征视频内容的时空特征.其次,对常规的3D卷积网络模型进行改进,使其适用于视频质量评价的任务.实验结果表明,所提出的方法在多种失真类型和多个测试指标上,与人的主观感知一致性较高.作为无参考视频质量评价方法,其性能与许多全参考评价方法具有可比性,同时比主流的NR-VQA方法具有更快的运行速度,这使得所提模型在实际中具有更好的应用前景.

关 键 词:视频质量评价  3D  深度卷积神经网络  无参考  全参考
收稿时间:5/1/2016 12:00:00 AM
修稿时间:2016/10/18 0:00:00

No Reference Video Quality Assessment Based on 3D Convolutional Neural Network
WANG Chun-Feng,SU Li,ZHANG Wei-Gang and HUANG Qing-Ming.No Reference Video Quality Assessment Based on 3D Convolutional Neural Network[J].Journal of Software,2016,27(S2):103-112.
Authors:WANG Chun-Feng  SU Li  ZHANG Wei-Gang and HUANG Qing-Ming
Affiliation:Key Laboratory on Big Data Mining and Knowledge Management, University of Chinese Academy of Sciences, Beijing 100049, China,Key Laboratory on Big Data Mining and Knowledge Management, University of Chinese Academy of Sciences, Beijing 100049, China;Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, The Chinese Academy of Sciences, Beijing 100190, China,Key Laboratory on Big Data Mining and Knowledge Management, University of Chinese Academy of Sciences, Beijing 100049, China;Institute of Computer Science and Technology, Harbin Institute of Technology(Weihai), Weihai 264209, China and Key Laboratory on Big Data Mining and Knowledge Management, University of Chinese Academy of Sciences, Beijing 100049, China;Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, The Chinese Academy of Sciences, Beijing 100190, China
Abstract:No reference video quality assessment (NR-VQA) measures distorted videos quantitatively without the reference of original high quality videos. Conventional NR-VQA methods are generally designed for specific types of distortions, or not consistent with human''s perception. This paper innovatively introduces 3D deep convolutional neural network (3D-CNN) into VQA and proposes a 3D-CNN based NR-VQA method, which is universal for non-specific types of distortions. First, the proposed method utilizes 3D patches to learn spatio-temporal features that represent video content effectively. Second, the original 3D-CNN model is modified which is used to classify videos to make it adapt to VQA task. Experiments demonstrate that the proposed method is highly consistent with human''s perception across numerous distortions and metrics. Compared with other state-of-the-art no-reference VQA methods, the proposed method runs much faster while keeping the similar performance. As a no-reference VQA method, it is even comparable with many of the state-of-the-art full-reference VQA methods, which provides the proposed method with better application prospects.
Keywords:video quality assessment  3D  convolutional neural network  no reference  full reference
点击此处可从《软件学报》浏览原始摘要信息
点击此处可从《软件学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号