首页 | 本学科首页   官方微博 | 高级检索  
     

级联手工特征与深度特征的视频关键帧检测方法
引用本文:毋立芳,赵宽,简萌,王向东. 级联手工特征与深度特征的视频关键帧检测方法[J]. 信号处理, 2019, 35(11): 1871-1879. DOI: 10.16798/j.issn.1003-0530.2019.11.012
作者姓名:毋立芳  赵宽  简萌  王向东
作者单位:北京工业大学信息学部
基金项目:国家自然科学基金(61976010, 61802011, 61702022);北京市教委科学基金(KM201910005024);中国博士后科学基金资助项目(2018M640033);北京工业大学“日新”人才培养计划基金会
摘    要:关键帧检测是有效的视频内容分析的关键环节。常用的基于手工特征的方法运行效率高但很难有效表征关键帧特征,因而性能不好。基于深度特征的方法因为网络结构复杂,导致效率不高。在体育比赛类视频中,关键帧常为比赛转播中镜头变化的最后一帧。但广播视频中除了包含比赛视频还包括很多其他类型的镜头如中场休息、渐变镜头等。因此检测最后一帧包含很多比赛无关内容。针对这一问题,本文提出了一种手工特征与深度特征相结合的视频关键帧检测方法。首先基于颜色直方图特征进行镜头边界检测获取最后一帧。进一步基于直方图相似性提出一种类似聚类的方法得到候选关键帧。最后,基于深度神经网络对候选关键帧进行分类,得到真正的关键帧。在冰壶比赛视频和篮球比赛视频上的对比实验结果表明,相对于传统的背景差分法、光流法等,本文提出方法能够快速、可靠地提取关键帧。 

关 键 词:手工特征   深度特征   神经网络   关键帧检测   镜头分割
收稿时间:2019-08-10

Video Key Frame Detection Method by Cascaded Manual Feature and Depth Feature
Affiliation:Department of Information, Beijing University of Technology
Abstract:Key frame detection is the key link of effective video content analysis. The commonly used methods based on manual features are efficient but difficult to represent key frame features effectively, so the performance is not good. Because of the complexity of network structure, the method based on depth feature is inefficient. In sports games video, the key frame is often the last frame of shot change in the game broadcast. However, in addition to the game video, there are many other types of shots in the broadcast video, such as halftime, gradient shot and so on. So the last frame contains a lot of irrelevant content. In order to solve this problem, this paper proposes a video key frame detection method which combines manual feature and depth feature. Firstly, the last frame is obtained by shot boundary detection based on color histogram feature. Furthermore, based on histogram similarity, a similar clustering method is proposed to get candidate keyframes. Finally, the candidate keyframes are classified based on the depth neural network to get the real keyframes. The experimental results on curling match video and basketball match video show that compared with the traditional background difference method, optical flow method, etc, This method can extract key frames quickly and reliably. 
Keywords:
点击此处可从《信号处理》浏览原始摘要信息
点击此处可从《信号处理》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号