首页 | 本学科首页   官方微博 | 高级检索  
     

基于多模态特征融合的新闻故事单元分割
引用本文:刘嘉琦,封化民,闫建鹏.基于多模态特征融合的新闻故事单元分割[J].计算机工程,2012,38(24):161-165.
作者姓名:刘嘉琦  封化民  闫建鹏
作者单位:1. 西安电子科技大学通信工程学院,西安,710071
2. 西安电子科技大学通信工程学院,西安710071;北京电子科技学院,北京100070
基金项目:国家自然科学基金资助项目,北京市自然科学基金资助项目
摘    要:对新闻视频进行结构分析,提出一种基于多模态特征融合的新闻故事单元分割方法。将新闻视频分割成音频流和视频流,选择静音区间为音频候选点,将镜头边界切变点作为视频候选点,做主持人镜头和主题字幕的探测,挑选主持人镜头为候选区间,并记录主题字幕的起始位置和结束位置,利用时间轴融合音频候选点、视频候选点、主持人镜头和主题字幕,对新闻视频进行故事单元分割。实验结果表明,该方法的查全率为83.18%,查准率为83.92%。

关 键 词:新闻视频  多模态特征  字幕  音频  故事单元分割
收稿时间:2011-11-22
修稿时间:2012-02-10

News Story Unit Segmentation Based on Multi-modal Feature Fusion
LIU Jia-qi , FENG Hua-min , YAN Jian-peng.News Story Unit Segmentation Based on Multi-modal Feature Fusion[J].Computer Engineering,2012,38(24):161-165.
Authors:LIU Jia-qi  FENG Hua-min  YAN Jian-peng
Affiliation:1(1.School of Telecommunication Engineering,Xidian University,Xi’an 710071,China;2.Beijing Electronic Science and Technology Institution,Beijing 100070,China)
Abstract:News story unit segmentation method based on multi-modal feature fusion is proposed in this paper by analyzing news video structure. News video is divided into audio stream and video stream. Mute intervals are detected as audio candidate points, and the shot segmentations for news video are detected and shot boundary points are chosen as video candidate points, anchorperson shot and topic caption are detected. Story units are detected by fusing audio candidate points, video candidate points, anchorperson shot and topic caption based on time axis. Experimental results show that this method can get 83.18% in recall and 83.92% in precision.
Keywords:news videom  ulti-modal feature  caption  audio  story unit segmentation
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《计算机工程》浏览原始摘要信息
点击此处可从《计算机工程》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号