首页 | 本学科首页   官方微博 | 高级检索  
     

基于播音员识别的新闻视频故事分割方法
引用本文:徐新文,李国辉,甘亚莉.基于播音员识别的新闻视频故事分割方法[J].计算机工程与应用,2008,44(19):4-7.
作者姓名:徐新文  李国辉  甘亚莉
作者单位:国防科技大学信息系统与管理学院,长沙,410073
基金项目:国家自然科学基金 , 高等学校博士学科点专项科研项目
摘    要:新闻视频的语义单元分割是基于内容的新闻视频检索和情报挖掘的重要步骤,受到众多研究者的关注。提出了一种基于播音员识别的新闻视频故事单分割的新方法,首先从新闻节目中提取各播音员的声学感知特征的作为其声纹,训练出其相应的混合高斯模型(GMM),并采用KL差异法从视频镜头中探测出各播音员和非播音员音频镜头,最后结合视频字幕帧事件和新闻节目特殊的结构知识对新闻节目进行故事单元分割。在2个多小时的CCTV和CNN新闻视频实验中获得96.02%查准率和92.58%的查全率。

关 键 词:播音员声纹  故事单元分割  高斯混合模型  新闻视频
收稿时间:2008-2-18
修稿时间:2008-4-1  

Segmentation method of news video stories based on announcer identification
XU Xin-wen,Ll Guo-hui,GAN Ya-li.Segmentation method of news video stories based on announcer identification[J].Computer Engineering and Applications,2008,44(19):4-7.
Authors:XU Xin-wen  Ll Guo-hui  GAN Ya-li
Affiliation:School of Information System and Management,National University of Defense Technology,Changsha 410073,China
Abstract:As an important step of content based news video retrieving and information mining,semantic unit segmentation has attracted many researchers’ interests.This paper focuses on a new method of news video stories segmentation which is based on the announcer identification.Firstly,the voiceprints including acoustic perception characteristics of each announcer are extracted,and their Gaussian mixture models are trained,then the audio shots of announcer and not-announcer are detected by the KL divergence method,at last the unit segmenting is carried on under the guidance of video topic caption frames and special structure knowledge of news program.Finally the 92.58% recall and the 96.02% precision are achieved during more than 2 hours’ experiment.
Keywords:voiceprint  story unit segmentation  Gaussian mixture model  news video
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《计算机工程与应用》浏览原始摘要信息
点击此处可从《计算机工程与应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号