
Automatic Parsing of News Video Using Multimodal Analysis (基于多模式分析自动解析新闻视频)

Cite this article:	Wang Weiqiang, Gao Wen. Automatic parsing of news video using multimodal analysis [J]. Journal of Software (软件学报), 2001, 12(9): 1271-1278.

Authors:	Wang Weiqiang (王伟强), Gao Wen (高文)

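Affiliations:	1. Institute of Computing Technology, Chinese Academy of Sciences 2. Institute of Computing Technology, Chinese Academy of Sciences; Department of Computer Science and Engineering, Harbin Institute of Technology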

Funding:	Supported by the National Natural Science Foundation of China under Grant No.69789301; the National High Technology Development Program of China (863 Program) under Grant No.863-306-ZT03-01-2

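Abstract (translated from the Chinese):	A method is proposed that combines visual, audio, and textual information to parse news video automatically. The extraction of audio features and the algorithm that integrates multimodal information to parse news video are discussed in detail. Using multiple modalities effectively compensates for the shortcomings of segmenting news items by image analysis alone, which makes the method more widely applicable to news items presented in different ways. On a test set of 184,100 frames, the system achieved 95.1% recall and 93.3% accuracy in detecting news item boundaries. The experimental results demonstrate the effectiveness and robustness of the method.
Keywords (translated from the Chinese):	MPEG-2 video; automatic segmentation of news items; audio and visual information analysis; anchor shot; caption text
Received:	2000/10/24
Revised:	2000/10/24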
Automatic Parsing of News Video Using Multimodal Analysis

WANG Wei-qiang and GAO Wen. Automatic Parsing of News Video Using Multimodal Analysis [J]. Journal of Software, 2001, 12(9): 1271-1278.

Authors:	WANG Wei-qiang and GAO Wen

Abstract:	This paper presents an approach that exploits multimodal information (video, audio, and text) to parse news video automatically. Audio feature extraction and the scheme for integrating multimodal information are addressed in detail. Integrating multiple information sources overcomes the weakness of approaches that rely on image analysis alone, so the approach adapts to a wider variety of forms in which news items appear. On test data of 184,100 frames, the system detects boundaries between news items with a recall of 95.1% and an accuracy of 93.3%. The experimental results show that the approach is effective and robust.
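
The abstract reports recall and accuracy for news item boundary detection but does not say how detections are matched to ground truth. The following is only a minimal sketch of one common way such figures could be computed. It assumes that the reported "accuracy" is a precision-style ratio (correct detections over all detections), that a detection counts as correct when it falls within a tolerance window of an unmatched true boundary, and that matching is greedy. The function names, the `tolerance` parameter, and the toy frame numbers are hypothetical and are not taken from the paper.

```python
def match_boundaries(detected, truth, tolerance=0):
    """Greedily match detected boundary frames to ground-truth frames.

    Assumption: a detection is correct if it lies within `tolerance`
    frames of a ground-truth boundary that has not been matched yet.
    Returns the number of correct detections.
    """
    unmatched = sorted(truth)
    hits = 0
    for d in sorted(detected):
        for t in unmatched:
            if abs(d - t) <= tolerance:
                unmatched.remove(t)  # each true boundary is matched at most once
                hits += 1
                break
    return hits


def recall_precision(detected, truth, tolerance=0):
    """Return (recall, precision) for a list of detected boundaries."""
    hits = match_boundaries(detected, truth, tolerance)
    recall = hits / len(truth) if truth else 0.0
    precision = hits / len(detected) if detected else 0.0
    return recall, precision


# Toy frame numbers, invented for illustration only (not data from the paper).
truth = [100, 500, 900, 1300]
detected = [102, 498, 1300, 1700, 2000]
recall, precision = recall_precision(detected, truth, tolerance=5)
print(f"recall={recall:.2f}, precision={precision:.2f}")
```

In this toy case three of the four true boundaries are found and three of the five detections are correct. A missed boundary lowers recall, while a spurious boundary lowers the precision-style ratio, which is why both figures are usually reported together.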

Keywords:	MPEG-2 video; automatic segmentation of news items; audio and visual information analysis; anchor shot; caption detection
This article is indexed in CNKI, VIP, Wanfang Data, and other databases.