
A Method for Automatic Video Text Locating, Tracking and Recognition

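Cite this article:	LI Zhao-hui, YU Ying-lin. A Method for Automatic Video Text Locating, Tracking and Recognition[J]. Journal of Image and Graphics (中国图象图形学报), 2005, 10(4): 457-462, i003.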

Authors:	LI Zhao-hui (李朝晖), YU Ying-lin (余英林)

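Affiliations:	[1] Department of Computer Science, School of Information, Guangzhou University, Guangzhou 510405; Department of Electronic and Communication Engineering, South China University of Technology, Guangzhou 510641 [2] Department of Electronic and Communication Engineering, South China University of Technology, Guangzhou 510641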

Funding:	National Natural Science Foundation of China (60372068); Science Foundation of Guangdong Province (011628)

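Abstract:	Text in video data can provide important semantic information. This paper proposes a method for automatically locating, tracking, and recognizing video text. First, the position of the text in a video frame is detected using an approach based on wavelets and LH. Motion estimation is then used to track the text position in subsequent frames, and multi-frame averaging is used to enhance the text region. Finally, after binarization and connected component analysis, the text characters are passed to OCR software for recognition. Experimental results show that the method is simple to implement, locates and tracks text regions quickly, and achieves good locating accuracy and recognition results.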
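Keywords:	automatic locating; text tracking; binarization; OCR software; semantic information; video data; motion estimation; multi-frame averaging; component analysis; recognition performance; locating accuracy; video frame; character
Article ID:	1006-8961(2005)04-0457-06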
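The enhancement and cleanup stages named in the abstract (multi-frame averaging, binarization, connected component analysis) can be illustrated with a minimal sketch. This is not the authors' implementation: the fixed threshold, the 4-connectivity, the minimum component size, and all function names below are assumptions chosen for illustration, since the abstract does not specify them.

```python
from collections import deque


def average_frames(frames):
    """Pixel-wise mean of equally sized grayscale frames (lists of rows)."""
    n = len(frames)
    h, w = len(frames[0]), len(frames[0][0])
    return [[sum(f[y][x] for f in frames) / n for x in range(w)]
            for y in range(h)]


def binarize(image, threshold):
    """Mark pixels brighter than `threshold` as text (1), others as 0.
    A fixed global threshold is an assumption, not the paper's rule."""
    return [[1 if v > threshold else 0 for v in row] for row in image]


def connected_components(binary):
    """Return 4-connected components, each a list of (y, x) pixels."""
    h, w = len(binary), len(binary[0])
    seen = [[False] * w for _ in range(h)]
    components = []
    for y in range(h):
        for x in range(w):
            if binary[y][x] == 0 or seen[y][x]:
                continue
            # Breadth-first flood fill from an unvisited text pixel.
            queue, pixels = deque([(y, x)]), []
            seen[y][x] = True
            while queue:
                cy, cx = queue.popleft()
                pixels.append((cy, cx))
                for ny, nx in ((cy - 1, cx), (cy + 1, cx),
                               (cy, cx - 1), (cy, cx + 1)):
                    if (0 <= ny < h and 0 <= nx < w
                            and binary[ny][nx] and not seen[ny][nx]):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            components.append(pixels)
    return components


def clean_text_region(frames, threshold=128, min_size=2):
    """Average the tracked region over frames, binarize it, and keep
    only components with at least `min_size` pixels (hypothetical rule)."""
    binary = binarize(average_frames(frames), threshold)
    return [c for c in connected_components(binary) if len(c) >= min_size]
```

In this sketch, averaging suppresses background pixels that are bright in only some frames, while static text pixels keep their intensity. The size filter then discards isolated specks before the remaining components would be handed to an OCR engine.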
An Algorithm of Automatic Video Text Locating, Tracking and Recognition

LI Zhao-hui and YU Ying-lin. An Algorithm of Automatic Video Text Locating, Tracking and Recognition[J]. Journal of Image and Graphics, 2005, 10(4): 457-462, i003.

Authors:	LI Zhao-hui and YU Ying-lin

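Affiliation:	[1] Department of Computer Science, School of Information, Guangzhou University, Guangzhou 510405; Department of Electronic and Communication Engineering, South China University of Technology, Guangzhou 510641 [2] Department of Electronic and Communication Engineering, South China University of Technology, Guangzhou 510641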

Abstract:	Text in video can provide an important supplemental source of semantic information for indexing. This paper presents an algorithm for automatic video text locating, tracking and recognition. First, the text regions are located in several steps: wavelet decomposition, high-frequency component intensity and density detection, horizontal and vertical convex detection based on LH, and text locating. The text regions are then tracked across the subsequent consecutive frames and enhanced by multi-frame averaging. Binarization of the enhanced text regions, followed by component analysis, yields text regions with a clean background. Finally, the text regions are recognized by OCR software to obtain the final text strings. Experimental results show that the proposed algorithm can detect and track text regions simply and effectively.

Keywords:	text detecting; semantic context; video index
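The first locating steps listed in the abstract (wavelet decomposition, then high-frequency intensity and density detection) can be sketched as follows. This is only an illustration under stated assumptions: it uses a one-level Haar transform in its averaging form, sums the absolute values of the three detail bands as the "intensity", and measures "density" per row. The wavelet, the thresholds, and every function name here are hypothetical, and the abstract's LH-based convex detection step is not reproduced.

```python
def haar_level1(image):
    """One-level 2D Haar transform (averaging form) of a grayscale image
    with even height and width. Returns the approximation band and three
    detail bands, each half the input size in both dimensions."""
    h, w = len(image) // 2, len(image[0]) // 2
    approx = [[0.0] * w for _ in range(h)]
    col_diff = [[0.0] * w for _ in range(h)]   # change across columns
    row_diff = [[0.0] * w for _ in range(h)]   # change across rows
    diag_diff = [[0.0] * w for _ in range(h)]  # diagonal change
    for y in range(h):
        for x in range(w):
            a, b = image[2 * y][2 * x], image[2 * y][2 * x + 1]
            c, d = image[2 * y + 1][2 * x], image[2 * y + 1][2 * x + 1]
            approx[y][x] = (a + b + c + d) / 4
            col_diff[y][x] = (a - b + c - d) / 4
            row_diff[y][x] = (a + b - c - d) / 4
            diag_diff[y][x] = (a - b - c + d) / 4
    return approx, col_diff, row_diff, diag_diff


def high_freq_intensity(image):
    """Sum of absolute detail coefficients at each half-resolution pixel."""
    _, cd, rd, dd = haar_level1(image)
    return [[abs(cd[y][x]) + abs(rd[y][x]) + abs(dd[y][x])
             for x in range(len(cd[0]))] for y in range(len(cd))]


def text_candidate_rows(image, intensity_threshold, density_threshold):
    """Indices of half-resolution rows in which the fraction of strong
    high-frequency coefficients reaches `density_threshold`."""
    rows = []
    for y, row in enumerate(high_freq_intensity(image)):
        strong = sum(1 for v in row if v >= intensity_threshold)
        if strong / len(row) >= density_threshold:
            rows.append(y)
    return rows
```

Text strokes produce dense, strong intensity transitions, so rows that cross a caption tend to contain many large detail coefficients, while flat background rows contain few. Candidate rows found this way would still need the remaining steps of the abstract to become text boxes.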
This article is indexed in CNKI, VIP (维普), Wanfang Data (万方数据), and other databases.