首页 | 本学科首页   官方微博 | 高级检索  
     

一种视频文本自动定位、跟踪和识别的方法
引用本文:李朝晖,余英林.一种视频文本自动定位、跟踪和识别的方法[J].中国图象图形学报,2005,10(4):457-462,i003.
作者姓名:李朝晖  余英林
作者单位:[1]广州大学信息学院计算机系,广州510405//华南理工大学电子与通信工程系,广州510641 [2]华南理工大学电子与通信工程系,广州510641
基金项目:国家自然科学基金项目(60372068),广东省科学基金项目(011628)
摘    要:视频数据中的文本能提供重要的语义信息。本文提出了一种视频文本自动定位、跟踪和识别的方法,首先用基于小波和LH检测视频帧文本所在的位置,然后用运动估计的方法,跟踪后继帧文本的位置,再用多帧平均的方法增强文本区域,最后经过二值化处理和连通分量分析,将文本字符送入OCR软件进行识别。实验结果表明,该方法简单易行,能快速地定位和跟踪文本区域,定位精度和识别效果良好。

关 键 词:自动定位  文本  跟踪  二值化处理  OCR软件  语义信息  视频数据  运动估计  多帧平均  分量分析  识别效果  定位精度  视频帧  字符
文章编号:1006-8961(2005)04-0457-06

An Algorithm of Automatic Video Text Locating, Tracking and Recognition
LI Zhao-hui and LI Zhao-hui.An Algorithm of Automatic Video Text Locating, Tracking and Recognition[J].Journal of Image and Graphics,2005,10(4):457-462,i003.
Authors:LI Zhao-hui~ and LI Zhao-hui~
Affiliation:LI Zhao-hui~
Abstract:Text in video can provide an important supplemental source of index semantic information. In this paper, an algorithm of automatic video text locating, tracking and recognition is presented. First, the text regions are located by several steps: wavelet decomposition, high frequency component intensity and density detection, horizontal and vertical convex detection based LH, and text locating. Then the text regions are tracked in next consecutive frames. After multiple frames averaging, the text regions are enhanced. By binarization of the enhanced text regions followed by component analysis, the text regions with clean background are obtained. Then the text regions are recognized by OCR software, the final text strings are attained. Experimental results show that the proposed algorithm can detect and track text region simply and effectively.
Keywords:text detecting  semantic context  video index
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《中国图象图形学报》浏览原始摘要信息
点击此处可从《中国图象图形学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号