首页 | 本学科首页   官方微博 | 高级检索  
     

文本图像信息的提取与识别
引用本文:邱立松,黄继风.文本图像信息的提取与识别[J].计算机与数字工程,2013(12):1981-1984.
作者姓名:邱立松  黄继风
作者单位:上海师范大学信息与机电工程学院,上海200234
摘    要:文本是计算机视觉的许多应用中的一项重要特征,图像中的文本往往包含着比较丰富的信息,将文本图像信息里的文字进行提取和识别,对于图像内容的分析、理解、信息检索等方面具有重要的意义。文本图像的识别分为预处理,文字的切分,细化,特征选择与提取,最后对候选文字进行识别。在文字的切分方面提出了一种改进的投影算法,该算法能在很大程度上提高文字切分的准确度,采用基于数学形态学算法对文字进行细化处理,并在特征选择方面引用了多级分类的算法。

关 键 词:预处理  文字识别  特征选择  多级分类

Feature Extraction and Recognition of Information in Document Image
QIU Lisong,HUANG Jifeng.Feature Extraction and Recognition of Information in Document Image[J].Computer and Digital Engineering,2013(12):1981-1984.
Authors:QIU Lisong  HUANG Jifeng
Affiliation:(College of Information, Mechanical and Electrical Engineering, Shanghai Normal University, Shanghai 200234)
Abstract:Document is an important feature in many computer vision applications. The document image tend to contain more abundant information. Feature extraction and recognition of characters in document image have the vital significance in image content analysis and un- derstanding, even in information retrieval. The recognition of document image includes the following steps: characters preprocessing, seg- mentation, thinning, feature selection and extraction, finally the recognition of candidate words. An improved projection algorithm is pro- posed. This algorithm can greatly improve the segmentation accuracy. Mathematical morphology proposed is used to characters thinning, and a multi-stage classification algorithm is introduced in feature selection.
Keywords:preprocessing  recognition of characters  feature extraction  multi-stage classification
本文献已被 维普 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号