
Feature Extraction and Recognition of Information in Document Image (文本图像信息的提取与识别)

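Cite this article:	QIU Lisong, HUANG Jifeng. Feature Extraction and Recognition of Information in Document Image [J]. Computer and Digital Engineering (计算机与数字工程), 2013(12): 1981-1984.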

Authors:	QIU Lisong (邱立松), HUANG Jifeng (黄继风)

Affiliation:	College of Information, Mechanical and Electrical Engineering, Shanghai Normal University, Shanghai 200234

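Abstract:	Text is an important feature in many computer vision applications, and the text in an image often carries rich information. Extracting and recognizing the characters in document images is therefore of great significance for image content analysis, understanding, and information retrieval. Document image recognition consists of preprocessing, character segmentation, thinning, feature selection and extraction, and finally recognition of the candidate characters. For character segmentation, an improved projection algorithm is proposed that can substantially improve segmentation accuracy. Characters are thinned with an algorithm based on mathematical morphology, and a multi-stage classification algorithm is introduced for feature selection.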
Keywords:	preprocessing; character recognition; feature selection; multi-stage classification
Feature Extraction and Recognition of Information in Document Image

QIU Lisong, HUANG Jifeng. Feature Extraction and Recognition of Information in Document Image [J]. Computer and Digital Engineering, 2013(12): 1981-1984.

Authors:	QIU Lisong, HUANG Jifeng

Affiliation:	(College of Information, Mechanical and Electrical Engineering, Shanghai Normal University, Shanghai 200234)

Abstract:	Text is an important feature in many computer vision applications, and document images tend to contain abundant information. Feature extraction and recognition of characters in document images are of vital significance for image content analysis and understanding, as well as for information retrieval. Recognition of a document image includes the following steps: preprocessing, character segmentation, thinning, feature selection and extraction, and finally recognition of the candidate characters. An improved projection algorithm is proposed for segmentation; it can greatly improve segmentation accuracy. Mathematical morphology is used for character thinning, and a multi-stage classification algorithm is introduced for feature selection.

Keywords:	preprocessing; recognition of characters; feature extraction; multi-stage classification
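
The abstract names projection-based character segmentation but does not spell out the improved algorithm, so the block below is only a minimal sketch of the plain baseline technique: sum the foreground pixels in each column of a binarized text line and cut wherever the profile drops to empty. The function names, the `min_count` parameter, and the toy image are illustrative assumptions, not taken from the paper.

```python
from typing import List, Tuple


def vertical_projection(image: List[List[int]]) -> List[int]:
    """Count foreground pixels (value 1) in each column of a binary image."""
    if not image:
        return []
    return [sum(column) for column in zip(*image)]


def segment_columns(image: List[List[int]], min_count: int = 1) -> List[Tuple[int, int]]:
    """Return half-open (start, end) column ranges whose projection is >= min_count.

    This is the textbook projection method, not the improved variant
    proposed in the paper; min_count is a hypothetical noise threshold.
    """
    profile = vertical_projection(image)
    segments: List[Tuple[int, int]] = []
    start = None
    for x, count in enumerate(profile):
        if count >= min_count and start is None:
            start = x                      # entering a character region
        elif count < min_count and start is not None:
            segments.append((start, x))    # leaving a character region
            start = None
    if start is not None:                  # region runs to the right edge
        segments.append((start, len(profile)))
    return segments


# Toy binarized text line: three glyph-like blobs separated by blank columns.
toy = [
    [1, 1, 0, 0, 1, 0, 0, 1, 1],
    [1, 0, 0, 0, 1, 0, 0, 1, 0],
    [1, 1, 0, 0, 1, 0, 0, 1, 1],
    [0, 0, 0, 0, 0, 0, 0, 0, 0],
]

print(segment_columns(toy))
```

Raising `min_count` above 1 makes the cut tolerant of isolated noise pixels, at the risk of splitting characters that have thin strokes. Touching or multi-part characters, which are common in Chinese text, are the cases that plain projection tends to get wrong.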
This article is indexed in VIP (维普) and other databases.
