首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
The task of handwritten Chinese character recognition is one of the most challenging areas of human handwriting classification. The main reason for this is related to the writing system itself which encompasses thousands of characters, coupled with high levels of diversity in personal writing styles and attributes. Much of the existing work for both online and off-line handwritten Chinese character recognition has focused on methods which employ feature extraction and segmentation steps. The preprocessed data from these steps form the basis for the subsequent classification and recognition phases. This paper proposes an approach for handwritten Chinese character recognition and classification using only an image alignment technique and does not require the aforementioned steps. Rather than extracting features from the image, which often means building models from very large training data, the proposed method instead uses the mean image transformations as a basis for model building. The use of an image-only model means that no subjective tuning of the feature extraction is required. In addition by employing a fuzzy-entropy-based metric, the work also entails improved ability to model different types of uncertainty. The classifier is a simple distance-based nearest neighbour classification system based on template matching. The approach is applied to a publicly available real-world database of handwritten Chinese characters and demonstrates that it can achieve high classification accuracy and is robust in the presence of noise.  相似文献   

2.
This paper proposes a novel illumination-robust face recognition technique that combines the statistical global illumination transformation and the non-statistical local face representation methods. When a new face image with arbitrary illumination is given, it is transformed into a number of face images exhibiting different illuminations using a statistical bilinear model-based indirect illumination transformation. Each illumination transformed image is then represented by a histogram sequence that concatenates the histograms of the non-statistical multi-resolution uniform local Gabor binary patterns (MULGBP) for all the local regions. This is facilitated by dividing the input image into several regular local regions, converting each local region using several Gabor filters, and converting each Gabor filtered region image into multi-resolution local binary patterns (MULBP). Finally, face recognition is performed by a simple histogram matching process. Experimental results demonstrate that the proposed face recognition method is highly robust to illumination variation as exhibited in the real environment.  相似文献   

3.
A novel segment confidence-based binary segmentation (SCBS) for cursive handwritten words is presented in this paper. SCBS is a character segmentation strategy for off-line cursive handwriting recognition. Unlike the approaches in the literature, SCBS is an unordered segmentation approach. SCBS is repetition of binary segmentation and fusion of segment confidence. Each repetition generates only one final segmentation point. The binary segmentation module is a contour tracing algorithm to find a segmentation path to divide a segment into two segments. A set of segments before binary segmentation is called pre-segments, and a set of segments after binary segmentation is called post-segments. SCBS uses over-segmentation technique to generate suspicious segmentation points on pre-segments. On each suspicious segmentation point, binary segmentation is performed and the highest fusion value is recorded. If the highest fusion value is greater than the one of pre-segments, the suspicious segmentation point becomes the final segmentation point for the iteration. If not, no more segmentation is required. Segment confidence is obtained by fusing mean character, lexical and shape confidences. The proposed approach has been evaluated on local and benchmark (CEDAR) databases.  相似文献   

4.
Chinese characters are constructed by strokes according to structural rules. Therefore, the geometric configurations of characters are important features for character recognition. In handwritten characters, stroke shapes and their spatial relations may vary to some extent. The attribute value of a structural identification is then a fuzzy quantity rather than a binary quantity. Recognizing these facts, we propose a fuzzy attribute representation (FAR) to describe the structural features of handwritten Chinese characters for an on-line Chinese character recognition (OLCCR) system. With a FAR. a fuzzy attribute graph for each handwritten character is created, and the character recognition process is thus transformed into a simple graph matching problem. This character representation and our proposed recognition method allow us to relax the constraints on stroke order and stroke connection. The graph model provides a generalized character representation that can easily incorporate newly added characters into an OLCCR system with an automatic learning capability. The fuzzy representation can describe the degree of structural deformation in handwritten characters. The character matching algorithm is designed to tolerate structural deformations to some extent. Therefore, even input characters with deformations can be recognized correctly once the reference dictionary of the recognition system has been trained using a few representative learning samples. Experimental results are provided to show the effectiveness of the proposed method.  相似文献   

5.
This paper proposes a model-based structural matching method for handwritten Chinese character recognition (HCCR). This method is able to obtain reliable stroke correspondence and enable structural interpretation. In the model base, the reference character of each category is described in an attributed relational graph (ARG). The input character is described with feature points and line segments. The strokes and inter-stroke relations of input character are not determined until being matched with a reference character. The structural matching is accomplished in two stages: candidate stroke extraction and consistent matching. All candidate input strokes to match the reference strokes are extracted by line following and then the consistent matching is achieved by heuristic search. Some structural post-processing operations are applied to improve the stroke correspondence. Recognition experiments were implemented on an image database collected in KAIST, and promising results have been achieved.  相似文献   

6.
7.
In this paper, we present a methodology for off-line handwritten character recognition. The proposed methodology relies on a new feature extraction technique based on recursive subdivisions of the character image so that the resulting sub-images at each iteration have balanced (approximately equal) numbers of foreground pixels, as far as this is possible. Feature extraction is followed by a two-stage classification scheme based on the level of granularity of the feature extraction method. Classes with high values in the confusion matrix are merged at a certain level and for each group of merged classes, granularity features from the level that best distinguishes them are employed. Two handwritten character databases (CEDAR and CIL) as well as two handwritten digit databases (MNIST and CEDAR) were used in order to demonstrate the effectiveness of the proposed technique. The recognition result achieved, in comparison to the ones reported in the literature, is the highest for the well-known CEDAR Character Database (94.73%) and among the best for the MNIST Database (99.03%)  相似文献   

8.
This paper presents an effective automated analysis system for mixed documents consisting of handwritten texts and graphic images. In the preprocessing step, an input image is binarized, then graphic regions are separated from text parts using chain codes of connected components. In the character recognition step, we recognize two different sets of handwritten characters: Korean and alphanumeric characters. Considering the structural complexity and variations of Korean characters, we separate them based on partial recognition results of vowels and extract primitive phonemes using a branch and bound algorithm based on dynamic programming (DP) matching. Finally, to validate recognition results, a dictionary and knowledge are employed. Computer simulation with 50 test documents shows that the proposed algorithm analyzes effectively mixed documents.  相似文献   

9.
孙伟 《微处理机》2002,(4):24-27
手写汉字识别是模式识别领域极具应用前景的研究课题之一。本文介绍了用Visual C^ 6.0构造用于研究手写汉字识别的模拟系统,用软件方式实现手写输入。该系统使用方便,使用者可以将精力集中在手写汉字特征提取、识别速度和识别率等方面。  相似文献   

10.
Describes a technique of gray-scale character recognition that offers both noise tolerance and affine-invariance. The key ideas are twofold. First is the use of normalized cross-correlation as a matching measure to realize noise tolerance. Second is the application of global affine transformation (GAT) to the input image so as to achieve affine-invariant correlation with the target image. In particular, optimal GAT is efficiently determined by the successive iteration method using topographic features of gray-scale images as matching constraints. We demonstrate the high matching ability of the proposed GAT correlation method using gray-scale images of numerals subjected to random Gaussian noise and a wide range of affine transformation. Moreover, extensive recognition experiments show that the achieved recognition rate of 94.3 percent against rotation within 30 degrees, scale change within 30 percent, and translation within 20 percent of the character width along with random Gaussian noise is sufficiently high compared to the 42.8 percent offered by simple correlation  相似文献   

11.
In this paper, we discuss an appearance-matching approach to the difficult problem of interpreting color scenes containing occluded objects. We have explored the use of an iterative, coarse-to-fine sum-squared-error method that uses information from hypothesized occlusion events to perform run-time modification of scene-to-template similarity measures. These adjustments are performed by using a binary mask to adaptively exclude regions of the template image from the squared-error computation. At each iteration higher resolution scene data as well as information derived from the occluding interactions between multiple object hypotheses are used to adjust these masks. We present results which demonstrate that such a technique is reasonably robust over a large database of color test scenes containing objects at a variety of scales, and tolerates minor 3D object rotations and global illumination variations. Received: 21 November 1996 / Accepted: 14 October 1997  相似文献   

12.
为了更加高效地利用模板匹配的方法实现对车牌字符图像的识别,结合数学形态学和模糊集理论,提出基于数学形态学的模糊模板匹配方法。首先,对于二值图像的每个像素点及其8-邻域,以赋权的方式刻画中心像素点隶属于字符的程度;其次,加4×4窗口选取代表点,并有重叠地遍历整个字符图像,以构造字符图像的模糊隶属度矩阵;进而运用海明贴近度计算待识别字符的归类,实现对字符的识别;最后,使用Matlab对模糊模板匹配方法进行编程,并在实际字符图像中测试识别效果。与传统模板匹配方法相比较,测试的结果表明,车牌字符的识别准确率得到了显著的提高。  相似文献   

13.
A fast method of handwritten word recognition suitable for real time applications is presented in this paper. Preprocessing, segmentation and feature extraction are implemented using a chain code representation of the word contour. Dynamic matching between characters of a lexicon entry and segment(s) of the input word image is used to rank the lexicon entries in order of best match. Variable duration for each character is defined and used during the matching. Experimental results prove that our approach using the variable duration outperforms the method using fixed duration in terms of both accuracy and speed. Speed of the entire recognition process is about 200 msec on a single SPARC-10 platform and the recognition accuracy is 96.8 percent are achieved for lexicon size of 10, on a database of postal words captured at 212 dpi  相似文献   

14.
针对传统弹性匹配法在手写字符识别中存在着由于过匹配而造成误识别的不足,提出一种基于高阶统计的形变弹性匹配法。根据高阶统计量包含字符形状上的细节变化信息,采用独立分量分析抽取出每个字符类的内在变化方向,并将其应用到弹性匹配的形变模型中。字符的任意种形状变化由这组独立分量的线性叠加来表示。通过形变模型,类模板字符发生形变逐次向输入待识别字符趋近,从而在两个字符之间求得一种最佳匹配。在实验结果中,识别率达到92.81%,得到了提高,表明该方法的有效性。  相似文献   

15.
卢达  浦炜  陈琦玮  谢铭培 《计算机应用》2005,25(10):2418-2421
对手写汉字识别问题,提出了一种在识别之前对手写汉字预分类的新方法,该方法用Neocognitron网提取字符笔画特征,然后采用有监督的扩展ART神经网络(SEART)产生一定数量的预分类组并通过基于模糊相似测量的匹配算法进行预分类。实验表明,该方法用于手写汉字分类效果良好,预分类正确率达到98.22%。  相似文献   

16.
A model-based computer vision system for recognizing handwritten ZIP codes   总被引:1,自引:1,他引:0  
This paper describes a recognition system for handwritten ZIP Codes currently under development at the Environmental Research Institute of Michigan (ERIM). Included within this system are techniques for preprocessing address block images, locating ZIP Codes, splitting touching characters, and identifying handwritten numerals. These techniques rely on mathematical morphology-based image processing and on hierarchical matching of object models to symbolic image representations. The image processing uses adaptive filtering, thresholding, and skeletonizing to create binary and state-labeled images. The matching process uses these images and extensively developed handwritten digit models to identify ZIP Codes. The end-to-end system has been tested on 500 randomly selected address block images. The system correctly recognized a large portion of the ZIP Codes in the test images (45.0%), and incorrectly classified a very low percentage of isolated handwritten digits (0.9%). Overall performance continues to be improved through incremental digit model refinement.This work was funded by the Office of Advanced Technology, United States Postal Service under contract 104230-86-H-0042.  相似文献   

17.
研究LeNet-5在扫描文档中手写体日期字符识别的应用,由于文档扫描的过程中会引入各种噪声,特别是光照和颜色干扰,直接使用LeNet-5算法不能取得较好效果。先在整份文档中对特定待识别字符的进行定位和划分,并对划分出的字符图像进行去噪、灰度化和二值化处理等预处理,接着将字符图像分割成一个个单个字符,然后在LeNet-5网络基础上结合模型匹配法实现对手写体日期字符的识别。分析在不同参数组合下的识别效果,调整算法模型参数有效地提升了模型对于实际对象的性能,实现出一种能够对手写体日期字符集实现较好识别效果的算法。实验结果表明了算法的有效性,并应用于具体工程实践。  相似文献   

18.
In this paper, we present a novel segmentation-free Arabic handwriting recognition system based on hidden Markov model (HMM). Two main contributions are introduced: a new technique for dividing the image into nonuniform horizontal segments to extract the features and a new technique for solving the problems of the skewing of characters by fusing multiple HMMs. Moreover, two enhancements are introduced: the pre-processing method and feature extraction using concavity space. The proposed system first pre-processes the input image by setting the thickness of the input word to three pixels and fixing the spacing between the different parts of the word. The input image is divided into constant number of nonuniform horizontal segments depending on the distribution of the foreground pixels. A set of robust features representing the gradient of the foreground pixels is extracted using sliding windows. The input image is decomposed into several images representing the vertical, horizontal, left diagonal and right diagonal edges in the image. A set of robust features representing the densities of the foreground pixels in the various edge images is extracted using sliding windows. The proposed system builds character HMM models and learns word HMM models using embedded training. Besides the vertical sliding window, two slanted sliding windows are used to extract the features. Three different HMMs are used: one for the vertical sliding window and two for the slanted windows. A fusion scheme is used to combine the three HMMs. The proposed system is very promising and outperforms all the other Arabic handwriting recognition systems reported in the literature.  相似文献   

19.
Segmentation is one of the most important pre-processing steps toward pattern recognition and image understanding. It is often used to partition an image into separate regions, which ideally correspond to different real-world objects. In this paper, novel color image segmentation is proposed and implemented using fuzzy inference system in optimized color space. This system, which is designed by neuro-adaptive learning technique, applies a sample image as an input and can reveal the likelihood of being a special color for each pixel through the image. The intensity of each pixel shows this likelihood in the gray-level output image. After choosing threshold value, a binary image is obtained, which can be applied as a mask to segment desired color in input image. Besides using fuzzy systems, optimizing color space for segmentation is another feature of proposed method. This optimizing is implemented by genetic algorithms and influence on system accuracy. Two applications of developed method are discussed, and still it could be applicable in wide range of color image segmentation or object detection purposes.  相似文献   

20.
针对手写数字识别提出一种基于模板匹配决策分类器设计方法。就该方法下的模式识别分类器设计进行详细论述,给出该分类器算法实现。该算法在对手写的数字图像进行预处理的基础上从待识别的手写数字图像中提取若干特征量与事先建立的标准模板库中模板对应的特征量进行比较,计算待识别图像和标准模板特征量之间的距离,用最小距离法判定其所属类。实验结果表明,该决策分类器算法实现容易,匹配速度快,保证字符识别的正确率。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号