1.

Eigen-deformations for elastic matching based handwritten character recognition

Seiichi Uchida Hiroaki Sakoe 《Pattern recognition》2003,36(9):2031-2040

Deformations in handwritten characters have category-dependent tendencies. In this paper, the estimation and the utilization of such tendencies called eigen-deformations are investigated for the better performance of elastic matching based handwritten character recognition. The eigen-deformations are estimated by the principal component analysis of actual deformations automatically collected by the elastic matching. From experimental results it was shown that typical deformations of each category can be extracted as the eigen-deformations. It was also shown that the recognition performance can be improved significantly by using the eigen-deformations for the detection of overfitting, which is the main cause of the misrecognition in the elastic matching based recognition methods.

2.

Parallel compact integration in handwritten Chinese character recognition (total citations: 1; self-citations: 0; citations by others: 1)

WANG Chunheng XIAO Baihua & DAI Ruwei, Institute of Automation, Chinese Academy of Sciences, Beijing, China 《Science in China Series F (English Edition)》2004,47(1):89-96

In this paper, a new parallel compact integration scheme based on multi-layer perceptron (MLP) networks is proposed to solve handwritten Chinese character recognition (HCCR) problems. The idea of metasynthesis is applied to HCCR, and a compact MLP network classifier is defined. Human intelligence and computer capabilities are combined effectively through a procedure of two-step supervised learning. Compared with previous integration schemes, this scheme is characterized by a parallel compact structure and better performance. It provides a promising way of applying MLP to large vocabulary classification.

3.

Real-time arabic handwritten character recognition

T.S. El-Sheikh S.G. El-Taweel 《Pattern recognition》1990,23(12):1323-1332

This paper considers the development of a real-time Arabic handwritten character recognition system. The shape of an Arabic character depends on its position in a given word. The system assumes that characters result from a reliable segmentation stage, so the position of the character is known a priori. Accordingly, four different sets of character shapes have been considered independently. Each set is further divided into four subsets depending on the number of strokes in the character. The system has been heavily tested, and the average recognition rate was found to be 99.6%, with most of the misrecognized characters having been written with little care. Thus, the system can be reliably used for the recognition of on-line handwritten characters entered via a graphic tablet.

4.

Learning stochastic edit distance: Application in handwritten character recognition

Jose Oncina Marc Sebban 《Pattern recognition》2006,39(9):1575-1587

Many pattern recognition algorithms are based on the nearest-neighbour search and use the well-known edit distance, for which the primitive edit costs are usually fixed in advance. In this article, we aim at learning an unbiased stochastic edit distance in the form of a finite-state transducer from a corpus of (input, output) pairs of strings. Contrary to the other standard methods, which generally use the Expectation Maximisation algorithm, our algorithm learns a transducer independently of the marginal probability distribution of the input strings. Proceeding in this unbiased way requires optimising the parameters of a conditional transducer instead of a joint one. We apply our new model in the context of handwritten digit recognition. We show, carrying out a large series of experiments, that it always outperforms the standard edit distance.
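
For orientation, the standard edit distance that this abstract uses as its baseline can be sketched as below. This is the classical dynamic program with costs fixed in advance, not the learned stochastic transducer the paper proposes. The function name `edit_distance` and the unit default costs are assumptions made for illustration.

```python
def edit_distance(source, target, ins_cost=1, del_cost=1, sub_cost=1):
    """Classical edit distance with fixed primitive costs (illustrative baseline)."""
    rows, cols = len(source) + 1, len(target) + 1
    # dist[i][j] = cheapest way to turn source[:i] into target[:j]
    dist = [[0] * cols for _ in range(rows)]
    for i in range(1, rows):
        dist[i][0] = i * del_cost          # delete every symbol of source[:i]
    for j in range(1, cols):
        dist[0][j] = j * ins_cost          # insert every symbol of target[:j]
    for i in range(1, rows):
        for j in range(1, cols):
            same = source[i - 1] == target[j - 1]
            dist[i][j] = min(
                dist[i - 1][j] + del_cost,                       # deletion
                dist[i][j - 1] + ins_cost,                       # insertion
                dist[i - 1][j - 1] + (0 if same else sub_cost),  # match or substitution
            )
    return dist[-1][-1]
```

In a nearest-neighbour recognizer, a string encoding of the unknown digit would be compared against every stored prototype with this function; learning the costs, as the abstract describes, replaces the three fixed parameters.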

5.

Zoning methods for handwritten character recognition: A survey

D. Impedovo G. Pirlo 《Pattern recognition》2014

This paper presents a survey on zoning methods for handwritten character recognition. Through the analysis of the relevant literature in the field, the most valuable zoning methods are presented in terms of both topologies and membership functions. Throughout the paper, diverse zoning topologies are presented based on both static and adaptive approaches. Concerning static approaches, uniform and non-uniform zoning strategies are discussed. When adaptive zonings are considered, manual and automatic strategies for optimal zoning design are illustrated as well as the most appropriate zoning representation techniques. In addition, the role of membership functions for zoning-based classification is highlighted and the diverse approaches to membership function selection are presented. Concerning global membership functions, the paper introduces order-based approaches as well as fuzzy approaches using border-based and ranked-based fuzzy membership values. Concerning local membership functions, the recent parameter-based approaches are described, in which the optimal membership-function is selected for each zone of the zoning method. Finally, a comparative analysis on the performance of zoning methods is presented and the most interesting approaches are focused on in terms of topology design and membership function selection. A list of selected references is provided as a useful tool for interested researchers working in the field.
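
As a rough illustration of the simplest case mentioned in this abstract, static uniform zoning, the sketch below splits a binary character image into a regular grid and uses the foreground density of each zone as a feature. The function name `uniform_zoning_features`, the list-of-lists image format, and the choice of density as the per-zone feature are all assumptions; the survey covers many other topologies and membership functions.

```python
def uniform_zoning_features(image, zone_rows, zone_cols):
    """Foreground density per zone of a uniform grid (hypothetical helper).

    `image` is a non-empty list of equal-length rows holding 0 (background)
    or 1 (foreground).
    """
    height, width = len(image), len(image[0])
    features = []
    for zr in range(zone_rows):
        top = zr * height // zone_rows
        bottom = (zr + 1) * height // zone_rows
        for zc in range(zone_cols):
            left = zc * width // zone_cols
            right = (zc + 1) * width // zone_cols
            area = (bottom - top) * (right - left)
            ink = sum(image[r][c]
                      for r in range(top, bottom)
                      for c in range(left, right))
            # An empty zone (grid finer than the image) contributes 0.0
            features.append(ink / area if area else 0.0)
    return features


glyph = [
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 0, 1],
    [0, 0, 1, 1],
]
print(uniform_zoning_features(glyph, 2, 2))  # [1.0, 0.0, 0.0, 0.75]
```

The resulting vector lists zones row by row and could be passed to any classifier.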

6.

High accuracy handwritten Chinese character recognition using LDA-based compound distances

Tian-Fu Gao Cheng-Lin Liu 《Pattern recognition》2008,41(11):3442-3451

To improve the accuracy of handwritten Chinese character recognition (HCCR), we propose linear discriminant analysis (LDA)-based compound distances for discriminating similar characters. The LDA-based method is an extension of the previous compound Mahalanobis function (CMF), which calculates a complementary distance on a one-dimensional subspace (discriminant vector) for discriminating two classes and combines this complementary distance with a baseline quadratic classifier. We use LDA to estimate the discriminant vector for better discriminability and show that under restrictive assumptions, the CMF is a special case of our LDA-based method. Further improvements can be obtained when the discriminant vector is estimated from higher-dimensional feature spaces. We evaluated the methods in experiments on the ETL9B and CASIA databases using the modified quadratic discriminant function (MQDF) as the baseline classifier. The results demonstrate the superiority of the LDA-based method over the CMF and the superiority of discriminant vector learning from high-dimensional feature spaces. Compared to the MQDF, the proposed method reduces the error rates by factors of over 26%.

7.

A new benchmark on the recognition of handwritten Bangla and Farsi numeral characters

Cheng-Lin Liu Ching Y. Suen 《Pattern recognition》2009,(12):3287-3295

The recognition of Indian and Arabic handwriting has drawn increasing attention in recent years. To test the promise of existing handwritten numeral recognition methods and provide new benchmarks for future research, this paper presents some results of handwritten Bangla and Farsi numeral recognition on binary and gray-scale images. For recognition on gray-scale images, we propose a process with proper image pre-processing and feature extraction. In experiments on three databases, ISI Bangla numerals, CENPARMI Farsi numerals, and IFHCDB Farsi numerals, we have achieved very high accuracies using various recognition methods. The highest test accuracies on the three databases are 99.40%, 99.16%, and 99.73%, respectively. We justified the benefit of recognition on gray-scale images against binary images, and compared some implementation choices of gradient direction feature extraction as well as some advanced normalization and classification methods.

8.

A novel stroke-based feature extraction for handwritten Chinese character recognition (total citations: 7; self-citations: 0; citations by others: 7)

Hung-Pin Chiu Din-Chang Tseng 《Pattern recognition》1999,32(12):301-1959

A stroke-based approach to extract skeletons and structural features for handwritten Chinese character recognition is proposed. We first determine stroke directions based on the directional run-length information of binary character patterns. According to the stroke directions and their adjacent relationships, we split strokes into stroke and fork segments, and then extract the skeletons of the stroke segments called skeleton segments. After all skeleton segments are extracted, fork segments are processed to find the fork points and fork degrees. Skeleton segments that touch a fork segment are connected at the fork point, and all connected skeleton segments form the character skeleton. According to the extracted skeletons and fork points, we can extract primitive strokes and stroke direction maps for recognition. A simple classifier based on the stroke direction map is presented to recognize regular and rotated characters to verify the ability of the proposed feature extraction for handwritten Chinese character recognition. Several experiments are carried out, and the experimental results show that the proposed approach can easily and effectively extract skeletons and structural features, and works well for handwritten Chinese character recognition.

9.

Performance evaluation of pattern classifiers for handwritten character recognition

Cheng-Lin Liu Hiroshi Sako Hiromichi Fujisawa 《International Journal on Document Analysis and Recognition》2002,4(3):191-204

This paper describes a performance evaluation study in which some efficient classifiers are tested in handwritten digit recognition. The evaluated classifiers include a statistical classifier (modified quadratic discriminant function, MQDF), three neural classifiers, and an LVQ (learning vector quantization) classifier. They are efficient in that high accuracies can be achieved at moderate memory space and computation cost. The performance is measured in terms of classification accuracy, sensitivity to training sample size, ambiguity rejection, and outlier resistance. The outlier resistance of neural classifiers is enhanced by training with synthesized outlier data. The classifiers are tested on a large data set extracted from NIST SD19. In the results, the test accuracies of the evaluated classifiers are comparable to or higher than those of the nearest neighbor (1-NN) rule and regularized discriminant analysis (RDA). It is shown that neural classifiers are more susceptible to small sample size than MQDF, although they yield higher accuracies on large sample size. As a neural classifier, the polynomial classifier (PC) gives the highest accuracy and performs best in ambiguity rejection. On the other hand, MQDF is superior in outlier rejection even though it is not trained with outlier data. The results indicate that pattern classifiers have complementary advantages and they should be appropriately combined to achieve higher performance. Received: July 18, 2001 / Accepted: September 28, 2001

10.

Genetic engineering of hierarchical fuzzy regional representations for handwritten character recognition (total citations: 1; self-citations: 0; citations by others: 1)

Christian Gagné Marc Parizeau 《International Journal on Document Analysis and Recognition》2006,8(4):223-231

This paper presents a genetic programming based approach for optimizing the feature extraction step of a handwritten character recognizer. This recognizer uses a simple multilayer perceptron as a classifier and operates on a hierarchical feature space of orientation, curvature, and center of mass primitives. The nodes of the hierarchy represent rectangular sub-regions of their parent node, the tree root corresponding to the character's bounding box. Within each sub-region, a variable number of fuzzy features are extracted. Genetic programming is used to simultaneously learn the best hierarchy and the best combination of fuzzy features. Moreover, the fuzzy features are not predetermined; they are inferred from the evolution process, which runs a two-objective selection operator. The first objective maximizes the recognition rate, and the second minimizes the feature space size. Results on Unipen data show that, using this approach, robust representations could be obtained that out-performed comparable human designed hierarchical fuzzy regional representations.

11.

A hierarchical approach to recognition of handwritten Bangla characters

Subhadip Basu Ram Sarkar Mita Nasipuri Dipak Kumar Basu 《Pattern recognition》2009,42(7):1467-1484

12.

A classifier for Bangla handwritten numeral recognition

Ying Wen Lianghua He 《Expert systems with applications》2012,39(1):948-953

This paper presents a novel pattern classification approach - a kernel and Bayesian discriminant based classifier which utilizes the distribution characteristics of the samples in each class. A kernel combined with Bayesian discriminant in the subspace spanned by the eigenvectors which are associated with the smaller eigenvalues in each class is adopted as the classification criterion. To solve the problem of the matrix inverse, the smaller eigenvalues are substituted by a small threshold which is decided by minimizing the training error in a given database. Application of the proposed classifier to the issue of handwritten numeral recognition demonstrates that it is promising in practical applications.

13.

Corpus-based HIT-MW database for offline recognition of general-purpose Chinese handwritten text

Tonghua Su Tianwen Zhang Dejun Guan 《International Journal on Document Analysis and Recognition》2007,10(1):27-38

A Chinese handwriting database named HIT-MW is presented to facilitate the offline Chinese handwritten text recognition. Both the writers and the texts for handcopying are carefully sampled with a systematic scheme. To collect naturally written handwriting, forms are distributed by postal mail or middleman instead of face to face. The current version of HIT-MW includes 853 forms and 186,444 characters that are produced under an unconstrained condition without preprinted character boxes. The statistics show that the database has an excellent representation of the real handwriting. Many new applications concerning real handwriting recognition can be supported by the database.

14.

Combining additive input noise annealing and pattern transformations for improved handwritten character recognition

《Expert systems with applications》2014,41(18):8180-8188

Two problems that burden the learning process of Artificial Neural Networks with Back Propagation are the need of building a full and representative learning data set, and the avoidance of stalling in local minima. Both problems seem to be closely related when working with the handwritten digits contained in the MNIST dataset. Using a modest sized ANN, the proposed combination of input data transformations enables the achievement of a test error as low as 0.43%, which is up to standard compared to other more complex neural architectures like Convolutional or Deep Neural Networks.

15.

Pseudo two-dimensional shape normalization methods for handwritten Chinese character recognition

Cheng-Lin Katsumi 《Pattern recognition》2005,38(12):2242-2255

The nonlinear normalization (NLN) method based on line density equalization is popularly used in handwritten Chinese character recognition. To overcome the insufficient shape restoration capability of one-dimensional NLN, a pseudo two-dimensional NLN (P2DNLN) method has been proposed and has yielded higher recognition accuracy. The P2DNLN method, however, is very computationally expensive because of the line density blurring of each row/column. In this paper, we propose a new pseudo 2D normalization method using line density projection interpolation (LDPI), which partitions the line density map into soft strips and generates a 2D coordinate mapping function by interpolating the 1D coordinate functions that are obtained by equalizing the line density projections of these strips. The LDPI method adds little computational overhead to one-dimensional NLN yet performs comparably well with P2DNLN. We also apply this strategy to extending other normalization methods, including line density projection fitting, centroid-boundary alignment, moment, and bi-moment methods. The latter three methods are directly based on the character image instead of the line density map. Their 2D extensions provide real-time computation and high recognition accuracy, and are potentially applicable to gray-scale images and online trajectories.

16.

An overview of character recognition methodologies (total citations: 3; self-citations: 0; citations by others: 3)

J. Mantas 《Pattern recognition》1986,19(6):425-430

This work presents an overview of character recognition methodologies that have evolved in this century. At first the scanning devices that are used in character recognition will be explained, then some points will be stressed on the major research works that have made a great impact in character recognition. From a methodological point of view we will present the different steps that have been employed in OCR. And finally the most important industrial character recognisers will be covered along with the character data bases that are used in testing the various algorithms.

17.

Handwritten character recognition through two-stage foreground sub-sampling

Georgios Vamvakas Basilis Gatos 《Pattern recognition》2010,43(8):2807-2816

In this paper, we present a methodology for off-line handwritten character recognition. The proposed methodology relies on a new feature extraction technique based on recursive subdivisions of the character image so that the resulting sub-images at each iteration have balanced (approximately equal) numbers of foreground pixels, as far as this is possible. Feature extraction is followed by a two-stage classification scheme based on the level of granularity of the feature extraction method. Classes with high values in the confusion matrix are merged at a certain level and for each group of merged classes, granularity features from the level that best distinguishes them are employed. Two handwritten character databases (CEDAR and CIL) as well as two handwritten digit databases (MNIST and CEDAR) were used in order to demonstrate the effectiveness of the proposed technique. The recognition result achieved, in comparison to the ones reported in the literature, is the highest for the well-known CEDAR Character Database (94.73%) and among the best for the MNIST Database (99.03%).
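
The balanced subdivision idea in this abstract can be sketched for a single level as follows: choose the horizontal and vertical cuts that split the foreground pixels as evenly as possible. This is a simplified reading of the abstract, not the authors' exact procedure; the names `balanced_cut` and `division_point` and the list-of-lists binary image format are assumptions.

```python
def balanced_cut(profile):
    """Index k such that sum(profile[:k]) and sum(profile[k:]) are as equal as possible."""
    total = sum(profile)
    best_index, best_gap = 0, total   # k = 0 puts everything on the right side
    running = 0
    for index, value in enumerate(profile):
        running += value
        gap = abs(total - 2 * running)  # |left - right| for a cut after `index`
        if gap < best_gap:
            best_index, best_gap = index + 1, gap
    return best_index


def division_point(image):
    """(row_cut, col_cut) balancing foreground counts of a binary image."""
    row_profile = [sum(row) for row in image]
    col_profile = [sum(col) for col in zip(*image)]
    return balanced_cut(row_profile), balanced_cut(col_profile)


glyph = [
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 0, 1],
    [0, 0, 1, 1],
]
print(division_point(glyph))  # (2, 2)
```

Applying `division_point` again to each of the four resulting sub-images would give the next, finer level of granularity.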

18.

A confidence value estimation method for handwritten Kanji character recognition and its application to candidate reduction

Eiki Ishidera Daisuke Nishiwaki Atsushi Sato 《International Journal on Document Analysis and Recognition》2003,6(4):263-270

This paper describes a method for estimating a confidence value (CV) by which we can express the potential correctness of handwritten Kanji character recognition candidates. An accumulated confidence value (ACV), calculated as the sum of CVs, is also applied to reduce the number of candidates. Such reduction is vital to increasing the speed of such applications as Kanji address recognition, and it also reduces the probability of misreadings in linguistic postprocessing. Sorted sets of character candidates, ranked in increasing order of each candidate's distance value, are used as feature vectors. A CV is defined as the a posteriori probability with respect to each rank. To obtain good quality approximations of probability density functions (PDFs), we introduce a subspace within which correct data can easily be separated from erroneous data and then estimate PDF parameters over this subspace. Next, we use an ACV as a measure for expressing a threshold for candidate acceptance in Kanji character recognition. The efficiency of the proposed method is evaluated in an experiment using IPTP CD-ROM2 Japanese address images, and a comparison with the results for a conventional method shows that a roughly 35% reduction in the number of candidates is obtained without reducing the number of correct candidates. Received: 29 October 2001, Accepted: 30 September 2003, Published online: 1 April 2004. Correspondence to: Eiki Ishidera
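
One plausible way to use an accumulated confidence value as an acceptance threshold is sketched below: candidates are taken in rank order until the sum of their CVs reaches a chosen level, and the rest are discarded. The abstract does not give the exact rule, so the function name `reduce_candidates`, the threshold of 0.95, and the example CVs are hypothetical; estimating the CVs themselves (the PDF modelling the paper describes) is outside this sketch.

```python
def reduce_candidates(ranked_candidates, acv_threshold=0.95):
    """Keep top-ranked candidates until their accumulated CV reaches the threshold.

    `ranked_candidates` is a list of (label, confidence_value) pairs, best first.
    Both the pair format and the default threshold are assumptions.
    """
    kept = []
    accumulated = 0.0
    for label, confidence in ranked_candidates:
        kept.append(label)
        accumulated += confidence
        if accumulated >= acv_threshold:
            break  # remaining candidates are dropped
    return kept


candidates = [("A", 0.70), ("B", 0.20), ("C", 0.06), ("D", 0.04)]
print(reduce_candidates(candidates))       # ['A', 'B', 'C']
print(reduce_candidates(candidates, 0.5))  # ['A']
```

A lower threshold prunes more aggressively, trading a smaller candidate list against a higher chance of discarding the correct character.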

19.

Efficient postprocessing algorithms for error correction in handwritten Hangul address and human name recognition

Seong-Whan Lee Eun-Soon Kim 《Pattern recognition》1994,27(12):1631-1640

20.

A genetic framework using contextual knowledge for segmentation and recognition of handwritten numeral strings

Javad Sadri Ching Y. Suen 《Pattern recognition》2007,40(3):898-919

For the first time, a genetic framework using contextual knowledge is proposed for segmentation and recognition of unconstrained handwritten numeral strings. New algorithms have been developed to locate feature points on the string image, and to generate possible segmentation hypotheses. A genetic representation scheme is utilized to show the space of all segmentation hypotheses (chromosomes). For the evaluation of segmentation hypotheses, a novel evaluation scheme is introduced, in order to improve the outlier resistance of the system. Our genetic algorithm tries to search and evolve the population of segmentation hypotheses, and to find the one with the highest segmentation/recognition confidence. The NIST NSTRING SD19 and CENPARMI databases were used to evaluate the performance of our proposed method. Our experiments showed that proper use of contextual knowledge in segmentation, evaluation and search greatly improves the overall performance of the system. On average, our system was able to obtain correct recognition rates of 95.28% and 96.42% on handwritten numeral strings using neural network and support vector classifiers, respectively. These results compare favorably with the ones reported in the literature.