期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

The role of holistic paradigms in handwritten word recognition 总被引：1，自引：0，他引：1

Madhvanath S. Govindaraju V. 《IEEE transactions on pattern analysis and machine intelligence》2001,23(2):149-164

The holistic paradigm in handwritten word recognition treats the word as a single, indivisible entity and attempts to recognize words from their overall shape, as opposed to their character contents. In this survey, we have attempted to take a fresh look at the potential role of the holistic paradigm in handwritten word recognition. The survey begins with an overview of studies of reading which provide evidence for the existence of a parallel holistic reading process,in both developing and skilled readers. In what we believe is a fresh perspective on handwriting recognition, approaches to recognition are characterized as forming a continuous spectrum based on the visual complexity of the unit of recognition employed and an attempt is made to interpret well-known paradigms of word recognition in this framework. An overview of features, methodologies, representations, and matching techniques employed by holistic approaches is presented 相似文献

2.

A structural and relational approach to handwritten wordrecognition

Buse R. Zhi-Qiang Liu Caelli T. 《IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics》1997,27(5):847-861

In this paper, we present a new off-line word recognition system that is able to recognize unconstrained handwritten words using grey-scale images. This is based on structural and relational information in the handwritten word. We use Gabor filters to extract features from the words, and then use an evidence-based approach for word classification. A solution to the Gabor filter parameter estimation problem is given, enabling the Gabor filter to be automatically tuned to the word image properties. We also developed two new methods for correcting the slope of the handwritten words. Our experiments show that the proposed method achieves good recognition rates compared to standard classification methods. 相似文献

3.

HAH manuscripts: A holistic paradigm for classifying and retrieving historical Arabic handwritten documents

Zaher Al Aghbari Salama Brook 《Expert systems with applications》2009,36(8):10942-10951

相似文献

4.

A Line-Oriented Approach to Word Spotting in Handwritten Documents

A. Kolcz J. Alspector M. Augusteijn R. Carlson G. Viorel Popescu 《Pattern Analysis & Applications》2000,3(2):153-168

相似文献

5.

A line-based representation for matching words in historical manuscripts

Ethem F. Can P?nar Duygulu 《Pattern recognition letters》2011,32(8):1126-1138

相似文献

6.

A hierarchical approach to recognition of handwritten Bangla characters

Subhadip Basu Author VitaeAuthor Vitae Ram Sarkar Author VitaeAuthor Vitae Mita Nasipuri^{Author Vitae} Dipak Kumar Basu Author Vitae 《Pattern recognition》2009,42(7):1467-1484

相似文献

7.

Recognizing handwritten Arabic words using grapheme segmentation and recurrent neural networks

Gheith A. Abandah Fuad T. Jamour Esam A. Qaralleh 《International Journal on Document Analysis and Recognition》2014,17(3):275-291

The Arabic alphabet is used in around 27 languages, including Arabic, Persian, Kurdish, Urdu, and Jawi. Many researchers have developed systems for recognizing cursive handwritten Arabic words, using both holistic and segmentation-based approaches. This paper introduces a system that achieves high accuracy using efficient segmentation, feature extraction, and recurrent neural network (RNN). We describe a robust rule-based segmentation algorithm that uses special feature points identified in the word skeleton to segment the cursive words into graphemes. We show that careful selection from a wide range of features extracted during and after the segmentation stage produces a feature set that significantly reduces the label error. We demonstrate that using same RNN recognition engine, the segmentation approach with efficient feature extraction gives better results than a holistic approach that extracts features from raw pixels. We evaluated this segmentation approach against an improved version of the holistic system MDLSTM that won the ICDAR 2009 Arabic handwritten word recognition competition. On the IfN/ENIT database of handwritten Arabic words, the segmentation approach reduces the average label error by 18.5 %, the sequence error by 22.3 %, and the execution time by 31 %, relative to MDLSTM. This approach also has the best published accuracies on two IfN/ENIT test sets. 相似文献

8.

Hidden Markov Model Based Word Recognition and Its Application to Legal Amount Reading on French Checks 总被引：1，自引：0，他引：1

S. Knerr E. Augustin O. Baret D. Price 《Computer Vision and Image Understanding》1998,70(3):404-419

A hidden Markov model (HMM) based word recognition algorithm for the recognition of legal amounts from French bank checks is presented. This algorithm is part of the A2iA INTERCHEQUE recognition system. The algorithm starts from images of handwritten words which have been automatically segmented from binary check images. After finding the lower-case zone on the complete amount, words are slant corrected and then segmented into graphemes. Then, features are extracted from the graphemes, and the feature vectors are vector quantized resulting in a sequence of symbols for each word. Likelihoods of all word classes are computed by a set of HMMs, which have been previously trained using either the Viterbi algorithm or the Baum–Welch algorithm. The various parameters of the system have been identified and their importance evaluated. Results have been obtained on large real-life data bases of French handwritten checks. The HMM-based system has been shown to outperform a holistic word recognizer and another HMM-type word recognizer from the A2iA INTERCHEQUE recognition system. Word recognition rates of about 89% for the 26-word vocabulary relevant for legal amount recognition on French bank checks have been obtained. More recently, a Neural Network–HMM hybrid has been designed, which produces even better recognition rates. 相似文献

9.

Chaincode contour processing for handwritten word recognition

Madhvanath S. Kim G. Govindaraju V. 《IEEE transactions on pattern analysis and machine intelligence》1999,21(9):928-932

Contour representations of binary images of handwritten words afford considerable reduction in storage requirements while providing lossless representation. On the other hand, the one-dimensional nature of contours presents interesting challenges for processing images for handwritten word recognition. Our experiments indicate that significant gains are to be realized in both speed and recognition accuracy by using a contour representation in handwriting applications 相似文献

10.

Finding words in alphabet soup: Inference on freeform character recognition for historical scripts

Nicholas R. Howe Shaolei Feng R. ManmathaAuthor vitae 《Pattern recognition》2009,42(12):3338-3347

This paper develops word recognition methods for historical handwritten cursive and printed documents. It employs a powerful segmentation-free letter detection method based upon joint boosting with histograms of gradients as features. Efficient inference on an ensemble of hidden Markov models can select the most probable sequence of candidate character detections to recognize complete words in ambiguous handwritten text, drawing on character n-gram and physical separation models. Experiments with two corpora of handwritten historic documents show that this approach recognizes known words more accurately than previous efforts, and can also recognize out-of-vocabulary words. 相似文献

11.

Word spotting for historical documents

Toni M. Rath R. Manmatha 《International Journal on Document Analysis and Recognition》2007,9(2-4):139-152

Searching and indexing historical handwritten collections are a very challenging problem. We describe an approach called word spotting which involves grouping word images into clusters of similar words by using image matching to find similarity. By annotating “interesting” clusters, an index that links words to the locations where they occur can be built automatically. Image similarities computed using a number of different techniques including dynamic time warping are compared. The word similarities are then used for clustering using both K-means and agglomerative clustering techniques. It is shown in a subset of the George Washington collection that such a word spotting technique can outperform a Hidden Markov Model word-based recognition technique in terms of word error rates. An erratum to this article can be found at 相似文献

12.

Text search for medieval manuscript images

Yann Frank Hubert 《Pattern recognition》2007,40(12):3552-3567

相似文献

13.

Automatic scoring of short handwritten essays in reading comprehension tests

Sargur Srihari Jim Collins Rohini Srihari Harish Srinivasan Shravya Shetty Janina Brutt-Griffler 《Artificial Intelligence》2008,172(2-3):300-324

相似文献

14.

Modeling and recognition of cursive words with hidden Markov models

Wongyu Seong-Whan Jin H. 《Pattern recognition》1995,28(12):1941-1953

In this paper, a new method for modeling and recognizing cursive words with hidden Markov models (HMM) is presented. In the proposed method, a sequence of thin fixed-width vertical frames are extracted from the image, capturing the local features of the handwriting. By quantizing the feature vectors of each frame, the input word image is represented as a Markov chain of discrete symbols. A handwritten word is regarded as a sequence of characters and optional ligatures. Hence, the ligatures are also explicitly modeled. With this view, an interconnection network of character and ligature HMMs is constructed to model words of indefinite length. This model can ideally describe any form of handwritten words, including discretely spaced words, pure cursive words and unconstrained words of mixed styles. Experiments have been conducted with a standard database to evaluate the performance of the overall scheme. The performance of various search strategies based on the forward and backward score has been compared. Experiments on the use of a preclassifier based on global features show that this approach may be useful for even large-vocabulary recognition tasks. 相似文献

15.

Invariant pattern recognition using contourlets and AdaBoost

G.Y. Chen Author Vitae B. Kégl^{Author Vitae} 《Pattern recognition》2010,43(3):579-5514

In this paper, we propose new methods for palmprint classification and handwritten numeral recognition by using the contourlet features. The contourlet transform is a new two dimensional extension of the wavelet transform using multiscale and directional filter banks. It can effectively capture smooth contours that are the dominant features in palmprint images and handwritten numeral images. AdaBoost is used as a classifier in the experiments. Experimental results show that the contourlet features are very stable features for invariant palmprint classification and handwritten numeral recognition, and better classification rates are reported when compared with other existing classification methods. 相似文献

16.

Cursive word recognition using a random field based hidden Markov model

George Saon 《International Journal on Document Analysis and Recognition》1999,1(4):199-208

相似文献

17.

Recognition of handwritten Lanna Dhamma characters using a set of optimally designed moment features

Papangkorn Inkeaw Phasit Charoenkwan Hui-Ling Huang Sanparith Marukatat Shinn-Ying Ho Jeerayut Chaijaruwanich 《International Journal on Document Analysis and Recognition》2017,20(4):259-274

相似文献

18.

KERTAS: dataset for automatic dating of ancient Arabic manuscripts

Kalthoum Adam Asim Baig Somaya Al-Maadeed Ahmed Bouridane Sherine El-Menshawy 《International Journal on Document Analysis and Recognition》2018,21(4):283-290

相似文献

19.

Performance of hidden Markov model and dynamic Bayesian network classifiers on handwritten Arabic word recognition

Jawad H. AlKhateeb Olivier Pauplin Jinchang Ren Jianmin Jiang 《Knowledge》2011,24(5):680-688

This paper presents a comparative study of two machine learning techniques for recognizing handwritten Arabic words, where hidden Markov models (HMMs) and dynamic Bayesian networks (DBNs) were evaluated. The work proposed is divided into three stages, namely preprocessing, feature extraction and classification. Preprocessing includes baseline estimation and normalization as well as segmentation. In the second stage, features are extracted from each of the normalized words, where a set of new features for handwritten Arabic words is proposed, based on a sliding window approach moving across the mirrored word image. The third stage is for classification and recognition, where machine learning is applied using HMMs and DBNs. In order to validate the techniques, extensive experiments were conducted using the IFN/ENIT database which contains 32,492 Arabic words. Experimental results and quantitative evaluations showed that HMM outperforms DBN in terms of higher recognition rate and lower complexity. 相似文献

20.

A lexicon driven approach to handwritten word recognition forreal-time applications

Gyeonghwan Kim Govindaraju V. 《IEEE transactions on pattern analysis and machine intelligence》1997,19(4):366-379

A fast method of handwritten word recognition suitable for real time applications is presented in this paper. Preprocessing, segmentation and feature extraction are implemented using a chain code representation of the word contour. Dynamic matching between characters of a lexicon entry and segment(s) of the input word image is used to rank the lexicon entries in order of best match. Variable duration for each character is defined and used during the matching. Experimental results prove that our approach using the variable duration outperforms the method using fixed duration in terms of both accuracy and speed. Speed of the entire recognition process is about 200 msec on a single SPARC-10 platform and the recognition accuracy is 96.8 percent are achieved for lexicon size of 10, on a database of postal words captured at 212 dpi 相似文献