共查询到20条相似文献,搜索用时 15 毫秒
1.
Connell S.D. Jain A.K. 《IEEE transactions on pattern analysis and machine intelligence》2002,24(3):329-346
Writer-adaptation is the process of converting a writer-independent handwriting recognition system into a writer-dependent system. It can greatly increasing recognition accuracy, given adequate writer models. The limited amount of data a writer provides during training constrains the models' complexity. We show how appropriate use of writer-independent models is important for the adaptation. Our approach uses writer-independent writing style models (lexemes) to identify the styles present in a particular writer's training data. These models are then updated using the writer's data. Lexemes in the writer's data for which an inadequate number of training examples is available are replaced with the writer-independent models. We demonstrate the feasibility of this approach on both isolated handwritten character recognition and unconstrained word recognition tasks. Our results show an average reduction in error rate of 16.3 percent for lowercase characters as compared against representing each of the writer's character classes with a single model. In addition, an average error rate reduction of 9.2 percent is shown on handwritten words using only a small amount of data for adaptation 相似文献
2.
HMM based online handwriting recognition 总被引:3,自引:0,他引:3
Jianying Hu Brown M.K. Turin W. 《IEEE transactions on pattern analysis and machine intelligence》1996,18(10):1039-1045
Hidden Markov model (HMM) based recognition of handwriting is now quite common, but the incorporation of HMM's into a complex stochastic language model for handwriting recognition is still in its infancy. We have taken advantage of developments in the speech processing field to build a more sophisticated handwriting recognition system. The pattern elements of the handwriting model are subcharacter stroke types modeled by HMMs. These HMMs are concatenated to form letter models, which are further embedded in a stochastic language model. In addition to better language modeling, we introduce new handwriting recognition features of various kinds. Some of these features have invariance properties, and some are segmental, covering a larger region of the input pattern. We have achieved a writer independent recognition rate of 94.5% on 3,823 unconstrained handwritten word samples from 18 writers covering a 32 word vocabulary 相似文献
3.
Claus Bahlmann Author Vitae 《Pattern recognition》2006,39(1):115-125
The selection of valuable features is crucial in pattern recognition. In this paper we deal with the issue that part of features originate from directional instead of common linear data. Both for directional and linear data a theory for a statistical modeling exists. However, none of these theories gives an integrated solution to problems, where linear and directional variables are to be combined in a single, multivariate probability density function. We describe a general approach for a unified statistical modeling, given the constraint that variances of the circular variables are small. The method is practically evaluated in the context of our online handwriting recognition system frog on hand and the so-called tangent slope angle feature. Recognition results are compared with two alternative modeling approaches. The proposed solution gives significant improvements in recognition accuracy, computational speed and memory requirements. 相似文献
4.
Ibrahim Abdelaziz Sherif Abdou Hassanin Al-Barhamtoshy 《Pattern Analysis & Applications》2016,19(4):1129-1141
The success of using Hidden Markov Models (HMMs) for speech recognition application has motivated the adoption of these models for handwriting recognition especially the online handwriting that has large similarity with the speech signal as a sequential process. Some languages such as Arabic, Farsi and Urdo include large number of delayed strokes that are written above or below most letters and usually written delayed in time. These delayed strokes represent a modeling challenge for the conventional left-right HMM that is commonly used for Automatic Speech Recognition (ASR) systems. In this paper, we introduce a new approach for handling delayed strokes in Arabic online handwriting recognition using HMMs. We also show that several modeling approaches such as context based tri-grapheme models, speaker adaptive training and discriminative training that are currently used in most state-of-the-art ASR systems can provide similar performance improvement for Hand Writing Recognition (HWR) systems. Finally, we show that using a multi-pass decoder that use the computationally less expensive models in the early passes can provide an Arabic large vocabulary HWR system with practical decoding time. We evaluated the performance of our proposed Arabic HWR system using two databases of small and large lexicons. For the small lexicon data set, our system achieved competing results compared to the best reported state-of-the-art Arabic HWR systems. For the large lexicon, our system achieved promising results (accuracy and time) for a vocabulary size of 64k words with the possibility of adapting the models for specific writers to get even better results. 相似文献
5.
6.
Mandal Subhasis Prasanna S. R. Mahadeva Sundaram Suresh 《International Journal on Document Analysis and Recognition》2019,22(1):1-14
International Journal on Document Analysis and Recognition (IJDAR) - The task of online handwriting recognition (HR) becomes often challenging due to the presence of confusing characters which are... 相似文献
7.
The design of optimal feature sets for visual classification problems is still one of the most challenging topics in the area of computer vision. In this work, we propose a new algorithm that computes optimal features, in the minimum Bayes error sense, for visual recognition tasks. The algorithm now proposed combines the fast convergence rate of feature selection (FS) procedures with the ability of feature extraction (FE) methods to uncover optimal features that are not part of the original basis function set. This leads to solutions that are better than those achievable by either FE or FS alone, in a small number of iterations, making the algorithm scalable in the number of classes of the recognition problem. This property is currently only available for feature extraction methods that are either sub-optimal or optimal under restrictive assumptions that do not hold for generic imagery. Experimental results show significant improvements over these methods, either through much greater robustness to local minima or by achieving significantly faster convergence. 相似文献
8.
以基于隐马尔可夫模型和统计语言模型的研究作为基础,着重研究联机手写哈萨克文的切分技术、连体段分类和特征参数的独特提取技术。系统先将提取延迟笔划后的连体段主笔划作为HMM识别器的输入,再根据被识别的主笔划的编号和延迟笔划标记从连体段分类词典中查找,找到对应的连体段识别结果。通过去除连体段延迟笔画的方法可以有效地减少需建立的模型数目,进而提高识别速度和避免由字符切分所带来的问题。 相似文献
9.
Jun Du Jian-Fang Zhai Jin-Shui Hu 《International Journal on Document Analysis and Recognition》2017,20(1):69-78
This paper proposes a novel framework of writer adaptation based on deeply learned features for online handwritten Chinese character recognition. Our motivation is to further boost the state-of-the-art deep learning-based recognizer by using writer adaptation techniques. First, to perform an effective and flexible writer adaptation, we propose a tandem architecture design for the feature extraction and classification. Specifically, a deep neural network (DNN) or convolutional neural network (CNN) is adopted to extract the deeply learned features which are used to build a discriminatively trained prototype-based classifier initialized by Linde–Buzo–Gray clustering techniques. In this way, the feature extractor can fully utilize the useful information of a DNN or CNN. Meanwhile, the prototype-based classifier could be designed more compact and efficient as a practical solution. Second, the writer adaption is performed via a linear transformation of the deeply learned features which is optimized with a sample separation margin-based minimum classification error criterion. Furthermore, we improve the generalization capability of the previously proposed discriminative linear regression approach for writer adaptation by using the linear interpolation of two transformations and adaptation data perturbation. The experiments on the tasks of both the CASIA-OLHWDB benchmark and an in-house corpus with a vocabulary of 20,936 characters demonstrate the effectiveness of our proposed approach. 相似文献
10.
Sebastián Peña Saldarriaga Christian Viard-Gaudin Emmanuel Morin 《International Journal on Document Analysis and Recognition》2010,13(2):159-171
Today, there is an increasing demand of efficient archival and retrieval methods for online handwritten data. For such tasks,
text categorization is of particular interest. The textual data available in online documents can be extracted through online
handwriting recognition; however, this process produces errors in the resulting text. This work reports experiments on the
categorization of online handwritten documents based on their textual contents. We analyze the effect of word recognition
errors on the categorization performances, by comparing the performances of a categorization system with the texts obtained
through online handwriting recognition and the same texts available as ground truth. Two well-known categorization algorithms
(kNN and SVM) are compared in this work. A subset of the Reuters-21578 corpus consisting of more than 2,000 handwritten documents
has been collected for this study. Results show that classification rate loss is not significant, and precision loss is only
significant for recall values of 60–80% depending on the noise levels. 相似文献
11.
M. Kobayashi S. Masaki O. Miyamoto Y. Nakagawa Y. Komiya T. Matsumoto 《International Journal on Document Analysis and Recognition》2001,3(3):181-191
A new algorithm RAV (reparameterized angle variations) is proposed which makes explicit use of trajectory information where the time evolution of the pen coordinates plays a crucial role. The algorithm is robust against stroke connections/abbreviations
as well as shape distortions, while maintaining reasonable robustness against stroke-order variations. Preliminary experiments
are reported on tests against the Kuchibue_d-96-02 database from the Tokyo University of Agriculture and Technology.
Received July 24, 2000 / Revised October 6, 2000 相似文献
12.
This paper compares the current state of the art in online Japanese character recognition with techniques in western handwriting recognition. It discusses important developments in preprocessing, classification, and postprocessing for Japanese character recognition in recent years and relates them to the developments in western handwriting recognition. Comparing eastern and western handwriting recognition techniques allows learning from very different approaches and understanding the underlying common foundations of handwriting recognition. This is very important when it comes to developing compact modules for integrated systems supporting many writing systems capable of recognizing multilanguage documents.Received: January 12, 2002, Accepted: March 6, 2003, Published online: 4 July 2003 相似文献
13.
Giovanni Seni John Seybold 《International Journal on Document Analysis and Recognition》1999,2(1):24-29
Out-of-order diacriticals introduce significant complexity to the design of an online handwriting recognizer, because they
require some reordering of the time domain information. It is common in cursive writing to write the body of an `i' or `t'
during the writing of the word, and then to return and dot or cross the letter once the word is complete. The difficulty arises
because we have to look ahead, when scoring one of these letters, to find the mark occurring later in the writing stream that
completes the letter. We should also remember that we have used this mark, so that we don't use it again for a different letter,
and we should also penalize a word if there are some marks that look like diacriticals that are not used. One approach to
this problem is to scan the writing some distance into the future to identify candidate diacriticals, remove them in a preprocessing
step, and associate them with the matching letters earlier in the word. If done as a preliminary operation, this approach
is error-prone: marks that are not diacriticals may be incorrectly identified and removed, and true diacriticals may be skipped.
This paper describes a novel extension to a forward search algorithm that provides a natural mechanism for considering alternative
treatments of potential diacriticals, to see whether it is better to treat a given mark as a diacritical or not, and directly
compare the two outcomes by score.
Received October 30, 1998 / Revised January 25, 1999 相似文献
14.
Tappert C.C. Suen C.Y. Wakahara T. 《IEEE transactions on pattern analysis and machine intelligence》1990,12(8):787-808
This survey describes the state of the art of online handwriting recognition during a period of renewed activity in the field. It is based on an extensive review of the literature, including journal articles, conference proceedings, and patents. Online versus offline recognition, digitizer technology, and handwriting properties and recognition problems are discussed. Shape recognition algorithms, preprocessing and postprocessing techniques, experimental systems, and commercial products are examined 相似文献
15.
Off-line recognition of Chinese handwriting by multifeature andmultilevel classification 总被引:1,自引:0,他引:1
Yuan Y. Tang Lo-Ting Tu Jiming Liu Seong-Whan Lee Win-Win Lin 《IEEE transactions on pattern analysis and machine intelligence》1998,20(5):556-561
In this paper, an off-line recognition system based on multifeature and multilevel classification is presented for handwritten Chinese characters. Ten classes of multifeatures, such as peripheral shape features, stroke density features, and stroke direction features, are used in this system. The multilevel classification scheme consists of a group classifier and a five-level character classifier, where two new technologies, overlap clustering and Gaussian distribution selector are developed. Experiments have been conducted to recognize 5,401 daily-used Chinese characters. The recognition rate is about 90 percent for a unique candidate, and 98 percent for multichoice with 10 candidates 相似文献
16.
A key concept in pattern recognition is that a pattern recognizer should be designed so as to minimize the errors it makes in classifying patterns. In this article, we review a recent, promising approach for minimizing the error rate of a classifier and describe a particular application to a simple, prototype-based speech recognizer. The key idea is to define a smooth, differentiable loss function that incorporates all adaptable classifier parameters and that approximates the actual performance error rate. Gradient descent can then be used to minimize this loss. This approach allows but does not require the use of explicitly probabilistic models. Furthermore, minimum error training does not involve the estimation of probability distributions that are difficult to obtain reliably. This new method has been applied to a variety of pattern recognition problems, with good results. Here we describe a particular application in which a relatively simple distance-based classifier is trained to minimize errors in speech recognition tasks. The loss function is defined so as to reflect errors at the level of the final, grammar-driven recognition output. Thus, minimization of this loss directly optimizes the overall system performance. 相似文献
17.
18.
Philippe Dreuw Georg Heigold Hermann Ney 《International Journal on Document Analysis and Recognition》2011,14(3):273-288
We present a novel confidence- and margin-based discriminative training approach for model adaptation of a hidden Markov model
(HMM)-based handwriting recognition system to handle different handwriting styles and their variations. Most current approaches
are maximum-likelihood (ML) trained HMM systems and try to adapt their models to different writing styles using writer adaptive
training, unsupervised clustering, or additional writer-specific data. Here, discriminative training based on the maximum
mutual information (MMI) and minimum phone error (MPE) criteria are used to train writer-independent handwriting models. For
model adaptation during decoding, an unsupervised confidence-based discriminative training on a word and frame level within
a two-pass decoding process is proposed. The proposed methods are evaluated for closed-vocabulary isolated handwritten word
recognition on the IFN/ENIT Arabic handwriting database, where the word error rate is decreased by 33% relative compared to
a ML trained baseline system. On the large-vocabulary line recognition task of the IAM English handwriting database, the word
error rate is decreased by 25% relative. 相似文献
19.
Zouari Ramzi Boubaker Houcine Kherallah Monji 《Multimedia Tools and Applications》2019,78(9):12103-12123
Multimedia Tools and Applications - Recently, several researches were carried on handwritten document analysis field thanks to the evolution of data capture technologies. For a given document,... 相似文献
20.
Sim K.C. Gales M.J.F. 《IEEE transactions on audio, speech, and language processing》2006,14(3):882-889
Gaussian mixture models (GMMs) are commonly used as the output density function for large-vocabulary continuous speech recognition (LVCSR) systems. A standard problem when using multivariate GMMs to classify data is how to accurately represent the correlations in the feature vector. Full covariance matrices yield a good model, but dramatically increase the number of model parameters. Hence, diagonal covariance matrices are commonly used. Structured precision matrix approximations provide an alternative, flexible, and compact representation. Schemes in this category include the extended maximum likelihood linear transform and subspace for precision and mean models. This paper examines how these precision matrix models can be discriminatively trained and used on state-of-the-art speech recognition tasks. In particular, the use of the minimum phone error criterion is investigated. Implementation issues associated with building LVCSR systems are also addressed. These models are evaluated and compared using large vocabulary continuous telephone speech and broadcast news English tasks. 相似文献