共查询到20条相似文献,搜索用时 46 毫秒
1.
Nikolai Gorski Valery Anisimov Emmanuel Augustin Olivier Baret Sergey Maximov 《International Journal on Document Analysis and Recognition》2001,3(4):196-206
This paper presents the current state of the A2iA CheckReaderTM – a commercial bank check recognition system. The system is designed to process the flow of payment documents associated
with the check clearing process: checks themselves, deposit slips, money orders, cash tickets, etc. It processes document
images and recognizes document amounts whatever their style and type – cursive, hand- or machine printed – expressed as numerals
or as phrases. The system is adapted to read payment documents issued in different English- or French-speaking countries.
It is currently in use at more than 100 large sites in five countries and processes daily over 10 million documents. The average
read rate at the document level varies from 65 to 85% with a misread rate corresponding to that of a human operator (1%).
Received October 13, 2000 / Revised December 4, 2000 相似文献
2.
3.
Alceu de S. Britto Jr Robert Sabourin Flavio Bortolozzi Ching Y. Suen 《International Journal on Document Analysis and Recognition》2003,5(2-3):102-117
In this paper, a two-stage HMM-based recognition method allows us to compensate for the possible loss in terms of recognition
performance caused by the necessary trade-off between segmentation and recognition in an implicit segmentation-based strategy.
The first stage consists of an implicit segmentation process that takes into account some contextual information to provide
multiple segmentation-recognition hypotheses for a given preprocessed string. These hypotheses are verified and re-ranked
in a second stage by using an isolated digit classifier. This method enables the use of two sets of features and numeral models:
one taking into account both the segmentation and recognition aspects in an implicit segmentation-based strategy, and the
other considering just the recognition aspects of isolated digits. These two stages have been shown to be complementary, in
the sense that the verification stage compensates for the loss in terms of recognition performance brought about by the necessary
tradeoff between segmentation and recognition carried out in the first stage. The experiments on 12,802 handwritten numeral
strings of different lengths have shown that the use of a two-stage recognition strategy is a promising idea. The verification
stage brought about an average improvement of 9.9% on the string recognition rates. On touching digit pairs, the method achieved
a recognition rate of 89.6%.
Received June 28, 2002 / Revised July 03, 2002 相似文献
4.
E. Kavallieratou N. Fakotakis G. Kokkinakis 《International Journal on Document Analysis and Recognition》2002,4(4):226-242
In this paper, an integrated offline recognition system for unconstrained handwriting is presented. The proposed system consists
of seven main modules: skew angle estimation and correction, printed-handwritten text discrimination, line segmentation, slant
removing, word segmentation, and character segmentation and recognition, stemming from the implementation of already existing
algorithms as well as novel algorithms. This system has been tested on the NIST, IAM-DB, and GRUHD databases and has achieved
accuracy that varies from 65.6% to 100% depending on the database and the experiment. 相似文献
5.
6.
Chi Fang Changsong Liu Liangrui Peng Xiaoqing Ding 《International Journal on Document Analysis and Recognition》2002,4(3):177-182
Performance evaluation is crucial for improving the performance of OCR systems. However, this is trivial and sophisticated
work to do by hand. Therefore, we have developed an automatic performance evaluation system for a printed Chinese character
recognition (PCCR) system. Our system is characterized by using real-world data as test data and automatically obtaining the
performance of the PCCR system by comparing the correct text and the recognition result of the document image. In addition,
our performance evaluation system also provides some evaluation of performance for the segmentation module, the classification
module, and the post-processing module of the PCCR system. For this purpose, a segmentation error-tolerant character-string
matching algorithm is proposed to obtain the correspondence between the correct text and the recognition result. The experiments
show that our performance evaluation system is an accurate and powerful tool for studying deficiencies in the PCCR system.
Although our approach is aimed at the PCCR system, the idea also can be applied to other OCR systems. 相似文献
7.
A labelling approach for the automatic recognition of tables of contents (ToC) is described in this paper. A prototype is
used for the electronic consulting of scientific papers in a digital library system named Calliope. This method operates on
a roughly structured ASCII file, produced by OCR. The recognition approach operates by text labelling without using any a
priori model. Labelling is based on part-of-speech tagging (PoS) which is initiated by a primary labelling of text components
using some specific dictionaries. Significant tags are first grouped into homogeneous classes according to their grammar categories
and then reduced in canonical forms corresponding to article fields: “title” and “authors”. Non-labelled tokens are integrated
in one or another field by either applying PoS correction rules or using a structure model generated from well-detected articles.
The designed prototype operates very well on different ToC layouts and character recognition qualities. Without manual intervention,
a 96.3% rate of correct segmentation was obtained on 38 journals, including 2,020 articles, accompanied by a 93.0% rate of
correct field extraction.
Received April 5, 2000 / Revised February 19, 2001 相似文献
8.
Dot-matrix text recognition is a difficult problem, especially when characters are broken into several disconnected components.
We present a dot-matrix text recognition system which uses the fact that dot-matrix fonts are fixed-pitch, in order to overcome
the difficulty of the segmentation process. After finding the most likely pitch of the text, a decision is made as to whether
the text is written in a fixed-pitch or proportional font. Fixed-pitch text is segmented using a pitch-based segmentation
process that can successfully segment both touching and broken characters. We report performance results for the pitch estimation,
fixed-pitch decision and segmentation, and recognition processes.
Received October 18, 1999 / Revised April 21, 2000 相似文献
9.
Toshio Sato Takeo Kanade Ellen K. Hughes Michael A. Smith Shin'ichi Satoh 《Multimedia Systems》1999,7(5):385-395
The automatic extraction and recognition of news captions and annotations can be of great help locating topics of interest
in digital news video libraries. To achieve this goal, we present a technique, called Video OCR (Optical Character Reader),
which detects, extracts, and reads text areas in digital video data. In this paper, we address problems, describe the method
by which Video OCR operates, and suggest applications for its use in digital news archives. To solve two problems of character
recognition for videos, low-resolution characters and extremely complex backgrounds, we apply an interpolation filter, multi-frame
integration and character extraction filters. Character segmentation is performed by a recognition-based segmentation method,
and intermediate character recognition results are used to improve the segmentation. We also include a method for locating
text areas using text-like properties and the use of a language-based postprocessing technique to increase word recognition
rates. The overall recognition results are satisfactory for use in news indexing. Performing Video OCR on news video and combining
its results with other video understanding techniques will improve the overall understanding of the news video content. 相似文献
10.
Sonia Garcia-Salicetti Bernadette Dorizzi Patrick Gallinari Zsolt Wimmer 《International Journal on Document Analysis and Recognition》2001,4(1):56-68
In this paper, we present a hybrid online handwriting recognition system based on hidden Markov models (HMMs). It is devoted
to word recognition using large vocabularies. An adaptive segmentation of words into letters is integrated with recognition,
and is at the heart of the training phase. A word-model is a left-right HMM in which each state is a predictive multilayer
perceptron that performs local regression on the drawing (i.e., the written word) relying on a context of observations. A
discriminative training paradigm related to maximum mutual information is used, and its potential is shown on a database of
9,781 words.
Received June 19, 2000 / Revised October 16, 2000 相似文献
11.
During the last forty years, Human Handwriting Processing (HHP) has most often been investigated under the frameworks of
character (OCR) and pattern recognition. In recent years considerable progress has been made, and to date HHP can be viewed
much more as an automatic Handwriting Reading (HR) task for the machine. In this paper we propose the use of handwriting invariants,
a physical model for a first segmentation, a logical model for segmentation and recognition, a fundamental equation of handwriting,
and to integrate several sources of perception and of knowledge in order to design Handwriting Reading Systems (HRS), which
would be more universal systems than is currently the case.
At the dawn of the 3rd millennium, we guess that HHP will be considered more as a perceptual and interpretation task requiring
knowledge gained from studies on human language. This paper gives some guidelines and presents examples to design systems
able to perceive and interpret, i.e., read, handwriting automatically.
Received October 30, 1998 / Revised January 30, 1999 相似文献
12.
In this paper we describe a database that consists of handwritten English sentences. It is based on the Lancaster-Oslo/Bergen
(LOB) corpus. This corpus is a collection of texts that comprise about one million word instances. The database includes 1,066
forms produced by approximately 400 different writers. A total of 82,227 word instances out of a vocabulary of 10,841 words
occur in the collection. The database consists of full English sentences. It can serve as a basis for a variety of handwriting
recognition tasks. However, it is expected that the database would be particularly useful for recognition tasks where linguistic
knowledge beyond the lexicon level is used, because this knowledge can be automatically derived from the underlying corpus.
The database also includes a few image-processing procedures for extracting the handwritten text from the forms and the segmentation
of the text into lines and words.
Received September 28, 2001 / Revised October 10, 2001 相似文献
13.
Automatic text segmentation and text recognition for video indexing 总被引:13,自引:0,他引:13
Efficient indexing and retrieval of digital video is an important function of video databases. One powerful index for retrieval
is the text appearing in them. It enables content-based browsing. We present our new methods for automatic segmentation of
text in digital videos. The algorithms we propose make use of typical characteristics of text in videos in order to enable
and enhance segmentation performance. The unique features of our approach are the tracking of characters and words over their
complete duration of occurrence in a video and the integration of the multiple bitmaps of a character over time into a single
bitmap. The output of the text segmentation step is then directly passed to a standard OCR software package in order to translate
the segmented text into ASCII. Also, a straightforward indexing and retrieval scheme is introduced. It is used in the experiments
to demonstrate that the proposed text segmentation algorithms together with existing text recognition algorithms are suitable
for indexing and retrieval of relevant video sequences in and from a video database. Our experimental results are very encouraging
and suggest that these algorithms can be used in video retrieval applications as well as to recognize higher level semantics
in videos. 相似文献
14.
针对带表格的中文支票小写金额的自动识别问题,提出了一种多模式切分和识别算法。根据小写金额不同部分的切分和识别难度,采取了3种递进的模式:预切分模式、连写0检测模式和基于识别的切分模式。其中预切分模式用来处理小写金额中不粘连的单字;连写0检测模式用来检测并识别连写的0;基于识别的切分模式用来处理非连写0的粘连部分,在这个模式中采用了遗传算法来加速最优解的搜索过程。利用从银行采集的1053张真实支票样本进行测试,在拒识率为33.6%时,小写金额串的识别率达到66.1%,实验结果证明这种算法可以提高真实支票小写金额的识别率。 相似文献
15.
Detection, segmentation, and classification of specific objects are the key building blocks of a computer vision system for
image analysis. This paper presents a unified model-based approach to these three tasks. It is based on using unsupervised
learning to find a set of templates specific to the objects being outlined by the user. The templates are formed by averaging
the shapes that belong to a particular cluster, and are used to guide a probabilistic search through the space of possible
objects. The main difference from previously reported methods is the use of on-line learning, ideal for highly repetitive
tasks. This results in faster and more accurate object detection, as system performance improves with continued use. Further,
the information gained through clustering and user feedback is used to classify the objects for problems in which shape is
relevant to the classification. The effectiveness of the resulting system is demonstrated in two applications: a medical diagnosis
task using cytological images, and a vehicle recognition task.
Received: 5 November 2000 / Accepted: 29 June 2001
Correspondence to: K.-M. Lee 相似文献
16.
In this paper, we present a general model for Arabic bank check processing indicating the major phases of a check processing system. We then survey the available databases for Arabic bank check processing research. The state of the art in the different phases of Arabic bank check processing is surveyed (i.e., pre-processing, check analysis and segmentation, features extraction, and legal and courtesy amounts recognition). The open issues for future research are stated and areas that need improvements are presented. To the best of our knowledge, it is the first survey of Arabic bank check processing. 相似文献
17.
18.
S. Jaeger S. Manke J. Reichert A. Waibel 《International Journal on Document Analysis and Recognition》2001,3(3):169-180
This paper presents the online handwriting recognition system NPen++ developed at the University of Karlsruhe and Carnegie
Mellon University. The NPen++ recognition engine is based on a multi-state time delay neural network and yields recognition
rates from 96% for a 5,000 word dictionary to 93.4% on a 20,000 word dictionary and 91.2% for a 50,000 word dictionary. The
proposed tree search and pruning technique reduces the search space considerably without losing too much recognition performance
compared to an exhaustive search. This enables the NPen++ recognizer to be run in real-time with large dictionaries. Initial
recognition rates for whole sentences are promising and show that the MS-TDNN architecture is suited to recognizing handwritten
data ranging from single characters to whole sentences.
Received September 3, 2000 / Revised October 9, 2000 相似文献
19.
An architecture for handwritten text recognition systems 总被引:1,自引:1,他引:0
Gyeonghwan Kim Venu Govindaraju Sargur N. Srihari 《International Journal on Document Analysis and Recognition》1999,2(1):37-44
This paper presents an end-to-end system for reading handwritten page images. Five functional modules included in the system
are introduced in this paper: (i) pre-processing, which concerns introducing an image representation for easy manipulation
of large page images and image handling procedures using the image representation; (ii) line separation, concerning text line
detection and extracting images of lines of text from a page image; (iii) word segmentation, which concerns locating word
gaps and isolating words from a line of text image obtained efficiently and in an intelligent manner; (iv) word recognition,
concerning handwritten word recognition algorithms; and (v) linguistic post-pro- cessing, which concerns the use of linguistic
constraints to intelligently parse and recognize text. Key ideas employed in each functional module, which have been developed
for dealing with the diversity of handwriting in its various aspects with a goal of system reliability and robustness, are
described in this paper. Preliminary experiments show promising results in terms of speed and accuracy.
Received October 30, 1998 / Revised January 15, 1999 相似文献
20.
一种手写体大写金额串的分割新方法 总被引:3,自引:0,他引:3
手写体大写金额串的分割将直接影响识别的准确率。为了提高分割的准确率,同时保证较快的分割速度,本文采用了由粗分割和细分割组成的两步分割方法。重点介绍交叉字符和相连字符的分割方法。对于交叉的字符提出了加窗处理的中点连线分割方法,它较其它方法具有简单准确的优点;对于单笔相连的字符,先在细化字符图象上找到候选笔划的候选分割点,然后用本文提出的简明的评价准则来确定最优分割点,提高了粗分割的精度。上述方法应用于银行支票手写体大写金额的分割,取得了很好的分割效果。 相似文献