共查询到19条相似文献,搜索用时 125 毫秒
1.
基于背景分析的手写数字切分方法 总被引:1,自引:0,他引:1
考虑到手写数字串的结构特点,提出一种以分析特定背景区域为主,针对具体粘连数字采用不同分割策略的切分方法。文中引入蓄水池的概念来形象描绘出背景区域中字符间的粘连部分,并从中抽取某些特征对字符的具体粘连形态进行了归纳分类。在分割过程中,根据字符的粘连类型选用不同的滴水算法来求得分割路径。实验结果表明了该方法对于手写数字分割的有效性。 相似文献
2.
3.
主曲线是一种新的基于非线性变换的特征抽取方法,它是一种通过数据分布“中间”并满足“自相合”的光滑曲线来进行特征提取的方法。为了提高手写数字串切分的正确率,提出了一种基于笔划组合的手写数字串切分方法。该方法首先使用主曲线完成字符模板的笔划抽取,然后以字符识别器提供的置信度为依据来组合笔划,以实现手写数字串的切分过程。另外,在字符识别器设计方面,则是使用基于数字轮廓分段特征与规范化模板特征这两个单特征分类器组合。实验表明,分别基于这两个特征的分类器具有较强的互补性。由于字符识别器的置信度难以真实反映识别结果,为此需使用类条件置信变换法,通过估计分类器的后验概率来对识别器的置信度进行修正。实验结果表明,该方法对于手写数字的分割是有效的。 相似文献
4.
5.
6.
7.
针对现有的切分算法结构复杂,时间和空间复杂度高等不足,提出了一种基于凹凸特性的非限制粘连手写数字串切分的新方法。首先计算数字串图像的赋值背景,然后从中提取凹凸特性,找到切分区域,最后在切分区域内提取切分线。该方法简单快速,在提高切分正确率的同时也降低了复杂度。利用NISTSD19收集到的样本进行实验,正确率高达97.5%,切分时间也大大缩短。 相似文献
8.
联机手写笔画特征抽取的逼近-合并算法 总被引:1,自引:0,他引:1
为了对联机手写字符识别的笔画进行精确描述,提出了一种基于字符笔画特征抽取的"逼近-合并"算法.该算法分析了字符笔画的多边形逼近,求出偏离度最小的多边形逼近,并对该多边形的边进行合并,抽取出笔画方向码,实现了联机手写字符笔画的更有效合并.该方法应用在联机手写体字符识别实验系统中,其识别率为99.13%. 相似文献
9.
文字的正确识别与否很大程度上取决于能否对文字进行有效的分割。在分析传统手写文字文本图像切分算法的基础上,提出一种基于像素点的文字分割方法。算法具有很强的抗干扰性,能够有效地解决手写过程中的小角度歪斜以及字符笔画断裂和粘连的问题。通过MATLAB的仿真试验,结果表明此方法能够对手写过程中普遍存在歪斜的文字进行有效的分割。 相似文献
10.
基于可伸缩矢量图SVG的在线手写汉字是以SVG图像作为汉字图像格式、以SVG的path对象作为笔画的基本存储单元来对汉字进行显示和存储的,笔画的轮廓是以手写过程中记录的坐标值作为特征数值加以确定的。基于此种SVG手写汉字存储和表示形式,本文提出一种基于图论的在线连续手写汉字多步分割方法。该方法根据汉字笔画间的坐标位置关系对手写笔画序列构建无向图模型,并利用图的广度优先搜索将原笔画序列分割为互不连通的笔画部件,使偏旁部首分离较远、非粘连汉字得到正确分割;然后利用改进的tarjan算法对部件中的粘连字符进行分割,最后基于笔画部件间距,利用二分类迭代算法对间距进行分类,找出全局最佳分割位置,对过分割的部件进行重组合并。实验结果表明,该方法对于在线手写汉字的分割是有效可行的。 相似文献
11.
Segmentation of single- or multiple-touching handwritten numeral string using background and foreground analysis 总被引:10,自引:0,他引:10
Yi-Kai Chen Jhing-Fa Wang 《IEEE transactions on pattern analysis and machine intelligence》2000,22(11):1304-1317
An approach of segmenting a single- or multiple-touching handwritten numeral string (two-digits) is proposed. Most algorithms for segmenting connected digits mainly focus on the analysis of foreground pixels. Some concentrated on the analysis of background pixels only and others are based on a recognizer. We combine background and foreground analysis to segment single- or multiple-touching handwritten numeral strings. Thinning of both foreground and background regions are first processed on the image of connected numeral strings and the feature points on foreground and background skeletons are extracted. Several possible segmentation paths are then constructed and useless strokes are removed. Finally, the parameters of geometric properties of each possible segmentation paths are determined and these parameters are analyzed by the mixture Gaussian probability function to decide the best segmentation path or reject it. Experimental results on NIST special database 19 (an update of NIST special database 3) and some other images collected by ourselves show that our algorithm can get a correct rate of 96 percent with rejection rate of 7.8 percent, which compares favorably with those reported in the literature. 相似文献
12.
13.
一种无约束手写体数字串分割方法 总被引:11,自引:1,他引:11
针对无约束手写体数字串中的连笔字符,本文提出以基于识别的分割方法为主,结合运用剖分方法和全局识别方法等多种分割策略的数字串分割方法。这种方法直接针对数字串分割,也可以运用到非数字字符串的分割中,其分割思想对连笔汉字的分割也具有一定指导意义。 相似文献
14.
粘连断裂字符行的切分识别,是很多OCR 实际应用中存在的主要困难之一. 本文针对粘连断裂的印刷体数字行,提出了一种基于Viterbi 算法的切分识别方案,该方案采用两次切分识别的层次型结构. 在第二次切分识别过程中,首先,在候选切分点区域,结合灰度图像与二值轮廓信息,采用基于Viterbi 算法搜索的非直线路径进行切分,得到有效的切分路径;然后,结合分类器输出的可信度,采用Viterbi 算法来合并前面得到的候选切分图像块,进行动态切分与识别. 实际的金融票据识别系统实验表明,本文提出的印刷体数字行切分识别方法能够较好的克服字符行的粘连与断裂情况,提高了识别系统的识别率和鲁棒性. 相似文献
15.
For the first time, a genetic framework using contextual knowledge is proposed for segmentation and recognition of unconstrained handwritten numeral strings. New algorithms have been developed to locate feature points on the string image, and to generate possible segmentation hypotheses. A genetic representation scheme is utilized to show the space of all segmentation hypotheses (chromosomes). For the evaluation of segmentation hypotheses, a novel evaluation scheme is introduced, in order to improve the outlier resistance of the system. Our genetic algorithm tries to search and evolve the population of segmentation hypotheses, and to find the one with the highest segmentation/recognition confidence. The NIST NSTRING SD19 and CENPARMI databases were used to evaluate the performance of our proposed method. Our experiments showed that proper use of contextual knowledge in segmentation, evaluation and search greatly improves the overall performance of the system. On average, our system was able to obtain correct recognition rates of 95.28% and 96.42% on handwritten numeral strings using neural network and support vector classifiers, respectively. These results compare favorably with the ones reported in the literature. 相似文献
16.
《Image and vision computing》1987,5(1):3-9
A context-directed algorithm is proposed for segmenting connected numeral strings into their components. The algorithm is hierarchical (tree-like structure) in the sense that it tests various hypotheses ranging from the case where the numerals are completely isolated to that where the numerals may be connected, touching and/or existing in overlapping fields. Test results indicate that the algorithm is very effective in providing an accurate segmentation in a form suitable for further processing by a recognition algorithm. 相似文献
17.
18.
Shuyan ZhaoAuthor Vitae Zheru ChiAuthor Vitae Penfei ShiAuthor VitaeHong YanAuthor Vitae 《Pattern recognition》2003,36(1):145-156
Correct segmentation of handwritten Chinese characters is crucial to their successful recognition. However, due to many difficulties involved, little work has been reported in this area. In this paper, a two-stage approach is presented to segment unconstrained handwritten Chinese characters. A handwritten Chinese character string is first coarsely segmented according to the background skeleton and vertical projection after a proper image preprocessing. With several geometric features, all possible segmentation paths are evaluated by using the fuzzy decision rules learned from examples. As a result, unsuitable segmentation paths are discarded. In the fine segmentation stage that follows, the strokes that may contain segmentation points are first identified. The feature points are then extracted from candidate strokes and taken as segmentation point candidates through each of which a segmentation path may be formed. The geometric features similar to the coarse segmentation stage are used and corresponding fuzzy decision rules are generated to evaluate fine segmentation paths. Experimental results on 1000 Chinese character strings from postal mail show that our approach can achieve a reasonable good overall accuracy in segmenting unconstrained handwritten Chinese characters. 相似文献
19.
手写体数字字符串识别常用于邮件自动分拣、银行票据和财务报表的录入中,针对其分割识别算法复杂度较高、准确率较低的问题,提出一种多分类器下无分割手写数字字符串识别算法。该算法的核心是采用四个分类器实现粘连字符串的无分割识别;将残差结构应用于LeNet-5网络,以增加网络深度,提高识别准确率,加快收敛速度;使用动态选择策略,以避免长度分类器误分类对识别结果的影响。实验结果表明,在NIST SD19一位数字和Synthetic数据集训练网络下,使用NIST SD19上长度为2、3、4、5、6的字符串验证网络,其识别准确率分别为99.3%、98.5%、98.1%、96.6%和97.2%。 相似文献