首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 10 毫秒
1.
一种手写体汉字识别的神经网络多分类器集成方案   总被引:1,自引:1,他引:1  
万红梅 《计算机工程》2004,30(16):151-152
提出了一种基于单字单网的手写体汉字识别纯神经网络的多分类器集成方案,并通过实验证明用该方案实现的神经网络集成系统性能均比任一个神经网络单分类器都好,对1 000种不同的手写体汉字的1 000×10个字进行测试,集成后的识别率最高达到95.22%,比单分类器的识别率高出5.0%-8.7%。  相似文献   

2.
In this paper, we present a methodology for off-line handwritten character recognition. The proposed methodology relies on a new feature extraction technique based on recursive subdivisions of the character image so that the resulting sub-images at each iteration have balanced (approximately equal) numbers of foreground pixels, as far as this is possible. Feature extraction is followed by a two-stage classification scheme based on the level of granularity of the feature extraction method. Classes with high values in the confusion matrix are merged at a certain level and for each group of merged classes, granularity features from the level that best distinguishes them are employed. Two handwritten character databases (CEDAR and CIL) as well as two handwritten digit databases (MNIST and CEDAR) were used in order to demonstrate the effectiveness of the proposed technique. The recognition result achieved, in comparison to the ones reported in the literature, is the highest for the well-known CEDAR Character Database (94.73%) and among the best for the MNIST Database (99.03%)  相似文献   

3.
Since Chinese characters are composed from a small set of fundamental shapes (radicals) the problem of recognising large numbers of characters can be converted to that of extracting a small number of radicals and then finding their optimal combination. In this paper, radical extraction is carried out by nonlinear active shape models, in which kernel principal component analysis is employed to capture the nonlinear variation. Treating Chinese character composition as a discrete Markov process, we also propose an approach to recognition with the Viterbi algorithm. Our initial experiments are conducted on off-line recognition of 430,800 loosely-constrained characters, comprised of 200 radical categories covering 2154 character categories from 200 writers. The correct recognition rate is 93.5% characters correct (writer-independent). Consideration of published figures for existing radical approaches suggests that our method achieves superior performance.  相似文献   

4.
手写汉字的集群识别   总被引:3,自引:0,他引:3  
为了降低单个汉字的分辨率,论文分析了通用的汉字识别模型,并在此基础上建立了适于多字识别的集群识别模型。为了充分论证集群识别模型的观点,本文从理论证明和实验两方面获得支持根据。实验结果表明基于多字识别模型的集群识别能可靠提高对连续文字的识别效果,是手写汉字识别中很有希望的发展方向。  相似文献   

5.
针对传统两级手写汉字识别系统中手写汉字识别的特征提取方法的限制问题,提出了一种采用卷积神经网对相似汉字自动学习有效特征进行识别的系统方法。该方法采用来自手写云平台上的大数据来训练模型,基于频度统计生成相似子集,进一步提高识别率。实验表明,相对于传统的基于梯度特征的支持向量机和最近邻分类器方法,该方法的识别率有一定的提高。  相似文献   

6.
Previous handwritten numeral recognition algorithms applied structural classification to extract geometric primitives that characterize each image, and then utilized artificial intelligence methods, like neural network or fuzzy memberships, to classify the images. We propose a handwritten numeral recognition methodology based on simplified structural classification, by using a much smaller set of primitive types, and fuzzy memberships. More specifically, based on three kinds of feature points, we first extract five kinds of primitive segments for each image. A fuzzy membership function is then used to estimate the likelihood of these primitives being close to the two vertical boundaries of the image. Finally, a tree-like classifier based on the extracted feature points, primitives and fuzzy memberships is applied to classify the numerals. With our system, handwritten numerals in NIST Special Database 19 are recognized with correct rate between 87.33% and 88.72%.  相似文献   

7.
基于双弹性网格的手写体汉字识别   总被引:5,自引:0,他引:5  
特征提取是手写体汉字识别的关键,目前四方向网格特征已被实验证实是一种较好的手写体汉字特征。针对通常的纵横弹性网格对汉字“撇、捺”笔画特征提取的不足,提出一种新的网格构造技术——对角弹性网格,它由45°和135°的对角直线构成,将汉字图像划分为多个菱形,能够很好地适应汉字在“撇、捺”方向的变化。将这两种网格单独,以及相互组合成双网格等情况分别进行手写体识别实验,实验结果验证了对角弹性网格的有效性和双弹性网格的高识别率性。  相似文献   

8.
To improve the accuracy of handwritten Chinese character recognition (HCCR), we propose linear discriminant analysis (LDA)-based compound distances for discriminating similar characters. The LDA-based method is an extension of previous compound Mahalanobis function (CMF), which calculates a complementary distance on a one-dimensional subspace (discriminant vector) for discriminating two classes and combines this complementary distance with a baseline quadratic classifier. We use LDA to estimate the discriminant vector for better discriminability and show that under restrictive assumptions, the CMF is a special case of our LDA-based method. Further improvements can be obtained when the discriminant vector is estimated from higher-dimensional feature spaces. We evaluated the methods in experiments on the ETL9B and CASIA databases using the modified quadratic discriminant function (MQDF) as baseline classifier. The results demonstrate the superiority of LDA-based method over the CMF and the superiority of discriminant vector learning from high-dimensional feature spaces. Compared to the MQDF, the proposed method reduces the error rates by factors of over 26%.  相似文献   

9.
ART2神经网络在手写体汉字识别中的应用   总被引:4,自引:0,他引:4  
该文提出了一种基于神经网络的手写体汉字识别方法,该算法充分利用神经网络的自适应学习能力。ART2网络通过竞争学习和自稳机制原理实现分类,可以在非平稳的、有干扰的环境中进行无教师无监督的自学习。其学习过程是自组织的实时学习,能够迅速识别已学习过的样本,并能迅速适应未学习过的新对象。考虑到Gabor滤波器具有优良的方向性,该算法采用Gabor特征作为字符特征。Gabor特征反映字符的空间分布特征,而且可以组合成高维矢量,特别适用于汉字识别这大型模式识别场合。实验结果显示,该算法对测试样本识别正确率达到94%,比其他方法更准确、更可靠。  相似文献   

10.
一种识别手写汉字的多分类器集成方法   总被引:4,自引:1,他引:3  
根据多信源信息处理与字符识别的经验知识,提出了一个识别手写汉字的多分类器线性集成模型.这个模型不仅考虑到不同的分类器对不同字符识别能力的不同,而且还考虑了不同的分类器得出的输入字符与参考模板之间相似度的实际大小对判决的影响,及不同分类器提供的候选字符对判决的支持作用,更重要的是提供了一种通过监督学习,利用计算机程序自动计算模型参数的方法,因而实现了一个较好的集成系统.同时,本文还提供了三个用于集成的分类器,它们集成的结果充分显示了本方法的有效性.  相似文献   

11.
王建平  蔺菲  陈军 《计算机工程》2007,33(10):230-232,248
提出了手写体汉字笔画宽度提取、基于提取出的笔画宽度归一化手写体汉字的方法,给出手写体汉字笔画重构的思想,实现了一种基于手写体汉字笔画提取的汉字重构并最终识别手写体汉字的算法,构建了手写体汉字的识别系统。实验证实,该方法可保证原有笔画特征信息,且能有效地识别手写体汉字。  相似文献   

12.
A neural network for recognition of handwritten musical notes, based on the well-known Neocognitron model, is described. The Neocognitron has been used for the what pathway (symbol recognition), while contextual knowledge has been applied for the where (symbol placement). This way, we benefit from dividing the process for dealing with this complicated recognition task. Also, different degrees of intrusiveness in learning have been incorporated in the same network: More intrusive supervised learning has been implemented in the lower neuron layers and less intrusive in the upper one. This way, the network adapts itself to the handwriting of the user. The network consists of a 13×49 input layer and three pairs of simple and complex neuron layers. It has been trained to recognize 20 symbols of unconnected notes on a musical staff and was tested with a set of unlearned input notes. Its recognition rate for the individual unseen notes was up to 93%, averaging 80% for all categories. These preliminary results indicate that a modified Neocognitron could be a good candidate for identification of handwritten musical notes.  相似文献   

13.
脱机手写体汉字识别研究综述   总被引:7,自引:2,他引:7  
脱机手写体汉字识别是汉字识别领域中最难的课题。该文对目前脱机手写体汉字识别在预处理、特征提取、分类识别及后处理四个阶段主要采用的方法做了简要介绍,阐述了各种方法的优缺点,并提出了一种将支持向量机有效地用于解决多分类问题的策略。最后根据目前的研究状况,指出今后研究中需要注意的问题和研究的发展方向。  相似文献   

14.
This paper describes and analyses the performance of a novel feature extraction technique for the recognition of segmented/cursive characters that may be used in the context of a segmentation-based handwritten word recognition system. The modified direction feature (MDF) extraction technique builds upon the direction feature (DF) technique proposed previously that extracts direction information from the structure of character contours. This principal was extended so that the direction information is integrated with a technique for detecting transitions between background and foreground pixels in the character image.In order to improve on the DF extraction technique, a number of modifications were undertaken. With a view to describe the character contour more effectively, a re-design of the direction number determination technique was performed. Also, an additional global feature was introduced to improve the recognition accuracy for those characters that were most frequently confused with patterns of similar appearance. MDF was tested using a neural network-based classifier and compared to the DF and transition feature (TF) extraction techniques. MDF outperformed both DF and TF techniques using a benchmark dataset and compared favourably with the top results in the literature. A recognition accuracy of above 89% is reported on characters from the CEDAR dataset.  相似文献   

15.
脱机手写体汉字识别中多模板字典的制作方法   总被引:1,自引:0,他引:1  
高彦宇  杨扬  颉斌 《计算机工程》2005,31(2):171-173
针对手写体汉字字形变化复杂的特点,提出了一种基于特征向量分布的多模板字典制作方法,并通过赋予该多模板字典自学习功能,来提高其泛化能力。实验证明,用这种方法制作的字典与用传统方法制作的同类字典相比,能够有效地提高系统的总体识别率。  相似文献   

16.
A stroke-based approach to extract skeletons and structural features for handwritten Chinese character recognition is proposed. We first determine stroke directions based on the directional run-length information of binary character patterns. According to the stroke directions and their adjacent relationships, we split strokes into stroke and fork segments, and then extract the skeletons of the stroke segments called skeleton segments. After all skeleton segments are extracted, fork segments are processed to find the fork points and fork degrees. Skeleton segments that touch a fork segment are connected at the fork point, and all connected skeleton segments form the character skeleton. According to the extracted skeletons and fork points, we can extract primitive strokes and stroke direction maps for recognition. A simple classifier based on the stroke direction map is presented to recognize regular and rotated characters to verify the ability of the proposed feature extraction for handwritten Chinese character recognition. Several experiments are carried out, and the experimental results show that the proposed approach can easily and effectively extract skeletons and structural features, and works well for handwritten Chinese character recognition.  相似文献   

17.
提出了曲线化汉字特征的概念,讨论了3类可曲线化的汉字统计特征.利用P形傅里叶算子,对这些曲线化特征作傅里叶变换,可提取到最终的识别特征.实验结果表明,该方法在手写体汉字识别领域具有良好的应用前景,尤其适用于细分类.  相似文献   

18.
结合距离分类器的神经网络手写体汉字识别   总被引:2,自引:1,他引:1  
手写体汉字识别技术中如何解决复杂的大类别识别问题,是汉字识别中的一个难点。该文介绍了基于笔划的手写体汉字特征抽取方法,提出了一种基于预分类的神经网络汉字识别方法,该方法用一个传统的距离分类器先对汉字进行预分类,神经网络根据预分类结果进行有选择的训练和识别,能有效解决神经网络大类别模式识别中的训练和分类问题,学习时间很短,识别效果较理想。  相似文献   

19.
提出了一种笔画分区矩特征的提取方法。根据汉字笔画分布特点,利用小波变换将汉字分解为4个方向笔画分量,用分区矩分别描述4个笔画于图像,并采用K—L变换对特征进行降维处理。采用该特征对有限集手写体汉字进行识别,初步实验结果表明该方法十分有效。  相似文献   

20.
随着计算能力的飞速增长、训练数据的不断积累以及非线性激活函数的不断完善,卷积神经网络(CNN)在手写体汉字识别中表现出较好的识别性能。针对CNN识别手写体汉字识别速度慢的问题,将二维主成分分析(2DPCA)与CNN相结合识别手写体汉字。首先,利用2DPCA提取手写体汉字的投影特征向量;然后,将得到的投影特征向量组成特征矩阵;其次,用组成的特征矩阵作为CNN的输入;最后,用Softmax函数进行分类。与基于AlexNet的CNN模型相比,所提方法的运行时间降低了78%,与基于ACNN与DCNN的模型相比,所提方法的运行时间分别降低了80%与73%。实验结果表明,该方法在不降低识别精度的同时,可以减少识别手写体汉字的运行时间。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号