共查询到20条相似文献,搜索用时 10 毫秒
1.
一种手写体汉字识别的神经网络多分类器集成方案 总被引:1,自引:1,他引:1
提出了一种基于单字单网的手写体汉字识别纯神经网络的多分类器集成方案,并通过实验证明用该方案实现的神经网络集成系统性能均比任一个神经网络单分类器都好,对1 000种不同的手写体汉字的1 000×10个字进行测试,集成后的识别率最高达到95.22%,比单分类器的识别率高出5.0%-8.7%。 相似文献
2.
Georgios Vamvakas Author Vitae Basilis Gatos Author Vitae Author Vitae 《Pattern recognition》2010,43(8):2807-2816
In this paper, we present a methodology for off-line handwritten character recognition. The proposed methodology relies on a new feature extraction technique based on recursive subdivisions of the character image so that the resulting sub-images at each iteration have balanced (approximately equal) numbers of foreground pixels, as far as this is possible. Feature extraction is followed by a two-stage classification scheme based on the level of granularity of the feature extraction method. Classes with high values in the confusion matrix are merged at a certain level and for each group of merged classes, granularity features from the level that best distinguishes them are employed. Two handwritten character databases (CEDAR and CIL) as well as two handwritten digit databases (MNIST and CEDAR) were used in order to demonstrate the effectiveness of the proposed technique. The recognition result achieved, in comparison to the ones reported in the literature, is the highest for the well-known CEDAR Character Database (94.73%) and among the best for the MNIST Database (99.03%) 相似文献
3.
Since Chinese characters are composed from a small set of fundamental shapes (radicals) the problem of recognising large numbers of characters can be converted to that of extracting a small number of radicals and then finding their optimal combination. In this paper, radical extraction is carried out by nonlinear active shape models, in which kernel principal component analysis is employed to capture the nonlinear variation. Treating Chinese character composition as a discrete Markov process, we also propose an approach to recognition with the Viterbi algorithm. Our initial experiments are conducted on off-line recognition of 430,800 loosely-constrained characters, comprised of 200 radical categories covering 2154 character categories from 200 writers. The correct recognition rate is 93.5% characters correct (writer-independent). Consideration of published figures for existing radical approaches suggests that our method achieves superior performance. 相似文献
4.
5.
针对传统两级手写汉字识别系统中手写汉字识别的特征提取方法的限制问题,提出了一种采用卷积神经网对相似汉字自动学习有效特征进行识别的系统方法。该方法采用来自手写云平台上的大数据来训练模型,基于频度统计生成相似子集,进一步提高识别率。实验表明,相对于传统的基于梯度特征的支持向量机和最近邻分类器方法,该方法的识别率有一定的提高。 相似文献
6.
Previous handwritten numeral recognition algorithms applied structural classification to extract geometric primitives that characterize each image, and then utilized artificial intelligence methods, like neural network or fuzzy memberships, to classify the images. We propose a handwritten numeral recognition methodology based on simplified structural classification, by using a much smaller set of primitive types, and fuzzy memberships. More specifically, based on three kinds of feature points, we first extract five kinds of primitive segments for each image. A fuzzy membership function is then used to estimate the likelihood of these primitives being close to the two vertical boundaries of the image. Finally, a tree-like classifier based on the extracted feature points, primitives and fuzzy memberships is applied to classify the numerals. With our system, handwritten numerals in NIST Special Database 19 are recognized with correct rate between 87.33% and 88.72%. 相似文献
7.
8.
To improve the accuracy of handwritten Chinese character recognition (HCCR), we propose linear discriminant analysis (LDA)-based compound distances for discriminating similar characters. The LDA-based method is an extension of previous compound Mahalanobis function (CMF), which calculates a complementary distance on a one-dimensional subspace (discriminant vector) for discriminating two classes and combines this complementary distance with a baseline quadratic classifier. We use LDA to estimate the discriminant vector for better discriminability and show that under restrictive assumptions, the CMF is a special case of our LDA-based method. Further improvements can be obtained when the discriminant vector is estimated from higher-dimensional feature spaces. We evaluated the methods in experiments on the ETL9B and CASIA databases using the modified quadratic discriminant function (MQDF) as baseline classifier. The results demonstrate the superiority of LDA-based method over the CMF and the superiority of discriminant vector learning from high-dimensional feature spaces. Compared to the MQDF, the proposed method reduces the error rates by factors of over 26%. 相似文献
9.
ART2神经网络在手写体汉字识别中的应用 总被引:4,自引:0,他引:4
该文提出了一种基于神经网络的手写体汉字识别方法,该算法充分利用神经网络的自适应学习能力。ART2网络通过竞争学习和自稳机制原理实现分类,可以在非平稳的、有干扰的环境中进行无教师无监督的自学习。其学习过程是自组织的实时学习,能够迅速识别已学习过的样本,并能迅速适应未学习过的新对象。考虑到Gabor滤波器具有优良的方向性,该算法采用Gabor特征作为字符特征。Gabor特征反映字符的空间分布特征,而且可以组合成高维矢量,特别适用于汉字识别这大型模式识别场合。实验结果显示,该算法对测试样本识别正确率达到94%,比其他方法更准确、更可靠。 相似文献
10.
一种识别手写汉字的多分类器集成方法 总被引:4,自引:1,他引:3
根据多信源信息处理与字符识别的经验知识,提出了一个识别手写汉字的多分类器线性集成模型.这个模型不仅考虑到不同的分类器对不同字符识别能力的不同,而且还考虑了不同的分类器得出的输入字符与参考模板之间相似度的实际大小对判决的影响,及不同分类器提供的候选字符对判决的支持作用,更重要的是提供了一种通过监督学习,利用计算机程序自动计算模型参数的方法,因而实现了一个较好的集成系统.同时,本文还提供了三个用于集成的分类器,它们集成的结果充分显示了本方法的有效性. 相似文献
11.
12.
Orly Yadid-Pecht Moty Gerner Lior Dvir Eliyahu Brutman Uri Shimony 《Machine Vision and Applications》1996,9(2):65-72
A neural network for recognition of handwritten musical notes, based on the well-known Neocognitron model, is described. The Neocognitron has been used for the what pathway (symbol recognition), while contextual knowledge has been applied for the where (symbol placement). This way, we benefit from dividing the process for dealing with this complicated recognition task. Also, different degrees of intrusiveness in learning have been incorporated in the same network: More intrusive supervised learning has been implemented in the lower neuron layers and less intrusive in the upper one. This way, the network adapts itself to the handwriting of the user. The network consists of a 13×49 input layer and three pairs of simple and complex neuron layers. It has been trained to recognize 20 symbols of unconnected notes on a musical staff and was tested with a set of unlearned input notes. Its recognition rate for the individual unseen notes was up to 93%, averaging 80% for all categories. These preliminary results indicate that a modified Neocognitron could be a good candidate for identification of handwritten musical notes. 相似文献
13.
脱机手写体汉字识别研究综述 总被引:7,自引:2,他引:7
脱机手写体汉字识别是汉字识别领域中最难的课题。该文对目前脱机手写体汉字识别在预处理、特征提取、分类识别及后处理四个阶段主要采用的方法做了简要介绍,阐述了各种方法的优缺点,并提出了一种将支持向量机有效地用于解决多分类问题的策略。最后根据目前的研究状况,指出今后研究中需要注意的问题和研究的发展方向。 相似文献
14.
This paper describes and analyses the performance of a novel feature extraction technique for the recognition of segmented/cursive characters that may be used in the context of a segmentation-based handwritten word recognition system. The modified direction feature (MDF) extraction technique builds upon the direction feature (DF) technique proposed previously that extracts direction information from the structure of character contours. This principal was extended so that the direction information is integrated with a technique for detecting transitions between background and foreground pixels in the character image.In order to improve on the DF extraction technique, a number of modifications were undertaken. With a view to describe the character contour more effectively, a re-design of the direction number determination technique was performed. Also, an additional global feature was introduced to improve the recognition accuracy for those characters that were most frequently confused with patterns of similar appearance. MDF was tested using a neural network-based classifier and compared to the DF and transition feature (TF) extraction techniques. MDF outperformed both DF and TF techniques using a benchmark dataset and compared favourably with the top results in the literature. A recognition accuracy of above 89% is reported on characters from the CEDAR dataset. 相似文献
15.
16.
A novel stroke-based feature extraction for handwritten Chinese character recognition 总被引:7,自引:0,他引:7
A stroke-based approach to extract skeletons and structural features for handwritten Chinese character recognition is proposed. We first determine stroke directions based on the directional run-length information of binary character patterns. According to the stroke directions and their adjacent relationships, we split strokes into stroke and fork segments, and then extract the skeletons of the stroke segments called skeleton segments. After all skeleton segments are extracted, fork segments are processed to find the fork points and fork degrees. Skeleton segments that touch a fork segment are connected at the fork point, and all connected skeleton segments form the character skeleton. According to the extracted skeletons and fork points, we can extract primitive strokes and stroke direction maps for recognition. A simple classifier based on the stroke direction map is presented to recognize regular and rotated characters to verify the ability of the proposed feature extraction for handwritten Chinese character recognition. Several experiments are carried out, and the experimental results show that the proposed approach can easily and effectively extract skeletons and structural features, and works well for handwritten Chinese character recognition. 相似文献
17.
18.
结合距离分类器的神经网络手写体汉字识别 总被引:2,自引:1,他引:1
手写体汉字识别技术中如何解决复杂的大类别识别问题,是汉字识别中的一个难点。该文介绍了基于笔划的手写体汉字特征抽取方法,提出了一种基于预分类的神经网络汉字识别方法,该方法用一个传统的距离分类器先对汉字进行预分类,神经网络根据预分类结果进行有选择的训练和识别,能有效解决神经网络大类别模式识别中的训练和分类问题,学习时间很短,识别效果较理想。 相似文献
19.
20.
随着计算能力的飞速增长、训练数据的不断积累以及非线性激活函数的不断完善,卷积神经网络(CNN)在手写体汉字识别中表现出较好的识别性能。针对CNN识别手写体汉字识别速度慢的问题,将二维主成分分析(2DPCA)与CNN相结合识别手写体汉字。首先,利用2DPCA提取手写体汉字的投影特征向量;然后,将得到的投影特征向量组成特征矩阵;其次,用组成的特征矩阵作为CNN的输入;最后,用Softmax函数进行分类。与基于AlexNet的CNN模型相比,所提方法的运行时间降低了78%,与基于ACNN与DCNN的模型相比,所提方法的运行时间分别降低了80%与73%。实验结果表明,该方法在不降低识别精度的同时,可以减少识别手写体汉字的运行时间。 相似文献