共查询到20条相似文献,搜索用时 15 毫秒
1.
In this paper, we propose an off-line recognition method for handwritten Korean characters based on stroke extraction and representation. To recognize handwritten Korean characters, it is required to extract strokes and stroke sequence to describe an input of two-dimensional character as one-dimensional representation. We define 28 primitive strokes to represent characters and introduce 300 stroke separation rules to extract proper strokes from Korean characters. To find a stroke sequence, we use stroke code and stroke relationship between consecutive strokes. The input characters are recognized by using character recognition trees. The proposed method has been tested for the most frequently used 1000 characters by 400 different writers and showed recognition rate of 94.3%. 相似文献
2.
为有效地获取脱机手写体汉字笔划信息,采用过程神经元网络提取手写体汉字基本笔段,分析各类笔段间的拓扑性质,并将手写体汉字图像转化为具有容错表征方式的六种汉字笔划类型在不同位置组成的几何图形.模仿人类汉字形码输入法,统计具有冗余容错形状的笔划类型和相合相交点的数量和位置,建立手写体汉字多维特征知识数据结构表,通过对比和判断仿人容错地识别手写体汉字.对SCUT-IRAC手写体汉字库中汉字进行了实验仿真,该方法具有较强的"认知"手写体汉字的能力. 相似文献
3.
本文介绍用于联机手写中文字自动识别的新方法与新算法.由于下述各点的实现,手写
文字时可以减少许多限制,增加书写自由.①笔划的抽取经由两次分段实现:首先连续采样,
将输入笔划变换成线段组成,再对线段的长度进行比较,删去相对不重要的成份.②用笔划校
正技术将不应分离的笔划重新组合成规范笔划,或者将不应联写的复合笔划重新分解成基本
笔划.③用非完全匹配技术使失真字可以识别.④用混序笔划重排算法可使一个混序笔划输
入的字重新排列笔顺.⑤笔划位置和长度作为进一步特征,可区别模糊字. 相似文献
4.
A novel stroke-based feature extraction for handwritten Chinese character recognition 总被引:7,自引:0,他引:7
A stroke-based approach to extract skeletons and structural features for handwritten Chinese character recognition is proposed. We first determine stroke directions based on the directional run-length information of binary character patterns. According to the stroke directions and their adjacent relationships, we split strokes into stroke and fork segments, and then extract the skeletons of the stroke segments called skeleton segments. After all skeleton segments are extracted, fork segments are processed to find the fork points and fork degrees. Skeleton segments that touch a fork segment are connected at the fork point, and all connected skeleton segments form the character skeleton. According to the extracted skeletons and fork points, we can extract primitive strokes and stroke direction maps for recognition. A simple classifier based on the stroke direction map is presented to recognize regular and rotated characters to verify the ability of the proposed feature extraction for handwritten Chinese character recognition. Several experiments are carried out, and the experimental results show that the proposed approach can easily and effectively extract skeletons and structural features, and works well for handwritten Chinese character recognition. 相似文献
5.
6.
Fangyi Li Qiang Shen Ying Li Neil Mac Parthaláin 《Soft Computing - A Fusion of Foundations, Methodologies and Applications》2016,20(8):2939-2949
The task of handwritten Chinese character recognition is one of the most challenging areas of human handwriting classification. The main reason for this is related to the writing system itself which encompasses thousands of characters, coupled with high levels of diversity in personal writing styles and attributes. Much of the existing work for both online and off-line handwritten Chinese character recognition has focused on methods which employ feature extraction and segmentation steps. The preprocessed data from these steps form the basis for the subsequent classification and recognition phases. This paper proposes an approach for handwritten Chinese character recognition and classification using only an image alignment technique and does not require the aforementioned steps. Rather than extracting features from the image, which often means building models from very large training data, the proposed method instead uses the mean image transformations as a basis for model building. The use of an image-only model means that no subjective tuning of the feature extraction is required. In addition by employing a fuzzy-entropy-based metric, the work also entails improved ability to model different types of uncertainty. The classifier is a simple distance-based nearest neighbour classification system based on template matching. The approach is applied to a publicly available real-world database of handwritten Chinese characters and demonstrates that it can achieve high classification accuracy and is robust in the presence of noise. 相似文献
7.
基于可伸缩矢量图SVG的在线手写汉字是以SVG图像作为汉字图像格式、以SVG的path对象作为笔画的基本存储单元来对汉字进行显示和存储的,笔画的轮廓是以手写过程中记录的坐标值作为特征数值加以确定的。基于此种SVG手写汉字存储和表示形式,本文提出一种基于图论的在线连续手写汉字多步分割方法。该方法根据汉字笔画间的坐标位置关系对手写笔画序列构建无向图模型,并利用图的广度优先搜索将原笔画序列分割为互不连通的笔画部件,使偏旁部首分离较远、非粘连汉字得到正确分割;然后利用改进的tarjan算法对部件中的粘连字符进行分割,最后基于笔画部件间距,利用二分类迭代算法对间距进行分类,找出全局最佳分割位置,对过分割的部件进行重组合并。实验结果表明,该方法对于在线手写汉字的分割是有效可行的。 相似文献
8.
9.
10.
11.
手写体文本识别技术可以将手写文档转录成可编辑的数字文档。但由于手写的书写风格迥异、文档结构千变万化和字符分割识别精度不高等问题,基于神经网络的手写体英文文本识别仍面临着许多挑战。针对上述问题,提出基于卷积神经网络(CNN)和Transformer的手写体英文文本识别模型。首先利用CNN从输入图像中提取特征,而后将特征输入到Transformer编码器中得到特征序列每一帧的预测,最后经过链接时序分类(CTC)解码器获得最终的预测结果。在公开的IAM(Institut für Angewandte Mathematik)手写体英文单词数据集上进行了大量的实验结果表明,该模型获得了3.60%的字符错误率(CER)和12.70%的单词错误率(WER),验证了所提模型的可行性。 相似文献
12.
13.
手写文本识别方法主要应用于文本输入技术,对人机交互领域的发展起关键作用。针对多数在线输入法无法识别中英文混合手写识别的问题,提出一种在线中英文混合手写文本识别方法。通过对文本笔画进行基于水平相对位置、垂直重叠率、面积重叠率规则的整合以及连笔切分,得到一系列字符片段,同时利用笔画个数、宽高比、中心偏离、平滑度等几何特征和识别置信度,对字符片段进行中英文分类。在此基础上,根据分类结果并结合自然语言模型的路径评价及动态规划搜索算法,分别对候选的中、英文字符片段进行合并处理,得到待识别的中、英文字符序列,并将其分别送入卷积神经网络的中、英文识别模型中,得到手写文本识别结果。实验结果表明,在线手写中英文混合文本识别正确率达93.67%,不仅能切分在线手写中文文本行,而且对包含字符连笔的在线手写中英文文本行也有较好的切分效果。 相似文献
14.
Extracting perceptually meaningful strokes plays an essential role in modeling structures of handwritten Chinese characters for accurate character recognition. This paper proposes a cascade Markov random field (MRF) model that combines both bottom-up (BU) and top-down (TD) processes for stroke extraction. In the low-level stroke segmentation process, we use a BU MRF model with smoothness prior to segment the character skeleton into directional substrokes based on self-organization of pixel-based directional features. In the high-level stroke extraction process, the segmented substrokes are sent to a TD MRF-based character model that, in turn, feeds back to guide the merging of corresponding substrokes to produce reliable candidate strokes for character recognition. The merit of the cascade MRF model is due to its ability to encode the local statistical dependencies of neighboring stroke components as well as prior knowledge of Chinese character structures. Encouraging stroke extraction and character recognition results confirm the effectiveness of our method, which integrates both BU/TD vision processing streams within the unified MRF framework. 相似文献
15.
16.
17.
18.
Chinese characters are constructed by strokes according to structural rules. Therefore, the geometric configurations of characters are important features for character recognition. In handwritten characters, stroke shapes and their spatial relations may vary to some extent. The attribute value of a structural identification is then a fuzzy quantity rather than a binary quantity. Recognizing these facts, we propose a fuzzy attribute representation (FAR) to describe the structural features of handwritten Chinese characters for an on-line Chinese character recognition (OLCCR) system. With a FAR. a fuzzy attribute graph for each handwritten character is created, and the character recognition process is thus transformed into a simple graph matching problem. This character representation and our proposed recognition method allow us to relax the constraints on stroke order and stroke connection. The graph model provides a generalized character representation that can easily incorporate newly added characters into an OLCCR system with an automatic learning capability. The fuzzy representation can describe the degree of structural deformation in handwritten characters. The character matching algorithm is designed to tolerate structural deformations to some extent. Therefore, even input characters with deformations can be recognized correctly once the reference dictionary of the recognition system has been trained using a few representative learning samples. Experimental results are provided to show the effectiveness of the proposed method. 相似文献
19.
针对当前移动设备上手写汉字流行的[xml]文件存储格式,提出了一种对用户字笔画与模板字笔画测试匹配的算法,该算法通过方位、拓扑关系和形状3种特征综合量度笔画间的匹配,实验效果良好。该算法在用户字的多笔少笔判别、笔顺的正误性判别、整字的正确性以及工整性判别等方面都有着广泛的应用。 相似文献
20.
借鉴仿生模式识别的认知观点,从汉字的构造机理和人类认识汉字的习惯角度出发,提出一种基于小波变换的图像汉字识别方法。制定了图像汉字笔划特征提取的具体规则,采用小波变换的方法对图像汉字边缘和笔划轮廓进行检测,通过有效提取图像汉字笔段信息,进行笔段合成,生成汉字或汉字的基本笔划。仿真实验结果表明,这种方法提高了图像汉字笔划特征提取的准确率和稳定性,对于印刷体和书写较规范的手写体图像汉字具有极高的识别率。 相似文献