首页 | 本学科首页   官方微博 | 高级检索  
     

基于模糊模型相似测量的字符无监督分类法
引用本文:卢达,钱忆平,谢铭培,浦炜.基于模糊模型相似测量的字符无监督分类法[J].计算机学报,2002,25(4).
作者姓名:卢达  钱忆平  谢铭培  浦炜
作者单位:复旦大学计算机科学系,上海,200433
基金项目:国家自然科学基金(7870 0 12 ),江苏省教委留学回国人员科研基金(199715 5 1),江苏省教委自然科学研究基金 (99KGB14 0 0 0 9)资助
摘    要:该文提出一种基于模糊模型相似测量的文本分析系统的字符预分类方法 ,用于对字符的无监督分类 ,以提高整个字符识别系统的速度、正确性和鲁棒性 .作者在字符印刷结构归类的基础上 ,采用模板匹配方法将各类字符分别转换成基于一非线性加权相似函数的模糊样板集合 .模糊字符的无监督分类是字符匹配的一种自然范例并发展了加权模糊相似测量的研究 .该文讨论了该模糊模型的特性、模糊样板匹配的规则 ,并用于加快字符分类处理 ,经过字符分类 ,在字符识别时由于只需针对较小的模糊样板集合而变得容易和快速

关 键 词:模糊模型  加权模糊相似测量  字符无监督分类  匹配算法  分级归类

A Unsupervised Character Classification Based on Similarity Measure in Fuzzy Model
LU Da,QIAN Yi Ping,XIE Ming Pei,PU Wei.A Unsupervised Character Classification Based on Similarity Measure in Fuzzy Model[J].Chinese Journal of Computers,2002,25(4).
Authors:LU Da  QIAN Yi Ping  XIE Ming Pei  PU Wei
Affiliation:LU Da 1) QIAN Yi Ping 1) XIE Ming Pei 2) PU Wei 1) 1)
Abstract:This paper presents a character preclassification method based on similarity measure in Fuzzy model to perform unsupervised character classification for improvement in robustness, correctness, and speed of a character recognition system. On the basis of character typographical structure categorization, a pattern matching is used to classify the characters in each category into a set of fuzzy prototypes based on a nonlinear weighted similarity function. The emphasis of inequality measure for small characters guarantees no misclassification, but a little redundancy is encountered on the fuzzy prototype set. This redundancy can be removed by self grouping of the final prototype set. The fuzzy unsupervised character classification, which is natural in the representation of prototypes for character matching, is developed and a weighted fuzzy similarity measure is explored. A fuzzy model of prototypes is defined and several propositions of the features of the fuzzy model are given. The characteristics of the fuzzy model and rule based matching of fuzzy prototypes are discussed and used in speeding up the classification process. The fuzzy model of prototype has been verified to reduce the effect of noise. Based on prototypes that are free of noise, the recognition problem will be simplified and the speed as well as recognition rate will be increased. For ambiguous characters, probably as merged, the accuracy of postprocessing also will be improved. After classification, the character recognition, which is simply applied on a smaller set of the fuzzy prototypes, becomes much easier and less time consuming.
Keywords:fuzzy model  weighted fuzzy similarity measure  unsupervised character classification  matching algorithm  classification hierarchy
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号