共查询到19条相似文献,搜索用时 156 毫秒
1.
对支持向量机的多类分类问题进行研究,提出了一种基于核聚类的多类分类方法。利用核聚类方法将原始样本特征映射到高维特征进行聚类分组,对每一组使用一个支持向量机二值分类器进行分类,并用这些二值分类器组成决策树的节点,构成了一个决策分类树。给出决策树的生成算法,提出了利用交叠系数来控制交叠,从而克服错分积累,提高分类准确率。实验结果表明,采用该方法,手写体汉字识别速度和正确率都达到了实用的要求。 相似文献
2.
3.
结合距离分类器的神经网络手写体汉字识别 总被引:1,自引:1,他引:1
手写体汉字识别技术中如何解决复杂的大类别识别问题,是汉字识别中的一个难点。该文介绍了基于笔划的手写体汉字特征抽取方法,提出了一种基于预分类的神经网络汉字识别方法,该方法用一个传统的距离分类器先对汉字进行预分类,神经网络根据预分类结果进行有选择的训练和识别,能有效解决神经网络大类别模式识别中的训练和分类问题,学习时间很短,识别效果较理想。 相似文献
4.
5.
手写体字符识别的多特征多分类器设计 总被引:4,自引:0,他引:4
特征选取和分类器设计是字符识别系统设计的关键。文章针对手写体汉字和阿拉伯数字混和字符集的识别提出了依据不同的分类要求,分别选取不同的字符特征并采用神经网络多分类器进行识别的设计方法。实验结果表明,该方法用于手写体混合字符集的识别是行之有效的。 相似文献
6.
SVM多值分类器在脱机手写体相似汉字识别中的应用 总被引:7,自引:0,他引:7
相似字的普遍存在是影响脱机手写体汉字识别率低的主要原因之一。论文研究了支持向量机(SVM)多值分类器在手写相似汉字识别中的应用,所提出的方法采用了小波弹性网格技术提取汉字的特征,通过实验比较了三种不同的SVM分类器组合策略的分类效果。 相似文献
7.
8.
9.
本文提出一种基于小波包分解的手写体金融汉字识别算法。该算法首先对汉字图像进行小波包分解,利用基于节点子图像能量方差的准则选择适当的部分分解树;然后,将得到的子图像划分成多个局部窗口,计算局部窗口的能量值组成特征向量;再通过主成分分析(PCA)选择分类能力最强的一组特征,降低特征空间的维数;最后,用SVM多类分类方法进行分类判决。实验结果表明,该算法取得了较好的识别效果。 相似文献
10.
11.
Based on a recursive process of reducing the entropy, the general decision tree classifier with overlap has been analyzed. Several theorems have been proposed and proved. When the number of pattern classes is very large, the theorems can reveal both the advantages of a tree classifier and the main difficulties in its implementation. Suppose H is Shannon's entropy measure of the given problem. The theoretical results indicate that the tree searching time can be minimized to the order O(H), but the error rate is also in the same order O(H) due to error accumulation. However, the memory requirement is in the order 0(H exp(H)) which poses serious problems in the implementation of a tree classifier for a large number of classes. To solve these problems, several theorems related to the bounds on the search time, error rate, memory requirement and overlap factor in the design of a decision tree have been proposed and some principles have been established to analyze the behaviors of the decision tree. When applied to classify sets of 64, 450, and 3200 Chinese characters, respectively, the experimental results support the theoretical predictions. For 3200 classes, a very high recognition rate of 99.88 percent was achieved at a high speed of 873 samples/s when the experiment was conducted on a Cyber 172 computer using a high-level language. 相似文献
12.
针对网络中敏感词变形体识别效率不高的问题,提出了基于决策树的敏感词变形体识别算法。首先,通过分析汉字的结构和读音等特征,研究敏感词及变形体;其次,基于敏感词库构建敏感词决策树;最后,通过多因子改进模型,对微博等新媒体的文本敏感程度进行计算。实验结果表明,该算法在识别中文敏感词及变形体时,查全率和查准率最高分别可达95%和94%,与基于确定有穷自动机的改进算法相比,查全率和查准率分别提高了19.8%和21.1%;与敏感信息决策树信息过滤算法相比,查全率和查准率分别提高17.9%和18.1% 。通过分析,该算法对敏感词变形体的识别和自动过滤是有效的。 相似文献
13.
Gu YX Wang QR Suen CY 《IEEE transactions on pattern analysis and machine intelligence》1983,(1):83-89
A multistage classifier with general tree structure has been developed to recognize a large number of Chinese characters. A simple and efficient method of classifying the characters was achieved by choosing the best feature at each stage of the tree. The features used are Walsh coefficients obtained from two profiles of a character projected onto the X-Y orthogonal axes. Some algorithms for aligning the characters were compared and one of them was adopted in this recognition scheme. A high recognition rate of about 99.5 percent was obtained in an experiment with more than 3000 different Chinese characters. 相似文献
14.
基于标点符号分割的汉语句法分析算法 总被引:6,自引:0,他引:6
目前大部分句法解析器都忽略标点符号这一重要的句法特征或者只进行非常简单的处理。本文根据标点符号的句法结构特性,提出单独解析块的概念,并且根据标点符号在句子中的特有特征和位置关系,给出了基于决策树算法(Id3)单独解析块识别方法,将标点融入汉语句法分析中。本文所用的实验数据(包括训练集和测试集)均来自中文宾州树库5.0。对句长大于40个词的汉语长句单独进行了实验,句法分析精度和召回率分别提高1.59%和0.93%,同时时间开销降低了近2/3。实验结果表明,标点对汉语长句句法分析非常有利, 系统性能获得了较大提高。 相似文献
15.
Off-line recognition of Chinese handwriting by multifeature andmultilevel classification 总被引:1,自引:0,他引:1
Yuan Y. Tang Lo-Ting Tu Jiming Liu Seong-Whan Lee Win-Win Lin 《IEEE transactions on pattern analysis and machine intelligence》1998,20(5):556-561
In this paper, an off-line recognition system based on multifeature and multilevel classification is presented for handwritten Chinese characters. Ten classes of multifeatures, such as peripheral shape features, stroke density features, and stroke direction features, are used in this system. The multilevel classification scheme consists of a group classifier and a five-level character classifier, where two new technologies, overlap clustering and Gaussian distribution selector are developed. Experiments have been conducted to recognize 5,401 daily-used Chinese characters. The recognition rate is about 90 percent for a unique candidate, and 98 percent for multichoice with 10 candidates 相似文献
16.
基于决策分类熵的决策树构造算法及应用 总被引:1,自引:0,他引:1
为了更好地完成金融数据集上的分类挖掘任务,以粗糙集理论为基础提出决策分类熵的概念,进而以属性的决策分类熵为属性分裂度量提出基于决策分类熵的决策树构造算法,并针对过拟合问题提出一种抑制参数来实现树规模的良好控制。实例分析及金融数据集上的实验表明:相比经典的C4.5决策树算法,新算法能够较好地克服其缺点和不足,构建更优的决策树,能够更好地完成分类任务。 相似文献
17.
18.
ATM的应用日益广泛,如何部署一个利用率高的ATM已成为一个值得探讨的问题。运用数据挖掘知识和决策树ID3算法,可以对已经部署ATM的地区进行分析,从而找出高利用率ATM地区的特征,并建立ATM选点模型,作为金融机构在何处部署高效的ATM的参考。 相似文献
19.
Determining the firm performance using a set of financial measures/ratios has been an interesting and challenging problem for many researchers and practitioners. Identification of factors (i.e., financial measures/ratios) that can accurately predict the firm performance is of great interest to any decision maker. In this study, we employed a two-step analysis methodology: first, using exploratory factor analysis (EFA) we identified (and validated) underlying dimensions of the financial ratios, followed by using predictive modeling methods to discover the potential relationships between the firm performance and financial ratios. Four popular decision tree algorithms (CHAID, C5.0, QUEST and C&RT) were used to investigate the impact of financial ratios on firm performance. After developing prediction models, information fusion-based sensitivity analyses were performed to measure the relative importance of independent variables. The results showed the CHAID and C5.0 decision tree algorithms produced the best prediction accuracy. Sensitivity analysis results indicated that Earnings Before Tax-to-Equity Ratio and Net Profit Margin are the two most important variables. 相似文献