首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到19条相似文献,搜索用时 187 毫秒
1.
一种识别手写汉字的多分类器集成方法   总被引:6,自引:0,他引:6  
根据多信源信息处理与字符识别的经验知识,提出了一个识别手写汉字的多分类器线性集成模型.这个模型不仅考虑到不同的分类器对不同字符识别能力的不同,而且还考虑了不同的分类器得出的输入字符与参考模板之间相似度的实际大小对判决的影响,及不同分类器提供的候选字符对判决的支持作用,更重要的是提供了一种通过监督学习,利用计算机程序自动计算模型参数的方法,因而实现了一个较好的集成系统.同时,本文还提供了三个用于集成的分类器,它们集成的结果充分显示了本方法的有效性。  相似文献   

2.
针对字符识别对象的多样性,提出了一种基于Bagging集成的字符识别模型,解决了识别模型对部分字符识别的偏好现象。采用Bagging采样策略形成不同的数据子集,在此基础上用决策树算法训练形成多个基分类器,用多数投票机制对基分类器预测结果集成输出。理论分析与仿真实验结果表明,所提模型相比其他分类方法具有更好的分类能力。  相似文献   

3.
基于分类器判决可靠度估计的最优线性集成方法   总被引:6,自引:0,他引:6  
鲁湛  丁晓青 《计算机学报》2002,25(8):890-895
多分类器集成的目标是充分利用每一个分类器的长处,既要发挥它们的最佳性能,又能克服单个分类器的弱点,以达到最优的系统识别性能,该文提出一种基于发类器判决可靠度估计的多分类器集成方法,利用各分类器对当前输入样本的判决兵验概率,实时估计它们的分类判决可靠度,并指导集成权重系数的分配,该方法能够使集成权重更灵活地适应不同的输入变化,充分利用每个分类器当前的识别能力,从而获得更好的集成效果,该文结合最小均方误差准则(MSE)下的最优线性集成方法给出了权重模型和训练算法,并与其它的线性集成模型作了比较,实验表明,不论在最优化准则下的最终误差还是在实际识别性能上,作者提出的方法都取得了最好的效果。  相似文献   

4.
基于笔划特征的单字符汉字字体识别   总被引:1,自引:0,他引:1  
在文档电子化的文本自动分析、理解和识别过程中,除了有关文档内容的字符识别外,还必须解决字体识别问题.字体识别不仅是版面分析、理解和恢复的重要依据,还有助于实现高性能字符识别系统.有别于目前基于多个字符组成的文本块的字体识别方法,本文提出了一种基于单个汉字字符的字体识别方法.在单个汉字字符上提取两类特征:笔划属性特征和笔划分布特征,分别构成两个分类器对单个汉字字符进行字体识别,并集成两个分类器的结果得到最终的识别结果.我们使用的笔划属性特征分类器是文本无关的,而笔划分布特征分类器是文本相关的,集成的分类器属于文本相关的字体识别分类器.我们在包含7种字体的样本集上进行了测试,测试结果显示基于单字的字体识别率达到94.48%.  相似文献   

5.
离线手写数字识别是光学字符识别的一个重要分支,在银行票据识别、邮政编码识别等领域有着广泛的应用。由于单一分类器在识别率上很难达到要求,人们提出了各种集成分类器识别方案。通过对离线手写数字的特征提取,从特征互补的角度出发,采用了最小距离分类器、树分类器和BP网络分类器进行多分类器互补集成,提出了基于置信度的多分类器互补集成方法。通过实验对比,基于置信度的多分类器互补集成手写数字识别在识别率和识别速度上达到了满意的结果。  相似文献   

6.
在集成方法中,神经网络集成方法对研制集成型模式识别系统是有效的.但是,单个 子分类器和集成网络的性能对集成系统的整体识别效果都有影响.因此,要进一步提高系统 性能也必须改进子分类器和集成网络.文中采用竞争监督学习法,构造一个网络集成系统,用 于手写数字字符识别.实验证明,该方法的确能够改进系统的收敛速度和泛化能力.  相似文献   

7.
在离线签名验证的分类器设计中,为了减少特征向量分布不均和维数过高对实验结果的影响,给出一种多分类器集成的方法.根据特征向量数量级的不同进行分组,各组分类器自适应地确定分类器权重,通过投票表决得出集成判决结果.实验结果表明,通过分组和加权后,分类正确率有明显提高.  相似文献   

8.
由于高维数据通常存在冗余和噪声,在其上直接构造覆盖模型不能充分反映数据的分布信息,导致分类器性能下降.为此提出一种基于精简随机子空间多树集成分类方法.该方法首先生成多个随机子空间,并在每个子空间上构造独立的最小生成树覆盖模型.其次对每个子空间上构造的分类模型进行精简处理,通过一个评估准则(AUC值),对生成的一类分类器进行精简.最后均值合并融合这些分类器为一个集成分类器.实验结果表明,与其它直接覆盖分类模型和bagging算法相比,多树集成覆盖分类器具有更高的分类正确率.  相似文献   

9.
本文提出了一种基于模板匹配和神经网络的车牌识别方法.该方法集成了模板匹配识别车牌字符和神经网络识别车牌字符的各自优势.对于字符可单独分割出来的一类车牌,本文提出了一种改进的神经网络来进行字符识别;对于字符不可分割或分割困难的另一类车牌,本文提出了一种基于四灰度加权相似函数模板匹配方法来识别字符.从而克服了单一方法很难同时识别这两类车牌中的字符的不足,同时可有效地提高车牌字符识别的识别率、识别速度或识别系统的泛化能力.实验结果表明:大多数情况下,该方法车牌字符识别率超过90%,识别时间不超过1 200毫秒,能更有效识别各种车牌中的字符,能更好地满足实际系统的要求.  相似文献   

10.
分类器的动态选择与循环集成方法   总被引:1,自引:0,他引:1  
针对多分类器系统设计中最优子集选择效率低下、集成方法缺乏灵活性等问题, 提出了分类器的动态选择与循环集成方法 (Dynamic selection and circulating combination, DSCC). 该方法利用不同分类器模型之间的互补性, 动态选择出对目标有较高识别率的分类器组合, 使参与集成的分类器数量能够随识别目标的复杂程度而自适应地变化, 并根据可信度实现系统的循环集成. 在手写体数字识别实验中, 与其他常用的分类器选择方法相比, 所提出的方法灵活高效, 识别率更高.  相似文献   

11.
Given the complexity of many contemporary software systems, it is often difficult to gauge the overall quality of their underlying software components. A potential technique to automatically evaluate such qualitative attributes is to use software metrics as quantitative predictors. In this case study, an aggregation technique based on fuzzy integration is presented that combines the predicted qualitative assessments from multiple classifiers. Multiple linear classifiers are presented with randomly selected subsets of automatically generated software metrics describing components from a sophisticated biomedical data analysis system. The external reference test is a software developer’s thorough assessment of complexity, maintainability, and usability, which is used to assign corresponding quality class labels to each system component. The aggregated qualitative predictions using fuzzy integration are shown to be superior to the predictions from the respective best single classifiers.  相似文献   

12.
Yuan  Weiwei  Guan  Donghai  Zhu  Qi  Ma  Tinghuai 《Neural computing & applications》2018,29(10):673-683

As a kind of noise, mislabeled training data exist in many applications. Because of their negative effects on learning, many filter techniques have been proposed to identify and eliminate them. Ensemble learning-based filter (EnFilter) is the most widely used filter which employs ensemble classifiers. In EnFilter, first the noisy training dataset is divided into several subsets. Each noisy subset is then checked by the multiple classifiers which are trained based on other noisy subsets. It is noted that since the training data used to train multiple classifiers are noisy, the quality of these classifiers cannot be guaranteed, which might generate poor noise identification result. This problem is more serious when the noise ratio in the training dataset is high. To solve this problem, a straightforward but effective approach is proposed in this work. Instead of using noisy data to train the classifiers, nearly noise-free (NNF) data are used since they are supposed to train more reliable classifiers. To this end, a novel NNF data extraction approach is also proposed. Experimental results on a set of benchmark datasets illustrate the utility of our proposed approach.

  相似文献   

13.
Genetic algorithm (GA) has been used as a conventional method for classifiers to evolve solutions adaptively for classification problems. In this paper, a new approach using class decomposition is proposed to improve the performance of GA-based classifiers. A classification problem is fully partitioned into several class modules in the output domain and each module is responsible for solving a fraction of the original problem. These modules are trained in parallel and independently and the results obtained are integrated and evolved further for a final solution. A scheme based on Fisher's linear discriminant (FLD) computation is used to estimate the difficulty of separating two classes. Based on the FLD information derived, different integration approaches are proposed and their performance is compared. The experiment results on a benchmark data set show that class decomposition can achieve higher classification rate than the normal GA and FLD-based integration improves the classification accuracy further.  相似文献   

14.
This paper presents an analysis of the design of classifiers for use in a hierarchical object recognition approach. In this approach, a cascade of classifiers is arranged in a tree in order to recognize multiple object classes. We are interested in the problem of recognizing multiple patterns as it is closely related to the problem of locating an articulated object. Each different pattern class corresponds to the hand in a different pose, or set of poses. For this problem obtaining labelled training data of the hand in a given pose can be problematic. Given a parametric 3D model, generating training data in the form of example images is cheap, and we demonstrate that it can be used to design classifiers almost as good as those trained using non-synthetic data. We compare a variety of different template-based classifiers and discuss their merits.  相似文献   

15.
基于专家域的多层分类器融合   总被引:1,自引:1,他引:0  
论文提出了一种基于专家域的多层分类器融合模型,专家指不同专长之单分类器。模型思想来自医院诊断流程,模型首先训练n个专家,之后将样本空间按专家专长划分专家域。对于待测样本,先将样本指派到合适的专家域,然后再由指定的专家对样本进行分类。用这种算法对UCI的标准数据集进行分类,实验结果显示,该算法得到比其他算法更低的分类误差,显著提高了分类器的性能。  相似文献   

16.
Adaptive classifier integration for robust pattern recognition   总被引:2,自引:0,他引:2  
The integration of multiple classifiers promises higher classification accuracy and robustness than can be obtained with a single classifier. This paper proposes a new adaptive technique for classifier integration based on a linear combination model. The proposed technique is shown to exhibit robustness to a mismatch between test and training conditions. It often outperforms the most accurate of the fused information sources. A comparison between adaptive linear combination and non-adaptive Bayesian fusion shows that, under mismatched test and training conditions, the former is superior to the latter in terms of identification accuracy and insensitivity to information source distortion.  相似文献   

17.
多分类器选择集成方法   总被引:2,自引:0,他引:2       下载免费PDF全文
针对目前人们对分类性能的高要求和多分类器集成实现的复杂性,从基分类器准确率和基分类器间差异性两方面出发,提出了一种新的多分类器选择集成算法。该算法首先从生成的基分类器中选择出分类准确率较高的,然后利用分类器差异性度量来选择差异性大的高性能基分类器,在分类器集成之前先对分类器集进行选择获得新的分类器集。在UCI数据库上的实验结果证明,该方法优于bagging方法,取得了很好的分类识别效果。  相似文献   

18.
In this work, a mixed integer linear programming (MILP) model is proposed for the multi-class data classification problem using a hyper-box representation. The latter representation is particularly suitable for capturing disjoint data regions. The objective function used is the minimisation of the total number of misclassified data samples. In order to improve the training and testing accuracy of our approach, an iterative solution procedure is developed to assign potential multiple boxes to each single class. Finally, the applicability of the proposed approach is demonstrated through a number of illustrative examples. According to the computational results obtained, the proposed optimisation-based approach is competitive in terms of prediction accuracy when compared with various standard classifiers.  相似文献   

19.
手写体字符识别的多特征多分类器设计   总被引:4,自引:0,他引:4  
特征选取和分类器设计是字符识别系统设计的关键。文章针对手写体汉字和阿拉伯数字混和字符集的识别提出了依据不同的分类要求,分别选取不同的字符特征并采用神经网络多分类器进行识别的设计方法。实验结果表明,该方法用于手写体混合字符集的识别是行之有效的。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号