首页 | 本学科首页   官方微博 | 高级检索  
     

一种改进的最大相关最小冗余选择性贝叶斯分类器
作者姓名:马勇  仝瑶瑶  程玉虎
作者单位:中国矿业大学信息与电气工程学院, 徐州 221116
基金项目:国家自然科学基金;新世纪优秀人才支持计划
摘    要:利用K均值聚类和增量学习算法扩大训练样本规模,提出一种改进的mRMR SBC.一方面,利用K均值聚类预测测试样本的类标签,将已标记的测试样本添加到训练集中,并在属性选择过程中引入一个调节因子以降低K均值聚类误标记带来的风险.另一方面,从测试样本集中选择有助于提高当前分类器精度的实例,把它加入到训练集中,来增量地修正贝叶斯分类器的参数.实验结果表明,与mRMR SBC相比,所提方法具有较好的分类效果,适于解决高维且含有较少类标签的数据集分类问题. 

关 键 词:分类器    属性选择    冗余    K均值聚类    增量学习
收稿时间:2011-04-22

An improved maximum relevance and minimum redundancy selective Bayesian classifier
Affiliation:School of Information and Electrical Engineering, China University of Mining and Technology, Xuzhou 221116, China
Abstract:A kind of improved mRMR SBC was proposed by using K-means clustering and incremental learning algorithms to enlarge the scale of training samples. On one hand, the testing samples are labeled using the K-means clustering algorithm and are added to the training set. A regulatory factor is introduced into the process of attribute selection to reduce the risk of mislabel resulting from K-means clustering. On the other hand, some samples that are most helpful for improving the current classification accuracy are selected from the testing set and are added to the training set. Based on the enlarged training set, parameters in the Bayesian classifier are adjusted incrementally. Experimental results show that compared with mRMR SBC, the proposed Bayesian classifier has better classification results and is applicable for solving the classification problem for the high-dimensional dataset with little labels. 
Keywords:
本文献已被 万方数据 等数据库收录!
点击此处可从《》浏览原始摘要信息
点击此处可从《》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号