首页 | 本学科首页   官方微博 | 高级检索  
     

主动协同半监督粗糙集分类模型
引用本文:高灿,苗夺谦,张志飞,刘财辉.主动协同半监督粗糙集分类模型[J].模式识别与人工智能,2012,25(5):745-754.
作者姓名:高灿  苗夺谦  张志飞  刘财辉
作者单位:同济大学电子与信息工程学院计算机科学与技术系上海201804
同济大学嵌入式系统与服务计算教育部重点实验室上海201804
基金项目:国家自然科学基金项目,中国博士后科学基金项目,上海市重点学科建设项目
摘    要:粗糙集理论是一种有监督学习模型,一般需要适量有标记的数据来训练分类器。但现实一些问题往往存在大量无标记的数据,而有标记数据由于标记代价过大较为稀少。文中结合主动学习和协同训练理论,提出一种可有效利用无标记数据提升分类性能的半监督粗糙集模型。该模型利用半监督属性约简算法提取两个差异性较大的约简构造基分类器,然后基于主动学习思想在无标记数据中选择两分类器分歧较大的样本进行人工标注,并将更新后的分类器交互协同学习。UCI数据集实验对比分析表明,该模型能明显提高分类学习性能,甚至能达到数据集的最优值。

关 键 词:粗糙集  差别矩阵  半监督约简  主动学习  协同训练  
收稿时间:2011-06-20

A Semi-Supervised Rough Set Model for Classification Based on Active Learning and Co-Training
GAO Can , MIAO Duo-Qian , ZHANG Zhi-Fei , LIU Cai-Hui.A Semi-Supervised Rough Set Model for Classification Based on Active Learning and Co-Training[J].Pattern Recognition and Artificial Intelligence,2012,25(5):745-754.
Authors:GAO Can  MIAO Duo-Qian  ZHANG Zhi-Fei  LIU Cai-Hui
Affiliation:Department of Computer Science and Technology,College of Electronics and Information Engineering,Tongji University,Shanghai 201804
Key Laboratory of Embedded System and Service Computing,Ministry of Education,Tongji University,Shanghai 201804
Abstract:Rough set theory, as an effective supervised learning model, usually relies on the availability of an amount of labeled data to train the classifier. Howerer, in many practical problems, large amount of unlabeled data are readily available, and labeled ones are fairly expensive to obtain because of high cost. In this paper, a semi-supervised rough set model is proposed to deal with the partially labeled data. The proposed model firstly employs two diverse semi-supervised reducts to train its base classifiers on labeled data. The unlabeled ramified samples for two base classifiers are selected to be labeled based on the principle of active learning, and then the updated classifiers learn from each other by labeling confident unlabeled samples to its concomitant. The experimental results on selected UCI datasets show that the proposed model greatly improves the classification performance of partially labeled data, and even the best performance of dataset is obtained.
Keywords:Rough Set  Discernibility Matrix  Semi-Supervised Reduction  Active Learning  Co-Training  
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《模式识别与人工智能》浏览原始摘要信息
点击此处可从《模式识别与人工智能》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号