首页 | 本学科首页   官方微博 | 高级检索  
     

基于粗糙集理论与CAIM准则的C4.5改进算法
引用本文:于宏涛,贾宇波.基于粗糙集理论与CAIM准则的C4.5改进算法[J].计算机系统应用,2018,27(7):139-144.
作者姓名:于宏涛  贾宇波
作者单位:浙江理工大学 信息学院, 杭州 310018,浙江理工大学 信息学院, 杭州 310018
摘    要:C4.5算法是一种非常有影响力的决策树生成算法,但该方法生成的决策树分类精度不高,分支较多,规模较大.针对C4.5算法存在的上述问题,本文提出了一种基于粗糙集理论与CAIM准则的C4.5改进算法.该算法采用基于CAIM准则的离散化方法对连续属性进行处理,使离散化过程中的信息丢失程度降低,提高分类精度.对离散化后的样本用基于粗糙集理论的属性约简方法进行属性约简,剔除冗余属性,减小生成的决策树规模.通过实验验证,该算法可以有效提高C4.5算法生成的决策树分类精度,降低决策树的规模.

关 键 词:粗糙集  离散化  属性约简  决策树  CAIM  C4.5
收稿时间:2017/11/4 0:00:00
修稿时间:2017/11/27 0:00:00

C4.5 Improved Algorithm Based on Rough Set Theory and CAIM Criterion
YU Hong-Tao and JIA Yu-Bo.C4.5 Improved Algorithm Based on Rough Set Theory and CAIM Criterion[J].Computer Systems& Applications,2018,27(7):139-144.
Authors:YU Hong-Tao and JIA Yu-Bo
Affiliation:School of Information Science and Technology, Zhejiang Sci-Tech University, Hangzhou 310018, China and School of Information Science and Technology, Zhejiang Sci-Tech University, Hangzhou 310018, China
Abstract:As a decision tree generated algorithm, C4.5 algorithm is very influential. But the decision tree classification by C4.5 algorithm is of less accuracy, more branches, and larger scale. To solve these problems, we propose a C4.5 improved algorithm based on rough set theory and CAIM criterion. The algorithm uses the discretization method based on CAIM criterion to process the continuous attributes, which decreases the information loss degree and improve the classification accuracy in discretization. The discretized sample is reduced by attribute reduction method based on rough set theory, which eliminates the redundant attribute and trims the size of decision tree. Experiments show that the algorithm can effectively improve the classification accuracy of decision tree generated by C4.5 algorithm and reduce the scale of decision tree.
Keywords:rough set  discretization  attribute reduction  decision tree  CAIM  C4  5
点击此处可从《计算机系统应用》浏览原始摘要信息
点击此处可从《计算机系统应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号