首页 | 本学科首页   官方微博 | 高级检索  
     

基于粗糙集的决策树构造算法
引用本文:丁春荣,李龙澍,杨宝华.基于粗糙集的决策树构造算法[J].计算机工程,2010,36(11):75-77.
作者姓名:丁春荣  李龙澍  杨宝华
作者单位:1. 安徽农业大学信息与计算机学院,合肥,230036
2. 安徽大学计算机科学与技术学院,合肥,230039
基金项目:国家自然科学基金资助项目(60273043);安徽省高校省级自然科学基金资助项目(KJ2007B158)
摘    要:针对ID3算法构造决策树复杂、分类效率不高问题,基于粗糙集理论提出一种决策树构造算法。该算法采用加权分类粗糙度作为节点选择属性的启发函数,与信息增益相比,能全面地刻画属性分类的综合贡献能力,并且计算简单。为消除噪声对选择属性和生成叶节点的影响,利用变精度粗糙集模型对该算法进行优化。实验结果表明,该算法构造的决策树在规模与分类效率上均优于ID3算法。

关 键 词:数据挖掘  粗糙集  可变精度粗糙集  决策树  加权分类粗糙度

Decision Tree Constructing Algorithm Based on Rough Set
DING Chun-rong,LI Long-shu,YANG Bao-hua.Decision Tree Constructing Algorithm Based on Rough Set[J].Computer Engineering,2010,36(11):75-77.
Authors:DING Chun-rong  LI Long-shu  YANG Bao-hua
Affiliation:(1. School of Information and Computer, Anhui Agricultural University, Hefei 230036; 2. School of Computer Science and Technology, Anhui University, Hefei 230039)
Abstract:Aiming at the problems of complex and low efficiency decision tree constructed by ID3, this paper proposes a decision tree classification algorithm based on rough set, which takes the weighted classification rough degree as the heuristic function of choosing attribute at a node. This heuristic function can synthetically measure contribution of an attribute for classification, and is simple in calculation. To eliminate the effect of noise data on choosing attributes and generating leaf nodes, a method using variable precision rough set model is used to optimize the algorithm. Experimental results show that the size of trees generated by the new algorithm is smaller and higher accuracy than ID3 algorithm.
Keywords:data mining  rough set  variable precision rough set  decision tree  weighted classification roughness
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《计算机工程》浏览原始摘要信息
点击此处可从《计算机工程》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号