首页 | 本学科首页   官方微博 | 高级检索  
     

竞争选择分裂属性的决策树分类模型
引用本文:房立,黄泽宇. 竞争选择分裂属性的决策树分类模型[J]. 微机发展, 2006, 16(8): 106-109
作者姓名:房立  黄泽宇
作者单位:北京交通大学计算机与信息技术学院 北京100044
摘    要:构建决策树分类器关键是选择分裂属性。通过分析信息增益和增益比率、Gini索引、基于Goodman-Kruskal关联索引这三种选择分裂属性的标准,提出了一种改进经典决策树分类器C4.5算法的方法(竞争选择分裂属性的决策树分类模型),它综合三种选择分裂属性的标准,通过竞争机制选择最佳分裂属性。实验结果表明它在大多数情况下,使得不牺牲分类精确度而获得更小的决策树成为了可能。

关 键 词:决策树  信息增益  增益比率  Gini索引  Goodman-Kruskal关联索引
文章编号:1673-629X(2006)08-0106-04
修稿时间:2005-11-30

A Decision-Tree Classifier Model of Competition in Choosing Split Attribute
FANG Li,HUANG Ze-yu. A Decision-Tree Classifier Model of Competition in Choosing Split Attribute[J]. Microcomputer Development, 2006, 16(8): 106-109
Authors:FANG Li  HUANG Ze-yu
Abstract:The construction of decision-tree is centered on the selection algorithm of an attribute that generates a partition of the subsets of the training database that is located in the node about to be split.On the basis of analyzing three techniques for choosing the splitting attributes including the entropy gain and the gain ratio,the gini index and Goodman-Kruskal association index,propose a strategy to improve on classical decision-tree classifier C4.5 arithmetic(a decision-tree classifier model of competition in choosing split attribute).Experimental results show it is possible,in most cases,to obtain smaller decision trees without sacrificing accuracy.
Keywords:decision-tree  entropy gain  gain ratio  gini index  Goodman-Kruskal association index
本文献已被 CNKI 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号