首页 | 官方网站   微博 | 高级检索  
     

决策树算法的优化研究
引用本文:巩固,黄永青,郝国生.决策树算法的优化研究[J].计算机工程与应用,2010,46(13):139-141.
作者姓名:巩固  黄永青  郝国生
作者单位:1. 徐州师范大学计算机科学与技术学院,江苏徐州,221116
2. 徐州师范大学计算机科学与技术学院,江苏徐州221116;中国矿业大学信息与电气工程学院,江苏徐州221008
基金项目:江苏省高校自然科学基础研究No.07KJD520216;;徐州师范大学项目基金No.KY200710~~
摘    要:针对决策树C4.5/5.0分类算法及改进的算法在创建决策树时训练误差率和校验误差率相对较高的缺点,提出一些改进策略,即利用属性相关性进行属性约简与度量以达到解决属性集合中的冗余属性,采用一定置信度值进行决策树的修剪,采用优化的Chi2算法更合理更准确地对连续属性进行离散化,基于改进策略设计并实现一个分类器,将改进的算法应用于Breast-cancer实例,实验结果证明改进的算法生成的决策树具有较高的分类正确率。

关 键 词:属性相关性  属性约束  剪枝策略  离散化  Chi2算法
收稿时间:2008-10-23
修稿时间:2008-12-22  

Analysis and improved implementation of decision tree algorithms
GONG Gu,HUANG Yong-qing,HAO Guo-sheng.Analysis and improved implementation of decision tree algorithms[J].Computer Engineering and Applications,2010,46(13):139-141.
Authors:GONG Gu  HUANG Yong-qing  HAO Guo-sheng
Affiliation:1,21.College of Computer Science and Technology,Xuzhou Normal University,Xuzhou,Jiangsu 221116,China 2.College of Information and Electronic Engineering,China University of Mining and Technology,Xuzhou,Jiangsu 221008,China
Abstract:In order to effectively deal with the problems that the training error and test error are comparatively high when decision tree is built based on C4.5 and C5.0 decision tree algorithms,three improved strategies are presented.The improved strategies are as follows:Attribute correlation that can not only remove irrelevant features,also can find redundant feature with high feature correlation,is to quantify the correlation between attribute and concept;pruning strategy adopts appropriate confidence to good pur...
Keywords:attribute correlation  attribute reduction  pruning strategy  discretization  Chi2 algorithm
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机工程与应用》浏览原始摘要信息
点击此处可从《计算机工程与应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号