首页 | 本学科首页   官方微博 | 高级检索  
     

决策树分类算法C4.5中连续属性过程处理的改进
引用本文:李慧慧,万武族. 决策树分类算法C4.5中连续属性过程处理的改进[J]. 计算机与现代化, 2010, 0(8): 8-10. DOI: 10.3969/j.issn.1006-2475.2010.08.003
作者姓名:李慧慧  万武族
作者单位:1. 贵州人民武装学院信息工程系,贵州,贵阳,550025
2. 贵州大学计算机科学系,贵州,贵阳,550025
基金项目:贵州省省长基金资助项目,贵州大学自然科学青年基金资助项目 
摘    要:决策树分类算法C4.5是数据挖掘中最常用、最经典的分类算法。但是C4.5算法也存在一些不足之处,针对C4.5算法处理连续属性比较耗时的特点,本文对连续的处理过程进行改进,以提高算法的计算效率。改进的C4.5算法与原C4.5算法相比,在构造决策树时具有相同的准确率和更高的计算速度。

关 键 词:数据挖掘  决策树  C4.5算法  连续属性

Improvement of Continuous Variables Processing with C4.5 Algorithm
LI Hui-hui,WAN Wu-zu. Improvement of Continuous Variables Processing with C4.5 Algorithm[J]. Computer and Modernization, 2010, 0(8): 8-10. DOI: 10.3969/j.issn.1006-2475.2010.08.003
Authors:LI Hui-hui  WAN Wu-zu
Affiliation:1.Department of Information Engineering,The People's Armed College of Guizhou,Guiyang 550025,China;2.Department of Computer Science,Guizhou University,Guiyang 550025,China)
Abstract:The decision tree classification algorithm C4.5 is the most popular and classical classification algorithm in the data mining.But,there are some defects in it,the processing of continuous variables in the C4.5 algorithm consumes too much time,according to this characteristic,the paper improves the processing of continuous variables to enhance the efficiency of the algorithm.The improved algorithm has better efficiency and has the same accuracy comparing with the C4.5 algorithm when building decision tree.
Keywords:data mining  decision tree  C4.5 algorithm  continuous variables
本文献已被 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号