Decision Tree Algorithm Based on Attribute Significance
Cite this article: WANG Rong, LIU Zun-ren, JI Jun. Decision Tree Algorithm Based on Attribute Significance[J]. Computer Science, 2017, 44(Z11): 129-132.
Authors: WANG Rong, LIU Zun-ren, JI Jun
Affiliations: Department of Data Science and Software Engineering, Qingdao University, Qingdao 266071, China (WANG Rong); Department of Computer Science and Technology, Qingdao University, Qingdao 266071, China (LIU Zun-ren, JI Jun)
Funding: Supported by the National Natural Science Foundation of China (61503208).
Abstract: The traditional ID3 decision tree algorithm suffers from difficult attribute selection, low classification efficiency, weak noise tolerance, and poor scalability to large data sets. To address these problems, a decision tree algorithm based on attribute significance and variable precision rough sets is proposed, which removes noisy data while keeping the tree from growing too large. The algorithm was validated on several UCI standard data sets. The experimental results show that it outperforms ID3 in both the size of the resulting decision tree and classification accuracy.

Keywords: Decision tree; Attribute significance; Variable precision rough set; Attribute reduction; Data mining
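
The abstract names attribute significance and variable precision rough sets but does not give their definitions. As a rough illustration only, the sketch below uses the standard rough-set notions: the dependency of the decision on an attribute set is the fraction of objects in its positive region, and the significance of an attribute is the drop in dependency when that attribute is removed. The inclusion threshold `beta`, the function names, and the toy data are all assumptions made for this example and are not taken from the paper.

```python
from collections import Counter, defaultdict

def partition(rows, attrs):
    """Group row indices into equivalence classes by their values on attrs."""
    blocks = defaultdict(list)
    for i, row in enumerate(rows):
        blocks[tuple(row[a] for a in attrs)].append(i)
    return list(blocks.values())

def dependency(rows, labels, attrs, beta=1.0):
    """Fraction of rows in the beta-positive region of the decision.

    A block counts as positive when its majority label covers at least
    a share `beta` of the block (beta=1.0 is the classical rough set;
    beta<1.0 tolerates some noise; this threshold form is an assumption).
    """
    covered = 0
    for block in partition(rows, attrs):
        majority = Counter(labels[i] for i in block).most_common(1)[0][1]
        if majority / len(block) >= beta:
            covered += len(block)
    return covered / len(rows)

def significance(rows, labels, attrs, a, beta=1.0):
    """Drop in dependency when attribute `a` is removed from `attrs`."""
    rest = [x for x in attrs if x != a]
    return dependency(rows, labels, attrs, beta) - dependency(rows, labels, rest, beta)

# Hypothetical toy data: row 3 carries a noisy label.
rows = [
    {"a": 0, "b": 0}, {"a": 0, "b": 0}, {"a": 0, "b": 0}, {"a": 0, "b": 0},
    {"a": 1, "b": 0}, {"a": 1, "b": 1}, {"a": 1, "b": 0}, {"a": 1, "b": 1},
]
labels = [0, 0, 0, 1, 1, 1, 1, 1]
attrs = ["a", "b"]

for beta in (1.0, 0.75):
    scores = {x: significance(rows, labels, attrs, x, beta) for x in attrs}
    print(beta, scores)
```

On this toy data, attribute `a` has significance 0.25 under the classical setting (`beta=1.0`) and 0.75 once one noisy object per block is tolerated (`beta=0.75`), while `b` has significance 0 in both cases. A tree builder along these lines might split on the most significant attribute first; whether the paper does exactly this cannot be determined from the abstract.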