首页 | 本学科首页   官方微博 | 高级检索  
     

基于原型抽象和分类价值量的决策树算法*
引用本文:周亮,晏立.基于原型抽象和分类价值量的决策树算法*[J].计算机应用研究,2010,27(8):2899-2901.
作者姓名:周亮  晏立
作者单位:江苏大学,计算机科学与通信工程学院,江苏,镇江,212013
基金项目:国家自然科学基金资助项目(70971067);国家科技型中小企业技术创新基金资助项目(09C26213203797)
摘    要:为了克服现有决策树分类算法在大数据集上的有效性和可伸缩性的局限,提出一种新的基于粗糙集理论的决策树算法。首先提出基于代表性实例的原型抽象方法,该方法从原始数据集中抽取代表性实例组成抽象原型,可缩减实例数目和无关属性,从而使算法可以处理大数据集;然后提出属性分类价值量概念,并作为选择属性的启发式测度,该测度描述了属性对分类的贡献价值量的多少,侧重考虑了属性之间以及实例与分类之间的关系。实验表明,新算法比其他算法生成的决策树规模要小,准确率也有显著提高,在大数据集上尤为明显。

关 键 词:决策树    粗糙集    大数据集    代表性实例    原型抽象    属性分类价值量

Decision tree algorithm using archetype abstraction and attribute classification value
ZHOU Liang,YAN Li.Decision tree algorithm using archetype abstraction and attribute classification value[J].Application Research of Computers,2010,27(8):2899-2901.
Authors:ZHOU Liang  YAN Li
Affiliation:(School of Computer Science & Communication Engineering, Jiangsu University, Zhenjiang Jiangsu 212013, China)
Abstract:In order to overcome the shortcomings of decision tree algorithm in large data sets, this paper proposed a novel decision tree algorithm based on rough set. The algorithm put forward a method based on representative instance for archetype abstraction, which extracted representative instances from original data set as abstraction archetype and decreased the number of instances and irrelevant attributes, hence it could deal with large data set. Simultaneity, the algorithm took attribute classification value as a heuristic measure for choosing attribute, which synthetically calculated contribution of an attribute for classification. It principally considered the dependency between attributes or relationship between instances and the classification. Mining experiments show that it can obtain higher accuracy and smaller size of decision tree than other algorithms, which make it more excellent for large data sets.
Keywords:decision tree  rough set  large data sets  representative instance  abstraction archetype  classification value
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号