首页 | 本学科首页   官方微博 | 高级检索  
     

滑坡数据连续属性值处理的研究
引用本文:亓呈明,崔守梅.滑坡数据连续属性值处理的研究[J].微计算机信息,2006,22(24):10-11.
作者姓名:亓呈明  崔守梅
作者单位:1. 100101 北京 北京联合大学自动化学院
2. 255100 山东 淄博 山东省淄博师范高等专科学校
摘    要:数据预处理是提高挖掘过程精度和性能的关键。文章在分析决策树算法和滑坡数据属性值特点基础上,利用聚类将连续属性值划分区间,提出了一种针对滑坡数据连续属性值离散化的方法,通过实验,新方法构造的决策树比原算法的分类正确率高,规则冗余少。

关 键 词:连续属性值  聚类  滑坡
文章编号:1008-0570(2006)08-3-0010-02
修稿时间:2005年12月19

Research in Processing Continuous Property Data of Landslide
Qi Chengming,Cui Shoumei.Research in Processing Continuous Property Data of Landslide[J].Control & Automation,2006,22(24):10-11.
Authors:Qi Chengming  Cui Shoumei
Affiliation:(Automation`s college of Beijing Union University,Beijing 100101,China)Qi,Chengming (Zibo normal college,Zibo 255100,China)Cui,Shoumei
Abstract:Data preprocessing is essential to improving accuracy of data mining. Through analyzing the algorithm of decision tree and property of landslide data, we develop a new method to make continuous property discrete using of cluster in this paper. We compare the performance of the method with the performance of the original algorithm on two properties of data sets. The results provide evi- dence that: (a) new method is competitive with original algorithm with respect to predictive accuracy; and (b) The rule sets discov- ered by new method are simpler (smaller) than the rule sets discovered by original algorithm.
Keywords:continuous property  cluster  Landslide
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号