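Research on a New Algorithm for Mining Classification Rules Based on Continuous Attributes
Cite this article: SHE Xiangyang, XUE Huifeng. Research on a New Algorithm for Mining Classification Rules Based on Continuous Attributes [J]. Computer Engineering, 2005, 31(18): 28-30.
Authors: SHE Xiangyang (厍向阳), XUE Huifeng (薛惠锋)
Affiliations: 1. School of Automation, Northwestern Polytechnical University, Xi'an 710072; Institute of Architectural Surveying, Xi'an University of Architecture and Technology, Xi'an 710055
2. School of Automation, Northwestern Polytechnical University, Xi'an 710072
Funding: Natural Science Foundation of Shaanxi Province (200104-G15)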
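Abstract: This paper analyzes the shortcomings of existing data mining approaches for samples with continuous attributes and proposes an algorithm that mines classification rules directly from such samples. The algorithm classifies the instance samples using split points on attribute values, selects split points according to their ability to classify the instance samples, and uses as its stopping condition that all consistent samples have been partitioned into subsets sharing the same class attribute value. This makes the mining of classification rules from continuous-attribute samples fully automatic. The algorithm takes the goals and requirements of data mining into account and exploits both the dependence between attributes and classes and the complementarity among attributes, so that it needs few split points, yields simple classification rules, and reduces the attribute set. Finally, the algorithm is validated on an example and compared with C4.5.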

Keywords: Continuous attributes; Data mining; Classification rules; New algorithm
Article ID: 1000-3428(2005)18-0028-03
Received: 2004-07-24
Revised: 2004-07-24

New Algorithms of Mining Classification Rules in the Database on Continuous Valued Attributes
SHE Xiangyang, XUE Huifeng. New Algorithms of Mining Classification Rules in the Database on Continuous Valued Attributes [J]. Computer Engineering, 2005, 31(18): 28-30.
Authors:SHE Xiangyang  Xue Huifeng
Affiliation: 1. Auto-control College, Northwest Polytechnic University, Xi'an 710072; 2. Institute of Surveying and Mapping, Xi'an Univ. of Arch.
Abstract: The paper analyses the shortcomings of existing methods for mining classification rules from continuous-valued attributes and proposes a new algorithm that mines such rules directly. The algorithm evaluates each candidate splitting point by its ability to classify the samples, selects the best one to split on, and stops when every subset of the consistent samples carries a single class label. It takes the aims and requirements of data mining into account and makes the most of the dependence between attributes and class labels and of the complementarity among attributes, in order to minimize the number of splitting points, simplify the classification rules, and reduce the number of features. Finally, the algorithm is validated on an example and compared with C4.5.
Keywords:Continuous valued attributes  Data mining  Classification rules  New algorithms
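
The splitting procedure outlined in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not say how the "classifying ability" of a splitting point is measured, so the sketch assumes a simple stand-in: the number of samples that fall on the majority-class side of each half. Candidate splitting points are assumed to be midpoints between consecutive distinct attribute values, and inconsistent samples (identical attribute values, different labels) are assumed to fall back to the majority label. All function names are hypothetical.

```python
from collections import Counter

def candidate_splits(samples, attr):
    """Midpoints between consecutive distinct values of one attribute (assumed scheme)."""
    values = sorted({x[attr] for x, _ in samples})
    return [(a + b) / 2 for a, b in zip(values, values[1:])]

def purity(samples):
    """Number of samples carrying the majority class label (assumed ability measure)."""
    if not samples:
        return 0
    return Counter(label for _, label in samples).most_common(1)[0][1]

def best_split(samples):
    """Return the (score, attr, threshold, left, right) with the highest total purity."""
    best = None
    for attr in range(len(samples[0][0])):
        for t in candidate_splits(samples, attr):
            left = [s for s in samples if s[0][attr] <= t]
            right = [s for s in samples if s[0][attr] > t]
            score = purity(left) + purity(right)
            if best is None or score > best[0]:
                best = (score, attr, t, left, right)
    return best

def mine_rules(samples, conditions=()):
    """Split recursively until every subset carries a single class label."""
    labels = {label for _, label in samples}
    if len(labels) == 1:  # stopping condition: subset is pure
        return [(conditions, labels.pop())]
    split = best_split(samples)
    if split is None:  # inconsistent samples: no splitting point exists
        majority = Counter(label for _, label in samples).most_common(1)[0][0]
        return [(conditions, majority)]
    _, attr, t, left, right = split
    return (mine_rules(left, conditions + ((attr, "<=", t),))
            + mine_rules(right, conditions + ((attr, ">", t),)))

# Four made-up samples: (attribute vector, class label).
samples = [((1.0, 5.0), "a"), ((2.0, 6.0), "a"),
           ((3.0, 5.5), "b"), ((4.0, 6.5), "b")]
rules = mine_rules(samples)
for conds, label in rules:
    print(" and ".join(f"x[{a}] {op} {t}" for a, op, t in conds), "->", label)
```

On this toy data a single splitting point on the first attribute separates the two classes, so the second attribute never appears in a rule. This is the kind of attribute reduction the abstract refers to.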
This article is indexed in CNKI, VIP (维普), Wanfang Data, and other databases.