首页 | 本学科首页   官方微博 | 高级检索  
     

数量关联规则发现中的聚类方法研究
引用本文:苑森淼,程晓青.数量关联规则发现中的聚类方法研究[J].计算机学报,2000,23(8):866-871.
作者姓名:苑森淼  程晓青
作者单位:1. 吉林工业大学计算机科学与工程系,长春,130025
2. 吉林工业大学应用数学系,长春,130025
基金项目:国家自然科学基金!( 69873 0 19)
摘    要:应用聚类方法研究了数量关联规则提取过程中的连续属性离散化问题,由于现存的方法倾向于将支持度较高的区域划分为多个区间,对高偏数据效果不理想,针对这一问题,提出聚类算法PKCCA,与传统快速聚类不同,PKCCA在迭代过程中动态调整中心个数,避免造成小支持度问题,并继承了传统快速聚类适合大样本的优点。

关 键 词:数据挖掘  数量关联规则  聚类方法  数据库
修稿时间:1999-05-12

Clustering Method for Mining Quantitative Association Rules
YUAN Sen-Miao,CHENG Xiao-Qing.Clustering Method for Mining Quantitative Association Rules[J].Chinese Journal of Computers,2000,23(8):866-871.
Authors:YUAN Sen-Miao  CHENG Xiao-Qing
Abstract:This paper presents a cluster method for discretization in the processing of mining quantitative association rules. Many efficient algorithms for mining association rules were presented in recent years. But association rules containing quantitative attributes is still a bottle neck in these works. Agrawal presented partial k completeness method for partitioning. Fukuda presented the equi depth partitioning and algorithm based on gaining optimized regions. Current method doesn't work very well on highly skewed data because it tends to split intervals with relative high support into several intervals, even if their behaviors are very similar. In order to solve this problem, this paper presents a Cluster Algorithm named PKCCA. It is different from traditional Cluster Algorithm on that PKCCA dynamically adjust the number of median in iterations, but it still fit for large sample that was the character of fast clustering. For efficiency consideration, sampling method is adopted in this paper.
Keywords:data mining  quantitative association rules  cluster algorithm  sample  
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号