首页 | 本学科首页   官方微博 | 高级检索  
     

一种混合聚类算法及其应用
引用本文:胡瑞飞,殷国富,谭颖.一种混合聚类算法及其应用[J].四川大学学报(工程科学版),2006,38(5):156-161.
作者姓名:胡瑞飞  殷国富  谭颖
作者单位:四川大学,制造科学与工程学院,四川,成都,610065
摘    要:通过分析基于网格与基于密度的聚类算法特征,提出了一种基于网格和密度的混合聚类算法,通过分阶段聚类并选取代表单元中的种子对象来扩展类, 从而减少区域查询次数,实现快速聚类。该算法保持了基于密度的聚类算法可以发现任意形状的聚类和对噪声数据不敏感的优点,同时保持了基于网格的聚类算法的高效性,适合对大规模数据的挖掘。实验数据分析验证了算法的有效性,对数据挖掘应用于设备状态监测和故障诊断具有指导意义。

关 键 词:数据挖掘  聚类  种子对象
文章编号:1009-3087(2006)05-0156-06
收稿时间:10 20 2005 12:00AM
修稿时间:2005-10-20

A Hybrid Clustering Algorithm and It's Application
HU Rui-fei,YIN Guo-fu,TAN Ying.A Hybrid Clustering Algorithm and It''''s Application[J].Journal of Sichuan University (Engineering Science Edition),2006,38(5):156-161.
Authors:HU Rui-fei  YIN Guo-fu  TAN Ying
Affiliation:School of Manufacturing Sci. and Eng. , Sichuan Univ. , Chengdu 610065, China
Abstract:Grounding on the analysis of features of grid-based and density-based clustering methods, a hybrid clustering algorithm based on grid and density was presented. By clustering in two phases and using only a small number of seed objects in representative units to expand the cluster, the frequency of region query can be decreased, and consequently the cost of time is reduced. An equivalent rule was proposed to make smooth conversion between clustering parameters in that two phases. The algorithm keeps good feature of both density-based and grid-based clustering methods. It can discover clusters with arbitrary shape with high efficiency and is insensitive to noise. So it is applicable for data mining on large database. The application of the hybrid algorithm in data analysis of accelerometer demonstrates its effectiveness. It is of instructional meaning for the application of data mining in equipment monitoring and faults diagnosis.
Keywords:data mining  clustering  seed object
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《四川大学学报(工程科学版)》浏览原始摘要信息
点击此处可从《四川大学学报(工程科学版)》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号