首页 | 本学科首页   官方微博 | 高级检索  
     

子空间聚类算法的研究新进展
引用本文:陈慧萍,王煜,王建东.子空间聚类算法的研究新进展[J].计算机仿真,2007,24(3):6-10,34.
作者姓名:陈慧萍  王煜  王建东
作者单位:1. 河海大学计算机信息工程学院,江苏,常州,213022;南京航空航天大学,江苏,南京,210016
2. 河海大学计算机信息工程学院,江苏,常州,213022
3. 南京航空航天大学,江苏,南京,210016
基金项目:国家重点基础研究发展计划(973计划)
摘    要:高维数据聚类是聚类技术的难点和重点,子空间聚类是实现高维数据集聚类的有效途径,它是在高维数据空间中对传统聚类算法的一种扩展,其思想是将搜索局部化在相关维中进行.该文从不同的搜索策略即自顶向下策略和自底向上策略两个方面对子空间聚类算法的思想进行了介绍,对近几年提出的子空间聚类算法作了综述,从算法所需参数、算法对参数的敏感度、算法的可伸缩性以及算法发现聚类的形状等多个方面对典型的子空间聚类算法进行了比较分析,对子空间聚类算法面临的挑战和未来的发展趋势进行了讨论.

关 键 词:数据挖掘  聚类  高维数据集  子空间  空间聚类算法  研究新进展  Clustering  Subspace  Advances  趋势  发展  分析  比较  形状  发现  算法的可伸缩性  敏感度  参数  自底向上  搜索策略  自顶向下  相关维  局部化  思想
文章编号:1006-9348(2007)03-0006-05
修稿时间:2005-12-262006-02-07

Research and Advances of Subspace Clustering
CHEN Hui-ping,WANG Yu,WANG Jian-dong.Research and Advances of Subspace Clustering[J].Computer Simulation,2007,24(3):6-10,34.
Authors:CHEN Hui-ping  WANG Yu  WANG Jian-dong
Affiliation:1. Hohai University, Changzhou Jiangsu 213022, China ;2. College of Information Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing Jiangsu 210016, China
Abstract:The clustering of high dimensional data is a key problem in clustering methods. Subspace clustering is an effective approach to realize clustering in high dimensional data. It is an extension of traditional clustering that seeks to find clusters in different subspaces within a high dimensional dataset and it localizes the search for relevant dimensions. In Ibis paper, the ideas of subspace clustering are introduced from two different search strategies such as top -down approach and bottom - up approach. And the recent subspace clustering algorithms are reviewed. The comparison among the typical subspace clustering algorithms is made from some aspects which are the parameters of the algorithm, the sensitivity of parameters to algorithms, the scalability of the algorithm and the shape of the clustering. At last, the paper proposes the challenges of subspace clustering and discusses the trends in the future.
Keywords:Data mining  Clustering  High dimensional datasets  Subspace
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号