首页 | 本学科首页   官方微博 | 高级检索  
     

基于k-d树的k-means聚类方法
引用本文:孙总参,陶兰,齐建东,王保迎.基于k-d树的k-means聚类方法[J].计算机工程与设计,2004,25(11):2054-2057.
作者姓名:孙总参  陶兰  齐建东  王保迎
作者单位:1. 中国农业大学,信息与电气工程学院,北京,100083
2. 中国农业大学,信息与电气工程学院,北京,100083;深圳大学,信息工程学院,广东,深圳,518060
3. 北京林业大学,信息学院,北京,100083
摘    要:在直接k-means算法的基础上提出了一种新的基于k-d树的聚类方法。通过把所有的对象组织在一棵k-d树中,可以高效地发现给定原型的所有最近邻对象。利用的主要思想是:在根结点,所有的聚类中心(或称为候选原型)都是所有对象的最近邻候选集合,对于根结点的子结点,通过简单几何约束来剪枝该候选集,这种方法可以被递归使用。使用基于k-d树的方法可以使直接k-means算法的总体性能提高一到两个数量级。

关 键 词:k-d树  k-means算法  候选集  k-means聚类  对象组  结点  递归  类方  根结  方法
文章编号:1000-7024(2004)11-2054-04

K-means clustering algorithm based on k-d tree
SUN Zong-can,TAO Lan,QI Jian-dong,WANG Bao-ying.K-means clustering algorithm based on k-d tree[J].Computer Engineering and Design,2004,25(11):2054-2057.
Authors:SUN Zong-can  TAO Lan  QI Jian-dong  WANG Bao-ying
Abstract:A novel algorithm for performing direct k-means clustering is presented based on k-d tree. It organizes all the patterns in a k-d tree that all the patterns can be found which are closest to a given prototype efficiently. The main idea behind this approach is as follows. All the prototypes are potential candidates for the closest prototype at the root level. For the children of the root node, pruning the candidate set by using simple geometrical constraints, this approach can be applied recursively. The algorithm based on k-d tree can improve the overall performance of direct k-means clustering algorithm by an order to two orders of magnitude.
Keywords:k-means algorithm  k-d tree  clustering  data mining
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号