首页 | 本学科首页   官方微博 | 高级检索  
     

多维数据集中聚类数确定算法研究
引用本文:周红芳,李红岩,刘颖,王晓东.多维数据集中聚类数确定算法研究[J].计算机工程,2012,38(9):8-11.
作者姓名:周红芳  李红岩  刘颖  王晓东
作者单位:1. 西安理工大学计算机科学与工程学院,西安,710048
2. 攀枝花学院计算机学院,四川攀枝花,617000
3. 解放军防空兵指挥学院,郑州,450052
基金项目:国家“863”计划基金资助重点项目(2007AA010305);陕西省自然科学基础研究计划基金资助项目(SJ08-ZT14);陕西省教育厅科学研究计划基金资助项目(06JK229,09JK683)
摘    要:在传统确定数据集聚类数算法原理的基础上,提出一种新的算法——MHC算法。该算法采用自底向上的策略生成不同层次的数据集划分,计算每个层次的聚类划分质量,通过聚类质量选择最佳的聚类数。还设计一种新的有效性指标——BIP指标,用于衡量不同划分的聚类质量,该指标主要依托数据集的几何结构。实验结果表明,该算法能准确地确定多维数据集中的最佳聚类数。

关 键 词:多维数据集  聚类数  聚类有效性指标  层次聚类
收稿时间:2011-09-13

Research on Determinition Algorithm of Clustering Number in Multi-dimensional Dataset
ZHOU Hong-fang , Li Hong-yan , LIU Ying , WANG Xiao-dong.Research on Determinition Algorithm of Clustering Number in Multi-dimensional Dataset[J].Computer Engineering,2012,38(9):8-11.
Authors:ZHOU Hong-fang  Li Hong-yan  LIU Ying  WANG Xiao-dong
Affiliation:1.School of Computer Science and Engineering,Xi’an University of Technology,Xi’an 710048,China;2.School of Computer,Panzhihua University,Panzhihua 617000,China;3.PLA Air Defense Forces Command Academy,Zhengzhou 450052,China)
Abstract:In order to better determine the optimal clustering number for multi-dimensional data,this paper proposes an new algorithm——MHC,which is based on the principle of the traditional algorithm to determine the clustering number for the dataset.This algorithm adopts bottom-up method to generate dataset partition of different levels.In every division,the algorithm automatically generates the partition of clustering quality,and chooses the optimal clustering number by the clustering quality.Additionally,it still presents a new clustering validity index——Between-In-Proportion(BIP),which is used to measure the different division of clustering quality,and mainly depends on the geometrical structure of datasets.Theoretical analysis and experimental results verify the effectiveness and good performance of the new validity index and the MHC algorithm.
Keywords:multi-dimensional dataset  clustering number  clustering validity indicator  hierarchy clustering
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机工程》浏览原始摘要信息
点击此处可从《计算机工程》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号