首页 | 本学科首页   官方微博 | 高级检索  
     

一种基于数据场的层次聚类方法
引用本文:淦文燕,李德毅,王建民.一种基于数据场的层次聚类方法[J].电子学报,2006,34(2):258-262.
作者姓名:淦文燕  李德毅  王建民
作者单位:1. 清华大学计算机系,北京 100084;2. 电子系统工程研究所,北京 100039
摘    要:聚类分析是统计、模式识别和数据挖掘等领域中一个非常重要的研究课题,具有广泛的应用前景.受物理学中场论思想的启发,提出一种基于数据场的层次聚类方法.该方法将物质粒子间的相互作用及其场描述方法引入抽象的数域空间,通过模拟对象在虚拟数据场中的相互作用和运动实现数据对象的自组织层次聚集.实验显示,该方法不依赖于用户输入参数的仔细选择,能够发现任意大小和密度的非球形聚类,对噪声数据不敏感,且具有近似线性的收敛速度.

关 键 词:聚类分析  层次聚类  数据场  
文章编号:0372-2112(2006)02-0258-05
收稿时间:2003-07-24
修稿时间:2003-07-242005-09-12

An Hierarchical Clustering Method Based on Data Fields
GAN Wen-yan,LI De-yi,WANG Jian-min.An Hierarchical Clustering Method Based on Data Fields[J].Acta Electronica Sinica,2006,34(2):258-262.
Authors:GAN Wen-yan  LI De-yi  WANG Jian-min
Affiliation:1. Department of Computer Science & Technology,Tsinghua University,Beijing 100084,China;2. Institute of Electronic System Engineering,Beijing 100039,China
Abstract:Clustering is a promising application area for many fields including statistics,pattern recognition,data mining, etc. The effectiveness and efficiency of existing clustering techniques, however, is somewhat limited, owing to the huge amounts data collected in databases. According the theory of fields in physics, a hierarchical clustering method based on data fields is presented. The basic idea is that the field models is introduced to describe the virtual interaction among data objects in data space and the hierarchical partitioning of the original dataset is then performed by iteratively simulating the interaction and movement of the data objects in the fields. Experimental results show that the proposed approach not only enjoys favorite clustering quality and requires no careful parameters tuning, but also has a time complexity approximately linear with respect to the size of dataset.
Keywords:cluster analysis  hierarchical clustering  data field
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《电子学报》浏览原始摘要信息
点击此处可从《电子学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号