首页 | 本学科首页   官方微博 | 高级检索  
     

基于属性分布相似度的超图高维聚类算法研究
引用本文:陈建斌,宋翰涛.基于属性分布相似度的超图高维聚类算法研究[J].计算机工程与应用,2004,40(34):195-198.
作者姓名:陈建斌  宋翰涛
作者单位:1. 北京理工大学,北京,100081;华北工学院,太原,030051
2. 北京理工大学,北京,100081
摘    要:在许多聚类应用中,数据对象是具有高维、稀疏、二元的特征。传统聚类算法无法有效地处理此类数据。该文提出一种基于超图模型的高维聚类算法,通过定义对象属性分布特征向量和对象间属性分布相似度,建立超图模型,并应用超图分割法进行聚类。聚类结果通过簇内奇异特征值进行评价。实验结果和算法分析表明,该算法可以有效地进行聚类知识挖掘。

关 键 词:高维聚类  超图模型  数据挖掘
文章编号:1002-8331-(2004)34-0195-04

A New Clustering Method in High-Dimension Based on SAD Hypergraph Models
Chen Jianbin, Song Hantao.A New Clustering Method in High-Dimension Based on SAD Hypergraph Models[J].Computer Engineering and Applications,2004,40(34):195-198.
Authors:Chen Jianbin  Song Hantao
Affiliation:Chen Jianbin1,2 Song Hantao11
Abstract:The data sets have features such as high-dimensional,sparseness and binary value in many clustering applications.Most of the traditional algorithms fail to produce meaningful clusters in such data sets.In this paper,we propose a new method for clustering data in a high dimensional space based on a hypergraph model.The hypergraph model maps the relationship present in the original data in high dimensional space into a hypergraph.A hyperedge represents the similarity of attribute value distribution between two points.A hypergraph partitioning algorithm is used to find a partitioning of the vertices such that the corresponding data items in each partition are highly related and the weight of the hyperedges cut by the partitioning is minimized.The quality of the clustering result can be evaluated applying the intra-cluster singularity value.Our analysis demonstrates that this approach is applicable and effective in wide ranging scheme.
Keywords:high-dimensional clustering  hypergraph model  data miningA?
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号