首页 | 本学科首页   官方微博 | 高级检索  
     

一种基于随机游动的聚类算法
引用本文:李强,何衍,蒋静坪.一种基于随机游动的聚类算法[J].电子与信息学报,2009,31(3):523-526.
作者姓名:李强  何衍  蒋静坪
作者单位:浙江大学电气工程学院,杭州,310027
摘    要:该文提出一种改进的随机游动模型,并在此模型的基础上,发展了一种数据聚类算法。在此算法中,数据集中的样本点根据改进的随机游动模型,生成有权无向图G(V,E,d),其中每个样本点对应图G的一个顶点,并且假设每个顶点为可以在空间中移动的Agent。随后计算每个顶点向其邻集中顶点转移的概率,在随机选定邻集中的一个顶点作为转移方向后,移动一个单位距离。在所有样本点不断随机游动的过程中,同类的样本点就会逐渐的聚集到一起,而不同类的样本点相互远离,最后使得聚类自动形成。实验结果表明,基于随机游动的聚类算法能使样本点合理有效地被聚类,同时,与其他算法对比也说明了此算法的有效性。

关 键 词:数据聚类  随机游动  无监督学习
收稿时间:2007-10-15
修稿时间:2008-9-23

A Random Walk Based Clustering Algorithm
Li Qiang,He Yan,Jiang Jing-ping.A Random Walk Based Clustering Algorithm[J].Journal of Electronics & Information Technology,2009,31(3):523-526.
Authors:Li Qiang  He Yan  Jiang Jing-ping
Affiliation:College of Electrical Engineering, Zhejiang University, Hangzhou 310027, China
Abstract:In this paper, a modified model of random walk is proposed, and then a clustering algorithm is developed based on this model. In the algorithm, at first a weighted and undirected graph is constructed among data points in a dataset according to the model, where each data point corresponds to a vertex in the graph, and is regarded as an agent who can move randomly in space. Next, the transition probabilities of data points are computed, and then each data point chooses a neighbor randomly in its neighborhood as a transition direction and takes a step to it. As all data points walk in space at random repeatedly, the data points that belong to the same class are located at a same position, whereas those that belong to different classes are away from one another. Consequently, the experimental results demonstrate that data points in datasets are clustered reasonably and efficiently. Moreover, the comparison with other algorithms also provides an indication of the effectiveness of the algorithm.
Keywords:Data clustering  Random walk  Unsupervised learning
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《电子与信息学报》浏览原始摘要信息
点击此处可从《电子与信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号