首页 | 本学科首页   官方微博 | 高级检索  
     


Semi-supervised clustering via multi-level random walk
Affiliation:1. State Key Laboratory of Software Development Environment, Beihang University, Beijing 100191, China;2. Department of Electrical Engineering, Columbia University, New York, NY 10027, USA;3. Facebook, 1601 Willow Rd, Menlo Park, CA 94025, USA;1. Universidad Autónoma de Aguascalientes, Department of Computer Science, Av. Universidad 940, Col. Ciudad Universitaria, Aguascalientes 20131, Aguascalientes, México;1. Applied Math and Analisis Dept, University of Barcelona, Gran Via de les Corts Catalanes. 585, 08007 Barcelona, Spain;2. Computer Vision Center, Campus UAB, Edifici O, 08193 Bellaterra, Spain;3. Computer Science, Multimedia, and Telecommunications Dept, Universitat Oberta de Catalunya, Rambla del Poblenou 156, 08018 Barcelona, Spain
Abstract:
A key issue of semi-supervised clustering is how to utilize the limited but informative pairwise constraints. In this paper, we propose a new graph-based constrained clustering algorithm, named SCRAWL. It is composed of two random walks with different granularities. In the lower-level random walk, SCRAWL partitions the vertices (i.e., data points) into constrained and unconstrained ones, according to whether they are in the pairwise constraints. For every constrained vertex, its influence range, or the degrees of influence it exerts on the unconstrained vertices, is encapsulated in an intermediate structure called component. The edge set between each pair of components determines the affecting scope of the pairwise constraints. In the higher-level random walk, SCRAWL enforces the pairwise constraints on the components, so that the constraint influence can be propagated to the unconstrained edges. At last, we combine the cluster membership of all the components to obtain the cluster assignment for each vertex. The promising experimental results on both synthetic and real-world data sets demonstrate the effectiveness of our method.
Keywords:Semi-supervised clustering  Pairwise constraint  Influence expansion  Multi-level random walk  Spectral clustering
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号