首页 | 本学科首页   官方微博 | 高级检索  
     


Cluster ranking with an application to mining mailbox networks
Authors:Ziv Bar-Yossef  Ido Guy  Ronny Lempel  Yoëlle S Maarek  Vladimir Soroka
Affiliation:(1) Department of Electrical Engineering, Technion, Haifa, Israel;(2) Department of Computer Science, Technion, Haifa, Israel;(3) IBM Research Lab in Haifa, 31905 Haifa, Israel;(4) Google, Haifa Engineering Center, Haifa, Israel
Abstract:We initiate the study of a new clustering framework, called cluster ranking. Rather than simply partitioning a network into clusters, a cluster ranking algorithm also orders the clusters by their strength. To this end, we introduce a novel strength measure for clusters—the integrated cohesion—which is applicable to arbitrary weighted networks. We then present a new cluster ranking algorithm, called C-Rank. We provide extensive theoretical and empirical analysis of C-Rank and show that it is likely to have high precision and recall. A main component of C-Rank is a heuristic algorithm for finding sparse vertex separators. At the core of this algorithm is a new connection between vertex betweenness and multicommodity flow. Our experiments focus on mining mailbox networks. A mailbox network is an egocentric social network, consisting of contacts with whom an individual exchanges email. Edges between contacts represent the frequency of their co–occurrence on message headers. C-Rank is well suited to mine such networks, since they are abundant with overlapping communities of highly variable strengths. We demonstrate the effectiveness of C-Rank on the Enron data set, consisting of 130 mailbox networks.
Keywords:Clustering  Ranking  Communities  Social networks  Social network analysis  Graph algorithms
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号