首页 | 本学科首页   官方微博 | 高级检索  
     

一种基于随机块模型的快速广义社区发现算法
引用本文:柴变芳,于剑,贾彩燕,王静红.一种基于随机块模型的快速广义社区发现算法[J].软件学报,2013,24(11):2699-2709.
作者姓名:柴变芳  于剑  贾彩燕  王静红
作者单位:交通数据分析与挖掘北京市重点实验室北京交通大学, 北京 100044;石家庄经济学院 信息工程系, 河北 石家庄 050031;交通数据分析与挖掘北京市重点实验室北京交通大学, 北京 100044;交通数据分析与挖掘北京市重点实验室北京交通大学, 北京 100044;河北师范大学 信息技术学院, 河北 石家庄 050024
基金项目:国家自然科学基金(61033013,61370129);教育部创新团队项目(K10JB00440);北京市自然科学基金(4112046);中央高校基本科研业务费专项资金;河北省科技厅项目(13210702D);河北省教育厅项目(ZD2010128);民航局科技基金项目(K12I00051)
摘    要:随机块模型可以生成各种不同结构(称作广义社区,包括传统社区、二分结构、层次结构等)的网络,也可以根据概率对等原则发现网络中的广义社区.但简单的随机块模型在网络生成过程建模和模型学习方面存在许多问题,导致不能很好地发现实际网络的结构,其扩展模型GSB(general stochastic block)基于链接社区思想发现广义社区,但时间复杂度限制其在中大型规模网络中的应用.为了在无任何先验的情形下探索不同规模网络的潜在结构,基于GSB 模型设计一种快速算法FGSB,更快地发现网络的广义社区.FGSB 在迭代过程中动态学习网络结构参数,将GSB 模型的参数重新组织,减少不必要的参数,降低算法的存储空间;对收敛节点和边的参数进行裁剪,减少每次迭代的相关计算,节省算法的运行时间.FGSB 与GSB 模型求解算法有相同的结构发现能力,但FGSB 耗费的存储空间和运行时间比GSB 模型求解算法要低.在不同规模的人工网络和实际网络上验证得出:在近似相同的准确率下,FGSB 比GSB 模型求解算法快,且可发现大型网络的广义社区.

关 键 词:随机块模型  广义社区  时间复杂度  复杂网络
收稿时间:2013/4/22 0:00:00
修稿时间:2013/7/17 0:00:00

Fast Algorithm on Stochastic Block Model for Exploring General Communities
CHAI Bian-Fang,YU Jian,JIA Cai-Yan and WANG Jing-Hong.Fast Algorithm on Stochastic Block Model for Exploring General Communities[J].Journal of Software,2013,24(11):2699-2709.
Authors:CHAI Bian-Fang  YU Jian  JIA Cai-Yan and WANG Jing-Hong
Affiliation:Beijing Key Laboratory of Traffic Data Analysis and Mining Beijing Jiaotong University, Beijing 100044, China;Department of Information Engineering, Shijiazhuang University of Economics, Shijiazhuang 050031, China;Beijing Key Laboratory of Traffic Data Analysis and Mining Beijing Jiaotong University, Beijing 100044, China;Beijing Key Laboratory of Traffic Data Analysis and Mining Beijing Jiaotong University, Beijing 100044, China;College of Information Technology, Hebei Normal University, Shijiazhuang 050024, China
Abstract:A stochastic block model can produce a wide variety of networks with different structures (named as general community, including traditional community, bipartite structure, hierarchical structure and etc); it also can detect general community in networks according to the rules of stochastic equivalence. However, the simple stochastic block model has some problems in modeling the generation of the networks and learning the models, showing poor results in fitting the practical networks. The GSB (general stochastic block) model is an extension of the stochastic block model, which is based on the idea of link community and is provided to detect general communities. But its complexity limits its applications in medium and large networks. In order to explore the latent structures of networks with different scales without prior knowledge about networks, a fast algorithm on the GSB model (FGSB) is designed to explore general communities in networks faster. FGSB dynamically learns the parameters related to the network structure in the process of iterations. It reduces the storage memory by reorganizing parameters to cut down unnecessary parameters, and saves the running time by pruning the related parameters of converging nodes and edges to decrease the computing time of each iteration. FGSB has the same ability of structure detection as the GSB model, but its complexities of time and storage are lower. Tests on synthetic benchmarks and real-world networks have demonstrated that FGSB not only can run faster than the algorithm of the GSB model in the similar accuracy, but also can detect general communities for large networks.
Keywords:stochastic block model  general community  time complexity  complex network
点击此处可从《软件学报》浏览原始摘要信息
点击此处可从《软件学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号