首页 | 本学科首页   官方微博 | 高级检索  
     

基于染色随机游走的可重叠社区发现
引用本文:昌阳,马慧芳.基于染色随机游走的可重叠社区发现[J].计算机工程与科学,2022,44(5):834-844.
作者姓名:昌阳  马慧芳
作者单位:(1.西北师范大学计算机科学与工程学院,甘肃 兰州 730070; 2.广西师范大学广西多源信息挖掘与安全重点实验室,广西 桂林 541001; 3.桂林电子科技大学广西可信软件重点实验室,广西 桂林 541004)
基金项目:西北师范大学青年教师能力提升计划;西北师范大学研究生科研资助项目;国家自然科学基金
摘    要:社区发现是一个基础性的且被广泛研究的问题。现有的社区发现方法大多聚焦于网络拓扑结构,然而随着真实网络中实体可用属性的激增,捕获图中结构和属性的丰富交互关系来进行社区发现变得尤为必要。据此面向属性图提出了一种基于染色随机游走的可重叠社区发现算法OCDC,该算法解决了传统的基于随机游走的社区发现算法利用结构转移矩阵造成社区发现效果不佳的问题。具体地,首先利用经典的初始种子策略选出网络中差异度较大的节点,在此基础上设计种子替换策略,挖掘网络中质量更佳的种子替换路径集合对初始种子集合进行替换;其次构建结构-属性交互节点转移矩阵并执行染色随机游走过程得到高质量种子节点的染色分布向量;最后基于融合结构和属性的并行电导值对社区进行扩展。在人工网络和现实网络上的实验表明,本文提出的算法能够准确地识别属性社区并显著优于基准算法。

关 键 词:社区发现  种子  染色随机游走  并行电导  
收稿时间:2020-09-21
修稿时间:2021-11-12

Overlapping community detection based on colored random walk
CHANG Yang,MA Hui-fang.Overlapping community detection based on colored random walk[J].Computer Engineering & Science,2022,44(5):834-844.
Authors:CHANG Yang  MA Hui-fang
Affiliation:(1.College of Computer Science and Engineering,Northwest Normal University,Lanzhou 730070; 2.Guangxi Key Laboratory of Multi-source Information Mining and Security,Guangxi Normal University,Guilin 541001; 3.Guangxi Key Laboratory of Trusted Software,Guilin University of Electronic Technology,Guilin 541004,China)
Abstract:Community detection is a fundamental and widely-studied problem. Most existing community detection methods focus on network topology. However, with the proliferation of rich information available for entities in real-world networks, it is indispensable to capture the rich interaction between the structure and attributes in the graph for community detection. This paper proposes an Overlapping Community Detection method based on Colored random walk (OCDC). The algorithm is able to conquer the limitation of random walk-based community detection methods that directly utilize the original network topology. Specifically, initial seed nodes in the network is firstly selected. Secondly, a seed replacement strategy is developed to capture a better-quality seed replacement path set. Thirdly, the structure-attribute interaction node transition matrix is generated to perform the colored random walk in order to obtain the colored distribution vector. Finally, based on the combination of structure and attri- bute, the parallel conductance is captured to expand the community. Experiments on synthetic networks and real-world network show that our proposed algorithm can accurately identify attributed communities and significantly outperform the state-of-the-art methods.
Keywords:community detection  seed  colored random walk  parallel conductance  
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机工程与科学》浏览原始摘要信息
点击此处可从《计算机工程与科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号