
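Microblog community discovery algorithm based on topic and link analysis
Cite this article: YAN Guang-hui, SHU Xin, MA Zhi-cheng, LI Xiang. Microblog community discovery algorithm based on topic and link analysis[J]. Application Research of Computers (计算机应用研究), 2013, 30(7): 1953-1957
Authors: YAN Guang-hui (闫光辉), SHU Xin (舒昕), MA Zhi-cheng (马志程), LI Xiang (李祥)
Affiliations: 1. College of Electronic & Information Engineering, Lanzhou Jiaotong University, Lanzhou 730070; 2. Gansu Electric Power Information & Communication Center, Lanzhou 730050
Funding: National Natural Science Foundation of China (61163010); Gansu Province Longyuan Young Innovative Talents Support Program (252003); Lanzhou Science and Technology Plan Project (2008-1-28); Gansu Electric Power Information & Communication Center Project (KJ[2012] No. 80)
Abstract: Traditional community discovery methods mostly rely on either link relationships or topic relationships alone, and they do not account for the restrictions on obtaining microblog users' social information, so they cannot effectively identify multiple communities in a microblog network. To address this, the paper proposes a microblog community discovery algorithm that combines topic analysis and link analysis to mine multiple communities. The algorithm first studies the link characteristics and post-topic characteristics of microblog users and defines formulas for link relevance and topic relevance. It then derives a total user relevance formula, uses it to compute the transfer probability between nodes, and classifies users with an improved label propagation algorithm. The result is a partition into groups of users who share similar interests and have close social ties. Simulation experiments on a real dataset verify the soundness and effectiveness of the method.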
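
The abstract does not reproduce the relevance formulas, so the following Python sketch only illustrates the general shape of the pipeline: a link score and a topic score are blended into a total relevance, which is then normalized per user into transfer probabilities. Everything specific here is an assumption made for illustration and not the authors' definition: the follow-based link score (1.0 mutual, 0.5 one-way), cosine similarity over per-user topic distributions (such as those an LDA model might produce), the blending weight `alpha`, and the function names.

```python
import math


def cosine(p, q):
    """Cosine similarity between two topic-distribution vectors."""
    dot = sum(a * b for a, b in zip(p, q))
    norm = math.sqrt(sum(a * a for a in p)) * math.sqrt(sum(b * b for b in q))
    return dot / norm if norm else 0.0


def link_relevance(follows, u, v):
    """Assumed link score: 1.0 for a mutual follow, 0.5 for one-way, 0.0 for none."""
    u_follows_v = v in follows.get(u, set())
    v_follows_u = u in follows.get(v, set())
    return (u_follows_v + v_follows_u) / 2.0


def transfer_probabilities(follows, topics, alpha=0.5):
    """Blend link and topic relevance, then normalize each user's row to sum to 1.

    follows: dict mapping a user to the set of users they follow.
    topics:  dict mapping a user to a topic-distribution vector.
    alpha:   hypothetical weight on the link score (1 - alpha goes to topics).
    """
    users = sorted(topics)
    probs = {}
    for u in users:
        weights = {}
        for v in users:
            if v == u:
                continue
            total_relevance = (alpha * link_relevance(follows, u, v)
                               + (1 - alpha) * cosine(topics[u], topics[v]))
            if total_relevance > 0:
                weights[v] = total_relevance
        row_sum = sum(weights.values())
        probs[u] = {v: w / row_sum for v, w in weights.items()} if row_sum else {}
    return probs


if __name__ == "__main__":
    follows = {"a": {"b"}, "b": {"a"}, "c": {"a"}}
    topics = {"a": [0.9, 0.1], "b": [0.8, 0.2], "c": [0.1, 0.9]}
    for user, row in transfer_probabilities(follows, topics).items():
        print(user, row)
```

In this toy data, user `a` is more likely to pass its label to `b` (mutual follow, similar topics) than to `c` (one-way follow, dissimilar topics), which is the behavior the combined relevance is meant to capture.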

Keywords: microblog; community discovery; latent Dirichlet allocation (LDA); topic model; link analysis; label propagation algorithm

Community discovery for microblog based on topic and link analysis
YAN Guang-hui, SHU Xin, MA Zhi-cheng, LI Xiang. Community discovery for microblog based on topic and link analysis[J]. Application Research of Computers, 2013, 30(7): 1953-1957
Authors:YAN Guang-hui  SHU Xin  MA Zhi-cheng  LI Xiang
Affiliation:1. College of Electronic & Information Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China; 2. Gansu Electric Power Information & Communication Center, Lanzhou 730050, China
Abstract: Traditional community discovery algorithms are generally based on either links or interests alone and do not take into account the limits on obtaining microblog users' social information, so they cannot detect multiple communities effectively. This paper therefore proposes a microblog community discovery algorithm based on both links and topics. It first studies the characteristics of links and of post topics, then derives user relevance formulas, on the basis of which it calculates transfer probabilities and uses an improved label propagation algorithm to divide users into communities. Finally, it distinguishes clusters of people who have close relationships and similar interests. Simulation results on a real social dataset verify that the proposed method is reasonable and effective.
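
The abstract does not say how the label propagation algorithm was improved, so the sketch below substitutes a plain weighted, asynchronous label propagation as a stand-in. It assumes the input is a dictionary of per-user transfer probabilities (the name `probs`, the tie-breaking rule, and the `max_iters` and `seed` parameters are all illustrative choices, not taken from the paper).

```python
import random


def label_propagation(probs, max_iters=100, seed=0):
    """Weighted asynchronous label propagation (generic stand-in, not the paper's variant).

    probs: dict mapping each user to a dict {neighbor: transfer probability}.
    Returns a dict mapping each user to a community label.
    """
    rng = random.Random(seed)
    # Collect every user that appears as a key or as a neighbor.
    nodes = sorted(set(probs) | {v for row in probs.values() for v in row})
    labels = {u: u for u in nodes}  # every user starts in its own community

    for _ in range(max_iters):
        rng.shuffle(nodes)
        changed = False
        for u in nodes:
            # Sum the transfer probability carried by each neighboring label.
            scores = {}
            for v, p in probs.get(u, {}).items():
                scores[labels[v]] = scores.get(labels[v], 0.0) + p
            if not scores:
                continue
            best = max(scores.values())
            # Keep the current label when it is already among the best.
            if scores.get(labels[u], 0.0) == best:
                continue
            # Break remaining ties deterministically by the smallest label.
            labels[u] = min(lab for lab, s in scores.items() if s == best)
            changed = True
        if not changed:
            break
    return labels


if __name__ == "__main__":
    # Two tightly connected triangles joined by one weak c-d link.
    probs = {
        "a": {"b": 0.5, "c": 0.5},
        "b": {"a": 0.5, "c": 0.5},
        "c": {"a": 0.45, "b": 0.45, "d": 0.1},
        "d": {"e": 0.45, "f": 0.45, "c": 0.1},
        "e": {"d": 0.5, "f": 0.5},
        "f": {"d": 0.5, "e": 0.5},
    }
    print(label_propagation(probs))
```

Because the weak link carries far less probability than the links inside each triangle, labels do not cross it, and the two triangles end up as separate communities.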
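Keywords: microblog; community discovery; latent Dirichlet allocation (LDA); topic model; link analysis; label propagation algorithm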