Research on Community Detection Algorithm Based on Meta Path in Heterogeneous Information Network
Cite this article: ZHENG Yuyan, WANG Mingsheng, SHI Chuan, WANG Rui. Research on Community Detection Algorithm Based on Meta Path in Heterogeneous Information Network[J]. Journal of Chinese Information Processing, 2018, 32(9): 132-142.
Authors: ZHENG Yuyan  WANG Mingsheng  SHI Chuan  WANG Rui
Affiliation: 1. School of Computer Science, Beijing University of Posts and Telecommunications, Beijing 100876, China;
2. Geographic Information Center, Guangzhou City Planning Survey and Design Institute, Guangzhou, Guangdong 510060, China
Funding: National Key Basic Research and Development Program of China (2017YFB0803304); National Natural Science Foundation of China (61772082, 61375058); Beijing Municipal Natural Science Foundation (4182043)
Abstract: Real networked data often contain multiple types of objects and relations, which are better modeled as a heterogeneous information network; as a result, heterogeneous information network analysis has gradually become a research hotspot in data mining. Although community detection in homogeneous information networks has been studied intensively, it has rarely been studied in heterogeneous information networks. In this paper, we study the community detection problem in heterogeneous information networks and propose a novel meta-path-based algorithm framework called HCD (heterogeneous community detection). The framework consists of two parts: HCD_sgl, a community detection algorithm based on a single meta path, and HCD_all, a community detection algorithm that combines multiple meta paths. HCD_sgl first determines the initial community labels of all nodes under a given meta path, then detects the final community structure with an improved label propagation algorithm. HCD_all builds on HCD_sgl and fuses the community detection results obtained from multiple meta paths. Experiments on real and artificial datasets demonstrate that the proposed method detects community structures in heterogeneous information networks effectively.
Keywords: heterogeneous information network  community detection  meta path  semantic similarity measure
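
The single-meta-path idea in the abstract can be illustrated with a minimal sketch. It is not the authors' HCD_sgl: the abstract does not specify how initial labels are chosen or how label propagation is improved, so the sketch below assumes the simplest possible choices (every node starts with its own label, plain weighted label propagation with deterministic tie-breaking). The Author-Paper-Author meta path, the toy data, and the function names `meta_path_weights` and `label_propagation` are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict
from itertools import combinations


def meta_path_weights(paper_authors):
    """Count Author-Paper-Author path instances between each pair of authors."""
    weights = defaultdict(lambda: defaultdict(int))
    for authors in paper_authors.values():
        for a, b in combinations(sorted(set(authors)), 2):
            weights[a][b] += 1
            weights[b][a] += 1
    return weights


def label_propagation(weights, nodes, max_rounds=20):
    """Plain weighted label propagation (an assumption, not the paper's improved variant)."""
    labels = {n: n for n in nodes}  # assumption: every node starts in its own community
    for _ in range(max_rounds):
        changed = False
        for n in sorted(nodes):  # fixed visiting order keeps the result deterministic
            scores = defaultdict(int)
            for neighbor, w in weights.get(n, {}).items():
                scores[labels[neighbor]] += w
            if not scores:
                continue  # an isolated node keeps its own label
            # Highest total weight wins; ties go to the smallest label.
            best = min(scores, key=lambda lab: (-scores[lab], lab))
            if best != labels[n]:
                labels[n] = best
                changed = True
        if not changed:
            break
    return labels


# Hypothetical toy data: which authors wrote which paper.
paper_authors = {
    "p1": ["a1", "a2"],
    "p2": ["a1", "a2", "a3"],
    "p3": ["a2", "a3"],
    "p4": ["a4", "a5"],
    "p5": ["a4", "a5"],
}
weights = meta_path_weights(paper_authors)
authors = sorted({a for group in paper_authors.values() for a in group})
labels = label_propagation(weights, authors)
print(labels)
```

On this toy input the authors a1, a2, and a3 end up sharing one label and a4 and a5 share another, because co-authorship links exist only within each group. Fusing results from several meta paths, as HCD_all does, is left out because the abstract does not describe the fusion rule.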