首页 | 本学科首页   官方微博 | 高级检索  
     

基于特征映射的微博用户标签兴趣聚类方法
引用本文:秦雨,余正涛,王炎冰,石林宾,潘华山.基于特征映射的微博用户标签兴趣聚类方法[J].数据采集与处理,2015,30(6):1246-1252.
作者姓名:秦雨  余正涛  王炎冰  石林宾  潘华山
作者单位:1.昆明理工大学信息工程与自动化学院,昆明,650500; 2.昆明理工大学智能信息处理重点实验室,昆明,650500
摘    要:针对现有的用户兴趣聚类方法没有考虑用户标签之间存在的语义相关性问题,提出了一种基于特征映射的微博用户标签兴趣聚类方法。首先,获取待分析用户及其所关注用户的用户标签,选取出现频数高于设定阈值的标签构建模糊矩阵的特征维;然后,考虑标签之间的语义相关性,利用特征映射的思想将用户标签根 据其与特征维标签之间的语义相似度映射到每个特征维下,计算每个特征维所对应的特征值;最后,利用模糊聚类得到了不同阈值下的用户兴趣聚类结果。实验结果表明,本文提出的基于特征映射的微博用户标签兴趣聚类方法有效地改善了用户兴趣聚类效果。

关 键 词:微博  特征映射  模糊聚类  语义相似度

Micro blog User Label Interest Clustering Method Based on Feature Mapping
Qin Yu,Yu Zhengtao,Wang Yanbin,Shi Linbin.Micro blog User Label Interest Clustering Method Based on Feature Mapping[J].Journal of Data Acquisition & Processing,2015,30(6):1246-1252.
Authors:Qin Yu  Yu Zhengtao  Wang Yanbin  Shi Linbin
Affiliation:1. Institute of Information Engineering and Automation, Kunming University of S cience and Technology, Kunming, 650500, China; 2. Key Laboratory of Intelligent Information Processing, Kunming University of S cience and Technology, Kunming, 650500, China
Abstract:Since many methods for cluster user interest does not consider the semantic similarity of the user labels, a micro-blog user label interest clustering method is introduced based on feature mapping. Firstly, the user labels of the target users and their focus users are obtained, then the labels with the higher frequency than the threshold value is chosen. Therefore, a feature space is created. Secondly, the user labels are mapped to the feature space by calculating the semantic similarity based on the feature mapping. Finally, the fuzzy clustering is utilized to obtain the clustering result of different threshold value. Experimental results show that the method greatly improves the clustering accuracy rate for user interest clustering.
Keywords:micro blog  feature mapping  fuzzy clustering  semantic similarity
本文献已被 万方数据 等数据库收录!
点击此处可从《数据采集与处理》浏览原始摘要信息
点击此处可从《数据采集与处理》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号