首页 | 本学科首页   官方微博 | 高级检索  
     

利用博客链接平台选取联合关键字的博客聚类研究
引用本文:王琦,霍纬纲.利用博客链接平台选取联合关键字的博客聚类研究[J].计算机应用研究,2017,34(12).
作者姓名:王琦  霍纬纲
作者单位:运城学院 计算机科学与技术系,中国民航大学 计算机科学与技术学院
基金项目:国家自然科学青年基金(No.61301245)
摘    要:针对全文本关键字检索的时间成本高,以及采用标签/类别会产生语句歧义和同义词等问题,提出在博客链接平台上选取联合关键字进行博客聚类。假设一个博客文章被查询的候选关键字(或者联合关键字)可以用于表示这个博客文章的主题。为验证该假设,首先将跟踪代码嵌入到博客链接(BC)组件中,以收集读者查询的关键字。然后,选取适当的候选关键字作为联合关键字。最后,使用重叠投影、交互信息投影、分布式分布信息和肯德尔 系数这四种相似性度量以验证BC组件提取的联合关键字。实验结果表明,提出的方法可以为查询者提供一条找到对应博客的快速通道。此外,生成的联合关键字可以减少全文本关键字检索过程的复杂度和冗余度,很好地满足了博客用户的需求。

关 键 词:关键字提取  博客链接平台(BC)  博客聚类  联合关键字  相似性
收稿时间:2016/11/17 0:00:00
修稿时间:2017/10/23 0:00:00

The Research of Blog Clustering Method by Selecting Joint Keywords on Blog Connect Platform
wangqi and HUO Weigang.The Research of Blog Clustering Method by Selecting Joint Keywords on Blog Connect Platform[J].Application Research of Computers,2017,34(12).
Authors:wangqi and HUO Weigang
Affiliation:Yuncheng University,
Abstract:Concerning that the time cost of full-text keyword search time is high, and the label / category statement will produce ambiguity and synonyms problems, the way to select joint keywords in the Blog Connect platform for blog clustering is proposed. It is assumed that the candidate keywords of a blog post by the query (or joint keyword) can be used to represent the theme of this blog. In order to verify this assumption, a tracing code is embedded in Blog Connect firstly so as to collecting the keywords queried by readers. Then, FKRP is used to select candidate keywords as co-keywords. Finally, Similarity measures, including overlapping projection, mutual information projection, distributed information and the Kendall coefficient is used to validate the BC component extraction. The experimental results show that the proposed method can provide a fast channel for the query to find the corresponding blog. In addition, the joint key generation can reduce the search process complexity and redundancy, which could well meet the needs of blog users.
Keywords:
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号