首页 | 本学科首页   官方微博 | 高级检索  
     

基于主题特征的关键词抽取
引用本文:刘 俊,邹东升,邢欣来,李英豪.基于主题特征的关键词抽取[J].计算机应用研究,2012,29(11):4224-4227.
作者姓名:刘 俊  邹东升  邢欣来  李英豪
作者单位:重庆大学 计算机学院,重庆,400044
基金项目:中国博士后科学基金资助项目(20110490807); 中央高校基金科研业务资助项目(CDJXS10181131)
摘    要:为了使抽取出的关键词更能反映文档主题,提出了一种新的词的主题特征(topic feature,TF)计算方法,该方法利用主题模型中词和主题的分布情况计算词的主题特征。并将该特征与关键词抽取中的常用特征结合,用装袋决策树方法构造一个关键词抽取模型。实验结果表明提出的主题特征可以提升关键词抽取的效果,同时验证了装袋决策树在关键词抽取中的适用性。

关 键 词:关键词抽取  主题特征  主题模型  装袋决策树

Keyphrase extraction based on topic feature
LIU Jun,ZOU Dong-sheng,XING Xin-lai,LI Ying-hao.Keyphrase extraction based on topic feature[J].Application Research of Computers,2012,29(11):4224-4227.
Authors:LIU Jun  ZOU Dong-sheng  XING Xin-lai  LI Ying-hao
Affiliation:College of Computer, Chongqing University, Chongqing 400044, China
Abstract:Keyphrase extraction is a process for extracting a set of terms from a document. This paper proposed a novel topic feature for extracting keyphrase. This topic feature was computed based on topic model which modeled the topic-word distributions and the topic distributions of document. Moreover, it proposed a keyphrase extraction approach based on bagged decision trees. This approach jointed common features and the proposed topic feature. Experimental results demonstrate that the proposed topic feature can make an improvement for keyphrase extraction. At the mean time, an effective performance can be achieved by the bagged decision trees based approach.
Keywords:keyphrase extraction  topic feature  topic model  bagged decision trees
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号