首页 | 本学科首页   官方微博 | 高级检索  
     

一种有效的多关键词词频统计方法
引用本文:马志柔,叶屹.一种有效的多关键词词频统计方法[J].计算机工程,2006,32(10):191-192,203.
作者姓名:马志柔  叶屹
作者单位:北京化工大学信息科学与技术学院,北京,100029
摘    要:针对词频统计的特点,设计了一种多关键词词频统计方法。该方法以一种树形的数据结构来存储待处理关键词集合的信息。实现了多关键词的高效匹配,扫描一次文档就可统计出全部关健词词频信息。通过理论分析与实验表明,其性能比传统的关键词词频统计方法有较大的提高。

关 键 词:模式匹配  多关键词  词频统计
文章编号:1000-3428(2006)10-0191-02
收稿时间:2005-06-28
修稿时间:2005-06-28

An Efficient Approach for Counting Multiple Keywords Frequency
MA Zhirou,YE Yi.An Efficient Approach for Counting Multiple Keywords Frequency[J].Computer Engineering,2006,32(10):191-192,203.
Authors:MA Zhirou  YE Yi
Affiliation:School of Information Science and Technology, Beijing University of Chemical Technology, Beijing 100029
Abstract:For the characteristic of the word frequency statistic, this paper designs an approach for counting multiple keywords frequency. In this method, for taking full use of the redundancy information between keywords, it stores the set of keywords with a kind of data structure of tree form. This method realizes the matching high efficiently of many keywords. Scanning the file once is able to get the information of frequency of all keywords. Through theory analysis and experiment result, its performance is more efficient than other algorithms.
Keywords:Pattern lnatching  Multiple keywords  Word frequency slatistic
本文献已被 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号