首页 | 本学科首页   官方微博 | 高级检索  
     

信息检索中主题式词典的构建方法
引用本文:许静芳,李星,李粤.信息检索中主题式词典的构建方法[J].计算机工程,2005,31(21):143-145.
作者姓名:许静芳  李星  李粤
作者单位:清华大学电子工程系,北京,100084;清华大学电子工程系,北京,100084;清华大学电子工程系,北京,100084
摘    要:提出了一种基于用户查询日志的主题式词典的构建方法,用于中文信息检索中分词。利用互信息从用户查询日志中提取短语并与通用词典相结合构建主题式词典。该词典能提高信息检索的准确率和效率,并有助于解决未登录词问题。

关 键 词:主题式词典  信息检索  中文分词  短语提取
文章编号:1000-3428(2005)21-0143-03
收稿时间:2004-09-21
修稿时间:2004-09-21

A Topic-specific Dictionary Construction Algorithm for Information Retrieval
XU Jingfang,LI Xing,LI Yue.A Topic-specific Dictionary Construction Algorithm for Information Retrieval[J].Computer Engineering,2005,31(21):143-145.
Authors:XU Jingfang  LI Xing  LI Yue
Affiliation:Department of Electronics Engineering, Tsinghua University, Beijing 100084
Abstract:This paper proposes a novel algorithm to construct a topic-specific dictionary, based on user query log, for information retrieval, According to their mutual information, phrases are extracted from the log and they are combined with a general dictionary to construct a topic-specific dictionary. The experiment result shows that the constructed dictionary greatly improves the retrieval performance and helps to detect many out-of-vocabulary words.
Keywords:Topic-specific dictionary  Information retrieval  Chinese word segmentation  Phrase extraction
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号