首页 | 本学科首页   官方微博 | 高级检索  
     

一种基于词典的搜索引擎系统动态更新模型
引用本文:雷鸣,刘建国,王建勇,陈葆珏. 一种基于词典的搜索引擎系统动态更新模型[J]. 计算机研究与发展, 2000, 37(10): 1265-1270
作者姓名:雷鸣  刘建国  王建勇  陈葆珏
作者单位:北京大学计算机科学技术系,北京,100871
基金项目:国家“九五”重点科技攻关项目基金资助!(项目编号 96 -743 -0 1-0 5 -0 1)
摘    要:基于词汇标注的特征项提取方法是中文信息处理的有效方法,但词汇的析取是基于词典的,词典的涵盖程度决定了词汇切分的准确率,因而不断地学习新词汇、动态地维护词典,使整个中文信息处理系统具有自适应性和动态性就成了一个关键问题,以搜索引擎系统为例,提出了一种基于词典动态变化的搜索引擎系统更新理论模型和实现模型,相关实验表明,该模型对缩短搜索引擎信息库的更新时间、提高查询准确率等方面十分有效。

关 键 词:万维网 词典 搜索引擎系统 中文信息处理

A MODEL FOR DYNAMIC INFORMATION UPDATING IN LEXICON BASED SEARCH ENGINE
LEI Ming,LIU Jian-Guo,WANG Jian-Yong,CHEN Bao-Jue. A MODEL FOR DYNAMIC INFORMATION UPDATING IN LEXICON BASED SEARCH ENGINE[J]. Journal of Computer Research and Development, 2000, 37(10): 1265-1270
Authors:LEI Ming  LIU Jian-Guo  WANG Jian-Yong  CHEN Bao-Jue
Abstract:Lexicon based feature extraction is an effective method in Chinese information processing. But it highly depends on the lexicon used. The coverage of a lexicon determines the correctness of word segmentation. Therefore, it is crucial to learn new words continuously and to update lexicon dynamically, making the whole Chinese information processing system more adaptive and dynamic. Proposed in this paper are an innovative theoretical model and the implementation model for dynamic information updating in lexicon based search engine. The results of testing show that this model can reduce the time for re establishing the information database and can greatly improve the precision of a search engine.
Keywords:search engine   natural language processing   Chinese information processing   World Wide Web
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号