A Model for Dynamic Information Updating in Lexicon Based Search Engine (一种基于词典的搜索引擎系统动态更新模型)

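Cite this article:	LEI Ming, LIU Jian-Guo, WANG Jian-Yong, CHEN Bao-Jue. 一种基于词典的搜索引擎系统动态更新模型 [A model for dynamic information updating in lexicon based search engine][J]. 计算机研究与发展 (Journal of Computer Research and Development), 2000, 37(10): 1265-1270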

Authors:	LEI Ming (雷鸣), LIU Jian-Guo (刘建国), WANG Jian-Yong (王建勇), CHEN Bao-Jue (陈葆珏)

Affiliation:	Department of Computer Science and Technology, Peking University, Beijing 100871, China

Funding:	Supported by the National "Ninth Five-Year Plan" Key Science and Technology Project (grant No. 96-743-01-05-01)

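Abstract:	Feature extraction based on word tagging is an effective method for Chinese information processing. However, words are extracted with the help of a lexicon, so the coverage of the lexicon determines the accuracy of word segmentation. Continuously learning new words and dynamically maintaining the lexicon, so that the whole Chinese information processing system becomes adaptive and dynamic, is therefore a key problem. Taking a search engine system as an example, the paper proposes a theoretical model and an implementation model for updating a search engine system as its lexicon changes. Experiments show that the model is highly effective at shortening the update time of the search engine's information database and at improving query precision.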
Keywords:	World Wide Web; lexicon; search engine system; Chinese information processing
A MODEL FOR DYNAMIC INFORMATION UPDATING IN LEXICON BASED SEARCH ENGINE

LEI Ming,LIU Jian-Guo,WANG Jian-Yong,CHEN Bao-Jue. A MODEL FOR DYNAMIC INFORMATION UPDATING IN LEXICON BASED SEARCH ENGINE[J]. Journal of Computer Research and Development, 2000, 37(10): 1265-1270

Authors:	LEI Ming, LIU Jian-Guo, WANG Jian-Yong, CHEN Bao-Jue

Abstract:	Lexicon-based feature extraction is an effective method in Chinese information processing, but it depends heavily on the lexicon used: the coverage of the lexicon determines the accuracy of word segmentation. It is therefore crucial to learn new words continuously and to update the lexicon dynamically, making the whole Chinese information processing system more adaptive and dynamic. This paper proposes a theoretical model and an implementation model for dynamic information updating in a lexicon-based search engine. Test results show that the model reduces the time needed to re-establish the information database and greatly improves the precision of the search engine.

Keywords:	search engine; natural language processing; Chinese information processing; World Wide Web
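
The abstract's central point, that lexicon coverage determines segmentation accuracy and that a lexicon change forces an index update, can be illustrated with a small sketch. This is not the authors' model, which the abstract does not specify. It assumes forward maximum matching as the segmenter (a common lexicon-based technique, swapped in here for illustration), a toy inverted index, and hypothetical names `segment`, `build_index`, and `add_word`. The update re-indexes only the documents that contain the new word rather than rebuilding the whole index.

```python
def segment(text, lexicon, max_len=4):
    """Forward maximum matching: at each position take the longest lexicon word."""
    tokens, i = [], 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            # Fall back to a single character when no lexicon word matches.
            if size == 1 or piece in lexicon:
                tokens.append(piece)
                i += size
                break
    return tokens


def build_index(docs, lexicon):
    """Inverted index: token -> set of ids of the documents containing it."""
    index = {}
    for doc_id, text in docs.items():
        for token in segment(text, lexicon):
            index.setdefault(token, set()).add(doc_id)
    return index


def add_word(word, docs, lexicon, index):
    """Add a word to the lexicon and re-index only the documents that contain it."""
    affected = {doc_id for doc_id, text in docs.items() if word in text}
    # Drop the stale postings of the affected documents.
    for postings in index.values():
        postings.difference_update(affected)
    lexicon.add(word)
    # Re-segment the affected documents with the enlarged lexicon.
    for doc_id in affected:
        for token in segment(docs[doc_id], lexicon):
            index.setdefault(token, set()).add(doc_id)
    # Remove tokens that no longer occur in any document.
    for token in [t for t, p in index.items() if not p]:
        del index[token]
    return affected


# Hypothetical two-document collection and starting lexicon.
docs = {1: "搜索引擎系统", 2: "中文信息处理"}
lexicon = {"搜索", "系统", "中文", "信息", "处理"}
index = build_index(docs, lexicon)

before = segment(docs[1], lexicon)   # ['搜索', '引', '擎', '系统']
affected = add_word("搜索引擎", docs, lexicon, index)
after = segment(docs[1], lexicon)    # ['搜索引擎', '系统']
```

Before the update, the out-of-lexicon word 搜索引擎 ("search engine") is broken into 搜索 plus two single characters, so a query for it cannot match an index term. After the update, only document 1 is re-segmented and document 2's postings are left untouched.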
This article is indexed in databases including CNKI, VIP (维普), and Wanfang Data (万方数据).
