
Summary-Based Information Retrieval Model

Cite this article:	LI Wei-Jiang, ZHAO Tie-Jun, ZANG Wen-Mao. Summary-Based Information Retrieval Model[J]. Journal of Software, 2008, 19(9): 2329-2338

Authors:	LI Wei-Jiang, ZHAO Tie-Jun, ZANG Wen-Mao

Affiliation:	School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, Heilongjiang, China

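Funding:	National Natural Science Foundation of China; National High Technology Research and Development Program of China (863 Program)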

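Abstract:	Summary-based retrieval rests on one hypothesis: terms that appear in a document's summary express the topic of the document better than terms that do not, and therefore contribute more to retrieval. Two summary-based language models for retrieval are proposed. The first (SQL) replaces the document model with the summary model and ranks documents directly; the second (SBDM) uses the summary model to smooth the document model. Experiments on TREC collections show that the proposed models improve retrieval performance, and that SBDM consistently matches or outperforms the traditional standard document-based retrieval model. The paper makes two contributions: it proposes retrieval-oriented summary extraction methods and examines how they affect retrieval performance, and it proposes a new retrieval model, the summary-based retrieval model.
Keywords:	information retrieval; language model; summarization; summary-based retrieval model; smoothing method
Received:	2007-06-14
Revised:	2007-09-30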
Summary-Based Information Retrieval Model

LI Wei-Jiang,ZHAO Tie-Jun and ZANG Wen-Mao. Summary-Based Information Retrieval Model[J]. Journal of Software, 2008, 19(9): 2329-2338

Authors:	LI Wei-Jiang, ZHAO Tie-Jun, ZANG Wen-Mao

Abstract:	Summary-based retrieval rests on the hypothesis that terms appearing in a document's summary are more important than terms that do not. Recent developments in the language modeling approach to information retrieval have motivated the study of this problem within that framework. Two approaches to summary-based retrieval are investigated: ranking documents directly by their summaries (SQL) and smoothing document models with summaries (SBDM). Results on TREC collections show that the proposed summary-based models perform consistently across collections and can yield significant improvements over document-based retrieval. The paper makes two main contributions. First, retrieval-oriented summarization methods are examined, together with their effect on retrieval performance. Second, a new model for summary-based information retrieval is proposed.

Keywords:	information retrieval; language model; summarization; summary-based model; smoothing method
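
The idea of smoothing a document model with a summary model can be illustrated with a small sketch. The abstract does not give the paper's formulas, so everything below is an assumption made for illustration: the three-way linear interpolation, the weights `lam` and `beta`, and the function names `mle` and `log_query_likelihood` are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def mle(tokens):
    """Maximum-likelihood unigram model: term -> relative frequency."""
    counts = Counter(tokens)
    total = sum(counts.values())
    return {term: count / total for term, count in counts.items()}


def log_query_likelihood(query, doc, summary, collection, lam=0.6, beta=0.3):
    """Return log P(query | document) under an illustrative mixture model.

    Assumed form (not from the paper):
        P(t) = lam * P_doc(t) + beta * P_summary(t) + (1 - lam - beta) * P_collection(t)
    With lam = 0 the document is ranked from its summary alone, which is
    in the spirit of ranking documents directly by their summaries.
    """
    if lam < 0 or beta < 0 or lam + beta > 1:
        raise ValueError("lam and beta must be non-negative and sum to at most 1")
    p_doc, p_sum, p_col = mle(doc), mle(summary), mle(collection)
    gamma = 1.0 - lam - beta  # weight left for the collection model
    score = 0.0
    for term in query:
        p = (lam * p_doc.get(term, 0.0)
             + beta * p_sum.get(term, 0.0)
             + gamma * p_col.get(term, 0.0))
        if p == 0.0:
            # Term unseen everywhere: the query has zero likelihood.
            return float("-inf")
        score += math.log(p)
    return score
```

Under these assumptions, two documents with identical text receive different scores when only one of them has the query term in its summary, which is the behavior the hypothesis in the abstract calls for.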
This article is indexed in CNKI, VIP, Wanfang Data, and other databases.