基于N-Level VSM在Web信息检索中的研究 Study of Web Information Retrieval Based on N-Level Vector Space Model期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于N-Level VSM在Web信息检索中的研究

引用本文：	付克志,林鸿飞. 基于N-Level VSM在Web信息检索中的研究[J]. 计算机工程与应用, 2006, 42(19): 158-160,179

作者姓名：	付克志林鸿飞

作者单位：	大连理工大学计算机系,大连,116024;大连理工大学计算机系,大连,116024

摘要：	分析了传统向量空间检索模型在Web信息检索中的不足,给出了基于N-Level向量空间模型,这种模型是将一篇文档从逻辑上划分为N个相对独立的文本段,然后按照文本段的内容建立文本特征向量以及文本权值向量,在此基础上可以更加精确地定义特征值向量和相似度的计算方法,使之能比较好地适应文档集合的动态扩充。同时进行了两种模型算法时间的复杂度的比较分析。理论分析和实验结果表明,基于此模型实现的信息检索算法具有较快的查找速度和较高的查准率。
关键词：	向量空间模型查全率查准率相似性时间复杂度
文章编号：	1002-8331-（2006）19-0158-03
收稿时间：	2005-11-01
修稿时间：	2005-11-01
Study of Web Information Retrieval Based on N-Level Vector Space Model

Fu Kezhi,Lin Hongfei. Study of Web Information Retrieval Based on N-Level Vector Space Model[J]. Computer Engineering and Applications, 2006, 42(19): 158-160,179

Authors:	Fu Kezhi Lin Hongfei

Affiliation:	Department of Computer Science, Dalian University of Technology, Dalian 116024

Abstract:	Based on the analysis of the deficiency of the traditional vector space retrieval model,the N-level vector model is proposed.The N-level vector model partitions a document into N level text paragraphs.The text feature vectors and the text weight vectors are defined according to the text paragraphs' context.The calculation method of the feature vectors and the similarity are defined much more precisely such that the algorithm can adapt the dynamic extension of the document set.Meanwhile the time complexity of the algorithm is analyzed between the models.The theoretic analysis and the experimental results show that the new algorithm has higher precision and faster computation speed.

Keywords:	Vector Space Model(VSM) recall precision similarity time complexity
本文献已被 CNKI 维普万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏