ＸＭＬ搜索引擎研究 Research of XML Search Engine期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

ＸＭＬ搜索引擎研究

引用本文：	王海波,姜吉发,耿晖,白硕,祝明发. ＸＭＬ搜索引擎研究[J]. 计算机应用研究, 2001, 18(4): 68-71

作者姓名：	王海波姜吉发耿晖白硕祝明发

作者单位：	1. 中国科学院计算技术研究所, 2. 中国国家智能信息中心,

摘要：	ＷＷＷ上大量信息的涌现，对信息的查询提出了严峻的挑战，ＸＭＬ作为一种扩展标记语言，具有多ＨＴＭＬ所不具备的优点，使得开展ＷＷＷ上的深层应用成为可能，对基于ＸＭＬ的搜索引擎中涉及的关键技术进行了研究，并提出了对ＸＭＬ这种半结构化文化档建立索引和查询时采用的数据结构和算法，它在不丢失文档中结构信息的情况下，充分利用ＸＭＬ的标签所带来的上下文信息，能够大幅度提高查询的准确率。
关键词：	XML 搜索引擎信息检索 WWW Internet
文章编号：	1001-3695(2001)04-0068-04
修稿时间：	2000-08-04
Research of XML Search Engine

WANG Hai-bo,JIANG Ji-fa,GENG Hui,Bai Shuo,ZHU Ming-fa. Research of XML Search Engine[J]. Application Research of Computers, 2001, 18(4): 68-71

Authors:	WANG Hai-bo JIANG Ji-fa GENG Hui Bai Shuo ZHU Ming-fa

Abstract:	In recent years, many documents are beginning to be provided in the structured format of XML. However, conventional information retrieval techniques do not scale up well in XML documents. This paper gave a research to several key techniques inside XML. search engine. It utilizes the context information in XML document to promote the ratio of accuracy of query. It discussed the details of spider technique and structure of the index file. It Saved the hierarchy relation of tag with a low storing overhead.

Keywords:	XML Search engine Spider Inverted file
本文献已被 CNKI 维普万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏