首页 | 本学科首页   官方微博 | 高级检索  
     

基于PATRICIA-TRIES的XML路径索引设计
引用本文:易平,胡运安,陈福生,张世永.基于PATRICIA-TRIES的XML路径索引设计[J].小型微型计算机系统,2006,27(3):474-480.
作者姓名:易平  胡运安  陈福生  张世永
作者单位:1. 上海交通大学,信息安全工程学院,上海,200030
2. 复旦大学,计算机与信息技术系,上海,200433
3. 同济大学,计算机科学与技术系,上海,200433
基金项目:中国科学院资助项目;国家重大项目
摘    要:随着XML逐渐成为Internet数据表示与交换的标准,如何快速准确地访问XML文档中的数据已成为亟待解决的关键问题,建立路径索引是提高查询效率的一种重要手段.本文设计了一种基于PATRICIA-TRIES的路径索引,简称PT索引.该索引有如下特点:一、基于PATRICIA-TRIES结构,实现快速检索.二、采用压缩编码能够将路径索引放入内存,三、索引含有结构和文本信息,通过查询索引就能提供结果,无需打开原文档.其后,分析了PT索引的时间和空间复杂性,并与三种的典型的索引结构进行了对比实验,结果证明了其在路径查询方面具有更高的效率.

关 键 词:查询  路径索引
文章编号:1000-1220(2006)03-0474-07
收稿时间:11 16 2004 12:00AM
修稿时间:2004-11-162005-06-15

Path Index for XML Data Based on PATRICIA-TRIES
YI Ping,HU Yun-an,Chen Fu-shen,ZHANG Shi-yong.Path Index for XML Data Based on PATRICIA-TRIES[J].Mini-micro Systems,2006,27(3):474-480.
Authors:YI Ping  HU Yun-an  Chen Fu-shen  ZHANG Shi-yong
Affiliation:1 School of Information Engineering, Shanghai Jiaotong University, Shanghai 200030, China; 2 Department of Computing and Information Technology Fudan University, Shanghai 200433, China; 3 Department of Computer Science and Technology Tongji University, Shanghai 200433, China
Abstract:with the advent of XML as a standard for data representation and exchange on the Internet, storing and querying XML data becomes more and more important. This poses a new challenge concerning indexing and searching XML data, because conventional approaches based on relational model may not meet the processing requirements for XML data. In this paper, we propose a path index based on PATRICIA tries, namely PT index. Our PT index structure offers several novel features. First, it can support to fast search data by its structure based on PATRICIA-tries. Second, the path indexes are compressed so that they can be store in memory. Thirdly, because PT index includes structure and text of XML data, we can achieve results form the PT index without reading original XML data. We address time complexity and space complexity of PT index. Experimental results from our prototype system implementation show that the PT index can outperform some representative index approaches, such as DataGuide, B+tree index and Index Fabric.
Keywords:XML  PATRICIA-TRIES
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号