首页 | 本学科首页   官方微博 | 高级检索  
     

基于互关联后继树的XML索引技术
引用本文:雷向欣,胡运发,杨智应,刘勇,张凯.基于互关联后继树的XML索引技术[J].计算机研究与发展,2005,42(7):1261-1271.
作者姓名:雷向欣  胡运发  杨智应  刘勇  张凯
作者单位:复旦大学计算机与信息技术系,上海,200433
基金项目:国家自然科学基金项目(60473070),国家“八六三”高技术研究发展计划基金项目(2001AA115020)
摘    要:提出了一种新的根树节点编码方法——基于叶序区间的节点编码(LOINS).编码方法只需对根树后序遍历一次即可完成,能实现常数时间内对任意两个树节点间前后代关系的判断.同时,结合互关联后继树模型(IRST)的标引性、可压缩性等特点,提出基于IRST的根树索引模型Ist3aRTI-Ⅰ,及对该模型空间优化的索引模型IstBaRTI-Ⅱ.IsBaRTI-Ⅰ,Ⅱ采用树节点名称(标签)及其在根树(XML文档树)中的出现计数索引节点间的父子关系和节点叶序区间编码,实现索引结构和节点编码的相互统一,IsBaRTI-Ⅰ,Ⅱ索引建立时间、空间代价小,可快速查询满足XPath表达式在XML文档树中的节点序列和路径.

关 键 词:XML  XPath  互关联后继树  索引  查询

XML Indexing Technology Based on IRST
Lei Xiangxin,Hu Yunfa,Yang Zhiying,Liu Yong,Zhang Kai.XML Indexing Technology Based on IRST[J].Journal of Computer Research and Development,2005,42(7):1261-1271.
Authors:Lei Xiangxin  Hu Yunfa  Yang Zhiying  Liu Yong  Zhang Kai
Abstract:A new numbering scheme for labeling rooted trees based on leaf order interval numbering scheme (LOINS) is proposed. The nodes in a rooted tree can be encoded in once traversal of the tree, and the ancestor-descendantship among nodes can be determined in constant time based on LOINS. Furthermore, IsBaRTI-I, a novel index for rooted tree structure data model, is proposed which takes the advantages of IRST, such as indexing and compressibility. IsBaRTI-II, the space optimization version of IsBaRTI-I, is also introduced. IsBaRTI-I,II indexes the ancestor-descendantship among nodes and the LOINS number of node by the name (label) of the node and the count of its appearance in the rooted tree. In this way, indexing structure and numbering scheme becomes a unit unity. Theory analysis and experiment result illustrates that IsBaRTI-I,II needs more little time and capacity to be built; the node series and path matching XPath expressions can be obtained more quickly than the previous XML indexes through IsBaRTI-I,II.
Keywords:XML  XPath  inter-relevant successive trees  index  query
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号