19 similar documents found (search time: 78 ms)
1.
2.
3.
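Existing encodings of XML document trees require considerable storage space and are difficult to update dynamically. Building on DeweyBFN, this work proposes a new prime-based dynamic encoding method for XML. The method uses a subtree-domain mapping mechanism that avoids large primes and keeps the average label length small. Theoretical analysis and experimental results show that the encoding effectively reduces label storage space, offers high query efficiency, and supports dynamic updates of XML data.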
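The abstract above does not spell out its labeling rules, so the following is only a minimal sketch of the classic prime-product idea that prime-based schemes build on, not the DeweyBFN-based method itself. It assumes each node's label is its parent's label multiplied by a fresh prime; the names `label_tree` and `is_ancestor` and the sample tree are hypothetical.

```python
from itertools import count

def primes():
    """Yield 2, 3, 5, ... by trial division (adequate for a small demo)."""
    found = []
    for n in count(2):
        if all(n % p for p in found):
            found.append(n)
            yield n

def label_tree(tree, root):
    """Give the root label 1 and every other node parent_label * fresh_prime."""
    gen = primes()
    labels = {root: 1}
    stack = [root]
    while stack:
        node = stack.pop()
        for child in tree.get(node, []):
            labels[child] = labels[node] * next(gen)
            stack.append(child)
    return labels

def is_ancestor(labels, a, d):
    """a is a proper ancestor of d iff label(a) divides label(d)."""
    return a != d and labels[d] % labels[a] == 0

# Hypothetical document tree given as parent -> list of children.
tree = {"book": ["title", "chapter"], "chapter": ["section"]}
labels = label_tree(tree, "book")
```

Because labels are products, they grow quickly with depth and fan-out, which is the cost that a mapping onto small per-subtree primes is meant to reduce.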
4.
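Chen Fuqiang (陈富强) 《数字社区&智能家居》2007,2(10):1025-1026
Introduces the concept of the ratio query (配比查询), analyzes the shortcomings of existing XML encoding schemes when applied to such queries, proposes a new XML encoding scheme, and gives the corresponding query algorithm.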
5.
6.
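Inserting into an XML document raises the problem of adjusting node labels, and many proposed encoding schemes cannot support both XPath queries and document updates well. After analyzing existing schemes, this work proposes an encoding scheme based on complete trees. The scheme uses two kinds of redundancy, redundant sequence numbers and virtual nodes; it supports XPath queries and effectively lowers the rate at which node insertions force the XML document to be re-encoded. Experimental results show that complete trees and the corresponding encoding effectively improve the efficiency of inserting nodes into XML documents.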
7.
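BSC: An Efficient Dynamic XML Tree Encoding Scheme (cited by 1; self-citations: 0; citations by others: 1)
Determining whether a given structural relationship holds between any two nodes of an XML document is an important part of XML query processing. An XML tree encoding scheme assigns each node a unique label, so that structural relationships between nodes can be determined quickly by comparing labels alone, without accessing the original document. As XML applications become more widespread, efficient support for update operations has become an important research topic for XML tree encoding schemes. Based on the properties of binary fractions, this paper proposes a new XML tree encoding scheme, BSC, which fully and efficiently supports XML update operations without re-encoding. Experimental results show that, compared with existing dynamic encoding schemes, BSC performs well in both static encoding and dynamic updates.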
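Why binary fractions help with updates can be illustrated with a small sketch. This is an assumption-laden illustration, not the BSC algorithm: it only shows that a new order label can always be placed strictly between two existing binary-fraction labels, so neighbours never need relabeling. The helpers `between` and `to_binary` are hypothetical names.

```python
from fractions import Fraction

def between(left, right):
    """Return an order label strictly between two existing labels."""
    return (left + right) / 2

def to_binary(frac):
    """Render a dyadic fraction in (0, 1) as a binary string such as '0.011'.

    Terminates only when the denominator is a power of two, which holds
    for midpoints of dyadic fractions.
    """
    bits = []
    while frac:
        frac *= 2
        bit = int(frac >= 1)
        bits.append(str(bit))
        frac -= bit
    return "0." + "".join(bits)

# Three sibling positions, then an insertion between the first two.
siblings = [Fraction(1, 4), Fraction(1, 2), Fraction(3, 4)]
new_label = between(siblings[0], siblings[1])
new_bits = to_binary(new_label)
```

The trade-off in this naive version is that repeated insertions at the same position lengthen the label by one bit each time.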
8.
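A New Encoding Mechanism for XML Documents (cited by 7; self-citations: 1; citations by others: 7)
Implementing regular path expressions in XML queries requires quickly determining parent-child or ancestor-descendant relationships between elements. Encodings of XML documents based on tree traversal are currently the mainstream approach, but determining parent-child relationships requires auxiliary measures beyond the labels themselves, and some implementations do not support document updates. This work proposes a new encoding method that determines parent-child and ancestor-descendant relationships between two elements in constant time, computes the generation gap between an ancestor and a descendant node, and supports document updates.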
9.
10.
11.
XML data can be represented by a tree or graph structure, and XML query processing requires information about the structural relationships among nodes. The basic structural relationships are parent-child and ancestor-descendant, and finding all occurrences of these relationships in XML data is a core operation in XML query processing. Several node labeling schemes have been suggested that determine ancestor-descendant or parent-child relationships simply by comparing node labels. However, these schemes have disadvantages, such as a large number of nodes that must be relabeled when XML data is inserted, large space requirements for node labels, and inefficient processing of structural joins. In this paper, we propose the nested tree structure, which eliminates these disadvantages while retaining the advantages of previous node labeling schemes. The nested tree structure makes it possible to use a dynamic interval-based labeling scheme, which supports XML data updates with almost no node relabeling as well as efficient structural join processing. Experimental results show that our approach handles updates efficiently with the interval-based labeling scheme and significantly improves structural join performance compared with recent methods.
12.
Indexing and querying XML using extended Dewey labeling scheme (cited by 1; self-citations: 0; citations by others: 1)
Jiaheng Lu, Xiaofeng Meng, Tok Wang Ling 《Data & Knowledge Engineering》2011,70(1):35-59
Finding all the occurrences of a tree pattern in an XML database is a core operation for efficient evaluation of XML queries. The Dewey labeling scheme is commonly used to label an XML document to facilitate XML query processing by recording information on the path of an element. In order to improve the efficiency of XML tree pattern matching, we introduce a novel labeling scheme, called extended Dewey, which extends the existing Dewey labeling scheme to combine the types and identifiers of elements in a label, and avoids scanning labels for internal query nodes, which reduces the I/O cost of query processing. Based on extended Dewey, we propose a series of holistic XML tree pattern matching algorithms. We first present TJFast to answer an XML twig pattern query. To efficiently answer a generalized XML tree pattern, we then propose GTJFast, an optimization that exploits the non-output nodes. In addition, we propose TJFastTL and GTJFastTL, based on the tag + level data partition scheme, to further reduce I/O costs by level pruning. Finally, we report comprehensive experimental results showing that our set of XML tree pattern matching algorithms is superior to existing approaches in terms of the number of elements scanned, the size of intermediate results, and query performance.
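As background for the abstract above, here is a minimal sketch of how plain Dewey labels encode the path of an element. It covers only the basic Dewey scheme, not extended Dewey (which additionally encodes element types), and the function names are hypothetical.

```python
def parse(label):
    """Split a Dewey label such as '1.2.3' into a tuple of integers."""
    return tuple(int(part) for part in label.split("."))

def is_ancestor(a, d):
    """a is a proper ancestor of d iff a's components are a strict prefix of d's."""
    a, d = parse(a), parse(d)
    return len(a) < len(d) and d[:len(a)] == a

def is_parent(a, d):
    """a is the parent of d iff d is a's label plus exactly one more component."""
    a, d = parse(a), parse(d)
    return len(d) == len(a) + 1 and d[:-1] == a
```

Comparing parsed components rather than raw strings matters: "1.2" is a string prefix of "1.20.1", but node 1.2 is not an ancestor of node 1.20.1.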
13.
Jun-Ki Min 《Journal of Systems and Software》2009,82(3):503-515
In this paper, we propose an efficient encoding and labeling scheme for XML, called EXEL, which is a variant of the region labeling scheme using ordinal and insert-friendly bit strings. We devise a binary encoding method to generate the ordinal bit strings, and an algorithm that inserts a new bit string between existing bit strings without affecting their order. The binary encoding method and the bit string insertion algorithm are the basis for efficient query processing and the complete avoidance of re-labeling for updates. We present query processing and update processing methods based on EXEL. In addition, the Stack-Tree-Desc algorithm is used for an efficient structural join, and String B-tree indexing is used to improve join performance. Finally, the experimental results show that EXEL completely avoids re-labeling for updates while providing fairly reasonable query processing performance.
14.
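Update Methods for the Extended Preorder Encoding of XML Data (cited by 15; self-citations: 0; citations by others: 15)
Most XML query techniques rely on some encoding of the XML tree. Encoding an XML tree means assigning each node a unique label according to some rule, so that whether two nodes stand in an ancestor-descendant relationship can be decided directly from their labels. The most commonly used method is the region-based numbering scheme. XML data, however, is also subject to updates such as insertions and deletions. Once the data is updated, the region encoding must be adjusted accordingly to keep the indexes and query algorithms built on it correct. Little research has so far addressed the updating of encodings. This work studies the update problem for region encoding. Using the approach of reserving label space, it proposes a complete set of reservation algorithms and label update algorithms for XML data and application environments with different characteristics, and reports extensive experiments that test the effectiveness of these algorithms.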
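The idea of region encoding with reserved label space can be sketched as follows. This is an illustrative sketch under simple assumptions (a fixed gap between consecutive numbers), not the reservation or update algorithms of the paper; `label_regions`, `insert_leaf`, and the `gap` parameter are hypothetical.

```python
def label_regions(tree, root, gap=10):
    """Assign (start, end) to every node by DFS, advancing the counter by `gap`."""
    labels = {}
    counter = 0

    def visit(node):
        nonlocal counter
        counter += gap
        start = counter
        for child in tree.get(node, []):
            visit(child)
        counter += gap
        labels[node] = (start, counter)

    visit(root)
    return labels

def is_ancestor(labels, a, d):
    """a is an ancestor of d iff a's region strictly contains d's region."""
    (a_start, a_end), (d_start, d_end) = labels[a], labels[d]
    return a_start < d_start and d_end < a_end

def insert_leaf(labels, name, low, high):
    """Label a new leaf inside the unused open range (low, high); None if too tight."""
    if high - low < 3:
        return None  # fewer than two free integers: relabeling would be needed
    third = (high - low) // 3
    labels[name] = (low + third, high - third)
    return labels[name]

# Hypothetical tree: a has children b and c; insert a new leaf between them.
labels = label_regions({"a": ["b", "c"]}, "a")
new_region = insert_leaf(labels, "new", labels["b"][1], labels["c"][0])
```

Here the caller passes the end of the left sibling and the start of the right sibling as the free range; once a range is exhausted, some relabeling becomes unavoidable, which is why the amount of reserved space has to be matched to the expected update pattern.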
15.
Su-Cheng Haw, Chien-Sing Lee 《Journal of Systems and Software》2009,82(6):1025-1035
Optimizing query processing is always a challenging task in the XML database community. Current state-of-the-art approaches focus mainly on simple queries. Yet, as the usage of XML shifts towards the data-oriented paradigm, more and more complex query processing needs to be supported. In this paper, we present TwigX-Guide, a hybrid system that takes advantage of the path summary in DataGuide and the region encoding in TwigStack to improve complex query processing. Experimental results indicate that TwigX-Guide processes complex queries on average 38% better than the TwigStack algorithm, 31% better than TwigINLAB, 11% better than TwigStackList, and about 9% better than TwigStackXB in terms of execution time.
16.
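Based on an analysis of ordering in the XML data model and in XML query languages, this work proposes a prefix encoding method for ordered XML trees and argues for its correctness and effectiveness in terms of five properties: uniqueness, determinism, dynamism, flexibility, and conciseness. It also applies the idea of layered encoding to address the growth in label length as XML documents grow larger.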
17.
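Path expressions are widely used in XML query languages to express nesting and reference relationships between objects, and evaluating them is a key problem in query processing. This paper proposes a path join method based on a path index and an encoding scheme: the path index finds the target set of an object's descendants or ancestors in time proportional to the path length, and the encoding scheme determines ancestor-descendant relationships between objects in constant time. Experimental results show that the method is efficient, and that it clearly outperforms top-down or bottom-up traversal when joining large numbers of objects or when the path length, or the out-degree or in-degree of nodes on the path, is large.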
18.
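The structural join is a core operation in XML query processing and has attracted attention from the research community. Efficient algorithms are the key to efficient query processing. Many structural join algorithms have been proposed, and most of them rest on one of two preconditions: the input element sets are indexed, or they are sorted. When neither condition holds, the cost of sorting or indexing the input data on the fly degrades the performance of these algorithms considerably. Based on this observation, this work proposes a structural join algorithm based on region partitioning. The algorithm follows the idea of task decomposition and uses the properties of region encoding to partition the input sets. A detailed algorithm design is given, and the I/O complexity of the algorithm is analyzed. Extensive experimental results show that the algorithm performs well, outperforms existing sort-merge algorithms when the input data is unsorted or unindexed, and gives query plans more options.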
19.
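A Two-Step Method for Image Region Labeling and Edge Detection (cited by 4; self-citations: 3; citations by others: 1)
Regions are the complement of edges, and considering both together helps produce better image segmentation algorithms. This work proposes a two-step method for region labeling and edge detection based on Markov neural networks. The input image first passes through a single-layer Markov neural network for multi-class region labeling; the region contours are then fed as initial values into a second neural network for edge detection and refinement. A histogram-analysis method for initial class labeling is also described. Experiments on both simulated and real images gave good results.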