期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Efficient structural joins with clustered extents

Jongik Kim Sang Ho Lee 《Information Processing Letters》2004,91(2):69-75

相似文献

2.

A path-based node filtering method for efficient structural joins

Kyung-Sub Min Hyoung-Joo Kim 《Information Processing Letters》2005,95(5):480-486

相似文献

3.

路径分区编码优化小枝查询 总被引：1，自引：1，他引：0

徐小双冯玉才王锋周英飚张俊《计算机科学》2010,37(3):182-187204

有效地存储查询XML文档已经成为当今数据库领域的研究热点。从XML文档的路径统计出发,提出了路径分区存储编码方案,并依此消除了小枝查询的后裔边和通配符。针对这类不含//和*的小枝查询,利用路径分区编码的特性,给出了基于结构约束节点的Twig查询算法,极大地减少了结构连接次数。实验表明,该算法能有效滤除无关元素,提高小枝查询效率。相似文献

4.

Query processing of multi-way stream window joins

Moustafa A. Hammad Walid G. Aref Ahmed K. Elmagarmid 《The VLDB Journal The International Journal on Very Large Data Bases》2008,17(3):469-488

This paper introduces a class of join algorithms, termed W-join, for joining multiple infinite data streams. W-join addresses the infinite nature of the data streams by joining stream data items that lie within a sliding window and that match a certain join condition. In addition to its general applicability in stream query processing, W-join can be used to track the motion of a moving object or detect the propagation of clouds of hazardous material or pollution spills over time in a sensor network environment. We describe two new algorithms for W-join and address variations and local/global optimizations related to specifying the nature of the window constraints to fulfill the posed queries. The performance of the proposed algorithms is studied experimentally in a prototype stream database system, using synthetic data streams and real time-series data. Tradeoffs of the proposed algorithms and their advantages and disadvantages are highlighted, given variations in the aggregate arrival rates of the input data streams and the desired response times per query. This is an extended version of the paper published in the Proceedings of the 15th International Conference on Scientific and Statistical Database Management, SSDBM 2003, Boston, U.S.A., pp. 75–84. 相似文献

5.

Generalizing prefix filtering to improve set similarity joins

Leonardo Andrade Ribeiro Theo Härder 《Information Systems》2011

Identification of all pairs of objects in a dataset whose similarity is not less than a specified threshold is of major importance for management, search, and analysis of data. Set similarity joins are commonly used to implement this operation; they scale to large datasets and are versatile to represent a variety of similarity notions. Most methods proposed so far present two main phases at a high level of abstraction: candidate generation producing a set of candidate pairs and verification applying the actual similarity measure to the candidates and returning the correct answer. Previous work has primarily focused on the reduction of candidates, where candidate generation presented the major effort to obtain better pruning results. Here, we propose an opposite approach. We drastically decrease the computational cost of candidate generation by dynamically reducing the number of indexed objects at the expense of increasing the workload of the verification phase. Our experimental findings show that this trade-off is advantageous: we consistently achieve substantial speed-ups as compared to known algorithms. 相似文献

6.

XML数据库结构连接算法之分析

门爱华周立柱张亚鹏《计算机科学》2007,34(6):136-138

结构连接是XML查询处理的核心操作,受到了计算机研究界的高度关注.高效的算法是高效查询处理的关键,目前已经提出许多结构连接的算法.本文介绍了几种典型的算法,并分析了这几种算法的优缺点. 相似文献

7.

Native XML数据库的结构连接算法研究

张鹏冯建华房志峰《计算机应用》2008,28(4):869-871

Native XML数据库的快速查询,可以通过基于XML文档编码的结构连接算法实现。在对现有结构连接算法进行综述的前提下,提出一种新的Native XML数据库的结构连接算法——基于深度均匀划分的结构连接算法(DRIAM)。该算法不要求输入数据AList和DList有序或在其节点编码上建有索引,避免了排序和索引所增加的额外开销;不需要输入数据AList和Dlist全部加载到内存中,可以适应不同内存大小限制的情况,并且该算法时间复杂度非常低。相似文献

8.

基于索引的XML查询技术研究 总被引：2，自引：0，他引：2

肖袁吉根林《计算机科学》2007,34(1):78-80

介绍了目前XML数据查询技术的研究现状,对主要的XML索引查询技术作了较深入的探讨,其中包括：基于路径索引的XML查询方法,如DataGuide、1-index、A（k）索引等;基于编码的XML索引查询方法,如Anc_Desc_B^＋、XR树＋XR-Stack算法等。文中对相关XML索引查询方法的优点和不足进行了分析。相似文献

9.

基于扩展区间编码的XML结构连接算法 总被引：1，自引：0，他引：1

朱晓娟《计算机工程》2010,36(22):49-51

结构连接的效率直接影响XML查询的性能。经典的Anc-Des-B+算法在判断双亲/孩子关系时跳过双亲节点的后裔(非孩子)节点的能力不强。为此,基于区间编码的思想提出一种改进的编码方法,把每个节点译码为六元组,并增加双亲节点的信息。给出的ZParent算法可以跳过孩子列表中所有不参与连接的元素节点,只需要扫描一次列表P和列表C,即可实现基于该编码的结构连接计算。实验结果表明,该方法具有较好的时间性能。相似文献

10.

关联数据分布树在分布式查询中的应用

陈庆奎 HUANG Yu-tian 《计算机工程与设计》2008,29(15)

连接操作是影响分布式查询性能的关键因素,数据存储是影响连接操作的重要因素.为了提高分布式系统的查询性能,通过研究数据之间的关系,提出一个关联数据分布树.利用该关联数据分布树来构造一系列的关联元组集合,然后按照各个站点的负载能力,把这些关联数据集合分配给相关站点.实验结果表明,当多个关系频繁的进行连接操作时,关联数据分布树能有效地提高整个分布式系统的查询性能. 相似文献

11.

下载免费PDF全文

Chang-Xuan Wan Xi-Ping Liu 《计算机科学技术学报》2007,22(2):171-181

The processing of XML queries can result in evaluation of various structural relationships. Efficient algorithms for evaluating ancestor-descendant and parent-child relationships have been proposed. Whereas the problems of evaluating preceding-sibling-following-sibling and preceding-following relationships are still open. In this paper, we studied the structural join and staircase join for sibling relationship. First, the idea of how to filter out and minimize unnecessary reads of elements using parent＇s structural information is introduced, which can be used to accelerate structural joins of parent-child and preceding-sibling-following-sibling relationships. Second, two efficient structural join algorithms of sibling relationship are proposed. These algorithms lead to optimal join performance： nodes that do not participate in the join can be judged beforehand and then skipped using B^＋-tree index. Besides, each element list joined is scanned sequentially once at most. Furthermore, output of join results is sorted in document order. We also discussed the staircase join algorithm for sibling axes. Studies show that, staircase join for sibling axes is close to the structural join for sibling axes and shares the same characteristic of high efficiency. Our experimental results not only demonstrate the effectiveness of our optimizing techniques for sibling axes, but also validate the efficiency of our algorithms. As far as we know, this is the first work addressing this problem specially. 相似文献

12.

A partition index for XML and semi-structured data

Jongik Hyoung-Joo 《Data & Knowledge Engineering》2004,51(3):349-368

XML and other semi-structured data can be represented by a graph model. The paths in a data graph are used as a basic constructor of a query. Especially, by using patterns on paths, a user can formulate more expressive queries. Patterns in a path enlarge the search space of a data graph and current research for indexing semi-structured data focuses on reducing the search space. However, the existing indexes cannot reduce the search space when a data graph has some references.

In this paper, we introduce a partitioning technique for all paths in a data graph and an index graph which can effectively find appropriate path partitions for a path query with patterns. 相似文献

13.

Coding-based Join Algorithms for Structural Queries on Graph-Structured XML Document

Hongzhi Wang Jianzhong Li Wei Wang Xuemin Lin 《World Wide Web》2008,11(4):485-510

In many applications, XML documents need to be modelled as graphs. The query processing of graph-structured XML documents brings new challenges. In this paper, we design a method based on labelling scheme for structural queries processing on graph-structured XML documents. We give each node some labels, the reachability labelling scheme. By extending an interval-based reachability labelling scheme for DAG by Rakesh et al., we design labelling schemes to support the judgements of reachability relationships for general graphs. Based on the labelling schemes, we design graph structural join algorithms to answer the structural queries with only ancestor-descendant relationship efficiently. For the processing of subgraph query, we design a subgraph join algorithm. With efficient data structure, the subgraph join algorithm can process subgraph queries with various structures efficiently. Experimental results show that our algorithms have good performance and scalability. Support by the Key Program of the National Natural Science Foundation of China under Grant No.60533110; the National Grand Fundamental Research 973 Program of China under Grant No. 2006CB303000; the National Natural Science Foundation of China under Grant No. 60773068 and No. 60773063. 相似文献

14.

Extending path summary and region encoding for efficient structural query processing in native XML databases

Su-Cheng Haw Author Vitae Chien-Sing Lee Author Vitae 《Journal of Systems and Software》2009,82(6):1025-1035

Optimizing query processing is always a challenging task in the XML database community. Current state-of-the-art approaches focus mainly on simple query. Yet, as the usage of XML shifts towards the data-oriented paradigm, more and more complex query processing needs to be supported. In this paper, we present TwigX-Guide, a hybrid system, which takes advantage of the beautiful features of path summary in DataGuide and region encoding in TwigStack to improve complex query processing. Experimental results indicate that TwigX-Guide can process complex queries on an average 38% better than the TwigStack algorithm, 31% better than TwigINLAB, 11% better than TwigStackList and about 9% better than TwigStackXB in terms of execution time. 相似文献

15.

有效支持XML结构化连接的索引——CATI 总被引：1，自引：0，他引：1

于亚新王国仁张海宁李建新《计算机研究与发展》2007,44(1):111-118

结构化连接的效率直接影响着XML查询的性能,目前对XML的结构化连接大多都是基于编码的方法.介绍了一种全新的有效支持XML结构化连接的树索引CATI(compact ancestor tree index)CATI的基本思想是,对于给定的一个祖先后代查询(A-D查询)或Twig查询,遍历XML文档,找出所有的祖先A的实例,用以建立CATI的主干;对于每个A实例,找出它的直接后代D的实例链接在它的后面.因为经典的结构连接算法Stack-Tree算法效率较高且使用较广,因此应用基于CATI的结构连接算法和基于Stack-Tree的结构连接算法就A-D查询和Twig查询做了大量实验.实验结果表明,基于CATI的结构化连接在一般查询情况下性能明显优于基于Stack-Tree的结构化连接. 相似文献

16.

基于扩展前缀编码的左孩子右兄弟结构连接算法

文思文贵华《计算机工程与设计》2010,31(10)

为了更加有效实现XML文档的结构查询,加强结构连接操作的效率,提出一种新结构连接算法.该算法采用扩展的前缀编码方案,在编码中增加了type、index等字段以利于定位树中结点在祖先结点列表或者后裔结点列表中的位置.该算法通过将XML文档树转换成左孩子右兄弟树,并定位树中一个祖先元素的起始点下标和终结点下标来找到该祖先元素的后裔结点列表.算法时间复杂度分析表明了该算法比现有算法的性能更好. 相似文献

17.

基于关系数据库有效地实现RPE查询 总被引：5，自引：1，他引：5

刘云生万常选徐升华《小型微型计算机系统》2003,24(10):1764-1771

各种XML查询语言的共同特点就是利用正则路径表达式(RPE)来导航XML文档的查询。本文结合我们提出的一种新的XML数据的关系存储模式，对有效地实现RPE查询的相关研究工作进行了总结，并提出了两个有效地实现包含连接的索引改进归并连接算法。算法采用索引定位技术、短路技术和预侦技术来减少连接代价。因此，不仅能够在当前上下文计算环境下有效地实现包含连接的计算，而且能够大量地避免包含连接中不必要的扫描和搜索。相似文献

18.

DM XML加速线性Twig查询的实现

徐小双冯玉才周英飚《计算机工程与应用》2009,45(16):9-12

每一个复杂的Twig查询都由线性Twig查询构成,有效地处理线性Twig查询显得非常重要。DM XML系统以国产DM5.6关系数据库为平台,融合结构映射和模型映射,实现独特的路径分区编码方案来存储XML数据。在系统中,线性Twig查询解析后,形成线性Twig查询的路径集,而该集合中的每一个路径可被唯一变换为关系数据库中整型主键的范围查询。实验结果显示,路径分区编码方案能加速线性Twig查询,它将为高效实现复杂Twig查询奠定基础。相似文献

19.

一种新的XML文档的存储平台SDML的实现技术

洪晓光《计算机科学》2005,32(2):80-83

目前,XML文档数据库(NXD—Native XML DBMS)的设计和存储正受到越来越多的关注,这是由于它可以灵活地表示各种数据,尤其是那些关系模式无法表达的复杂的数据。已经有一些NXD产品出现。而对XML文档的存储的好坏直接影响到它的查询效率,基于此我们自主提出了一种高效的XML文档存储平台SDML。详细讨论了它的存储结构和实现细节。特别提出了如何解决具有大量结构相同元素的存储方法,并给出了在其上进行查询、插入、删除和索引维护等操作的解决方案。给出了这种结构I／O费用代价,并进行了相关的实现,为NXD的存储优化提供一种新的途径。相似文献

20.

以目标节点为导向的XML路径查询处理 总被引：18，自引：4，他引：14

下载免费PDF全文

王静孟小峰王宇王珊《软件学报》2005,16(5):827-837

XML查询语言将复杂路径表达式作为核心内容.为了加速路径表达式处理,基于路径分解和结构连接操作的处理策略需要更深入的研究.以目标节点为导向的XML路径查询处理框架被提了出来.该方法利用了扩展基本操作来减少连接操作的数目.在路径分解和查询计划选择的过程中,利用查询树中的目标节点来避免中间结果的传递.除了分解规则和策略以外,提出了一组扩展的基本操作和实现算法.初步的实验结果显示,该方法具有良好的性能.它为路径查询处理提供了更多的选择. 相似文献