共查询到19条相似文献,搜索用时 46 毫秒
1.
欧书书 《数字社区&智能家居》2010,(7X):5679-5681
借鉴关系数据库成熟高效的查询技术,并采取以空间换时间的策略。对频繁的子查询进行登记处理,使频繁的子查询能够直接得出结果;采用路径划分,将路径划分成简单路径并行处理求出子路径的中间结果;借用以输出结点为导向的结构连接顺序,减少无效的结构连接和中间结果。提高查询效率。 相似文献
2.
一种高效的XML路径查询索引 总被引:1,自引:0,他引:1
XML文档的查询索引是当前研究的热点。提出一种高效的XML路径查询索引KDXI,首先对XML文档进行编码,然后建立结构索引并对结构索引进行编码。研究了基于KDXI索引结构的半结构连接算法和路径查询处理过程。通过KDXI索引机制,可以有效执行一般的路径查询语句,并避免冗余的结构连接操作。实验证明了KDXI索引机制的优越性。 相似文献
3.
在关键字查询领域,目前提出的大多数索引结构主要考虑的是静态的XML文档.当XML文档出现频繁更新时,这些索引结构可能面临着大范围的重新编码,从而增加了数据库索引维护的代价.为了能在XML文档动态更新的环境下保持其索引结构的稳定,提出了一种支持动态XML文档上关键字查询的索引结构DLSS( DDE Level Structure Summary).该索引结构采用了一种针对动态更新改进的Dewey编码,该编码只需在文档更新时对新的节点赋予相应的编码,而不需要调整原有的编码结构.实验证明,DLSS索引结构可以在XML文档频繁更新或者较少更新时都能保持索引结构的相对稳定,并能在其上实现较高的关键字查询效率. 相似文献
4.
5.
基于索引的XML查询技术研究 总被引:2,自引:0,他引:2
介绍了目前XML数据查询技术的研究现状,对主要的XML索引查询技术作了较深入的探讨,其中包括:基于路径索引的XML查询方法,如DataGuide、1-index、A(k)索引等;基于编码的XML索引查询方法,如Anc_Desc_B^+、XR树+XR-Stack算法等。文中对相关XML索引查询方法的优点和不足进行了分析。 相似文献
6.
7.
8.
对XML数据索引的回顾 总被引:4,自引:1,他引:3
随着Internet的迅速发展,XML已成为Internet同上数据表示与交换的事实标准.大量应用采纳了XML.例如Web Service中的数据表示和交换、MPEG7中定义的多媒体特征描述子等。目前.查询XML数据需要用XPath.由于查询语句的复杂性.很难找到一种通用的索引结构能有效支持任意查询。因此,在近十年的研究历程中.为了实现XML数据的快速查询,人们提出了大量索引结构。本文就是对已经提出的一些代表性的索引结构进行分类和总结.并指出其优缺点和所能支持的查询。 相似文献
9.
《计算机学报》2014,(8)
针对基于k双拟的结构索引创建和更新低效问题、查询结果重复验证问题以及标签路径不可获得性问题,提出了一种新的结构索引L(k)-index.L(k)-index通过引入标签路径,在创建时无须k次遍历原数据,并采取批量更新策略,大大提高索引创建和更新的效率,而在空间上仅有很小增加.对于长度大于k+1的路径查询,L(k)-index无须访问原数据进行验证,并支持批量节点的标签路径获得.通过大量实验表明,同A(k)-index相比,L(k)-index创建时间平均提高66.7%,查询处理时间效率平均提高68.9%,批量更新效率平均每节点提高58.8%,而空间仅增加22.5%. 相似文献
10.
一种高效的XML多分支路径查询算法 总被引:2,自引:0,他引:2
肖袁 《计算机应用与软件》2010,27(7):152-155
目前XML单路径查询和简单的分支路径查询已经得到了较好的解决,但如何高效地实现XML多分支路径查询还没有很好的方法。提出一种高效的XML多分支查询算法MBPQ。算法MBPQ首先对XML文档和被查询的多分支路径结点分别按照各自不同的方式进行编码,并将被查询的多分支路径拆分成单路径,最后将单路径查询匹配成多分支查询结果。在单路径查询结果匹配过程中,算法MBPQ利用栈控制匹配过程,按照查询树从左到右、自底向上的顺序匹配具有共同祖先结点的单路径查询结果,从而提高匹配效率。实验表明,与现有的XML多分支查询一般算法相比,算法MBPQ的查询效率高。 相似文献
11.
XML查询语言将复杂路径表达式作为核心内容.为了加速路径表达式处理,基于路径分解和结构连接操作的处理策略需要更深入的研究.以目标节点为导向的XML路径查询处理框架被提了出来.该方法利用了扩展基本操作来减少连接操作的数目.在路径分解和查询计划选择的过程中,利用查询树中的目标节点来避免中间结果的传递.除了分解规则和策略以外,提出了一组扩展的基本操作和实现算法.初步的实验结果显示,该方法具有良好的性能.它为路径查询处理提供了更多的选择. 相似文献
12.
In many applications, XML documents need to be modelled as graphs. The query processing of graph-structured XML documents
brings new challenges. In this paper, we design a method based on labelling scheme for structural queries processing on graph-structured
XML documents. We give each node some labels, the reachability labelling scheme. By extending an interval-based reachability
labelling scheme for DAG by Rakesh et al., we design labelling schemes to support the judgements of reachability relationships
for general graphs. Based on the labelling schemes, we design graph structural join algorithms to answer the structural queries
with only ancestor-descendant relationship efficiently. For the processing of subgraph query, we design a subgraph join algorithm.
With efficient data structure, the subgraph join algorithm can process subgraph queries with various structures efficiently.
Experimental results show that our algorithms have good performance and scalability.
Support by the Key Program of the National Natural Science Foundation of China under Grant No.60533110; the National Grand
Fundamental Research 973 Program of China under Grant No. 2006CB303000; the National Natural Science Foundation of China under
Grant No. 60773068 and No. 60773063. 相似文献
13.
Twig query pattern matching is a core operation in XML query processing. Indexing XML documents for twig query processing
is of fundamental importance to supporting effective information retrieval. In practice, many XML documents on the web are
heterogeneous and have their own formats; documents describing relevant information can possess different structures. Therefore
some “user-interesting” documents having similar but non-exact structures against a user query are often missed out. In this
paper, we propose the RRSi, a novel structural index designed for structure-based query lookup on heterogeneous sources of XML documents supporting
proximate query answers. The index avoids the unnecessary processing of structurally irrelevant candidates that might show
good content relevance. An optimized version of the index, oRRSi, is also developed to further reduce both space requirements and computational complexity. To our knowledge, these structural
indexes are the first to support proximity twig queries on XML documents. The results of our preliminary experiments show
that RRSi and oRRSi based query processing significantly outperform previously proposed techniques in XML repositories with structural heterogeneity.
相似文献
Vincent T. Y. NgEmail: |
14.
The processing of XML queries can result in evaluation of various structural relationships. Efficient algorithms for evaluating ancestor-descendant and parent-child relationships have been proposed. Whereas the problems of evaluating preceding-sibling-following-sibling and preceding-following relationships are still open. In this paper, we studied the structural join and staircase join for sibling relationship. First, the idea of how to filter out and minimize unnecessary reads of elements using parent's structural information is introduced, which can be used to accelerate structural joins of parent-child and preceding-sibling-following-sibling relationships. Second, two efficient structural join algorithms of sibling relationship are proposed. These algorithms lead to optimal join performance: nodes that do not participate in the join can be judged beforehand and then skipped using B^+-tree index. Besides, each element list joined is scanned sequentially once at most. Furthermore, output of join results is sorted in document order. We also discussed the staircase join algorithm for sibling axes. Studies show that, staircase join for sibling axes is close to the structural join for sibling axes and shares the same characteristic of high efficiency. Our experimental results not only demonstrate the effectiveness of our optimizing techniques for sibling axes, but also validate the efficiency of our algorithms. As far as we know, this is the first work addressing this problem specially. 相似文献
15.
结构连接是XML查询处理的核心操作,受到了研究界的关注.高效的算法是高效查询处理的关键.目前已经提出了许多结构连接的算法,它们中的大多数都基于如下的前提条件之一:输入元素集合存在索引或者有序.当这些条件不成立时,由于对输入数据临时排序或建索引的代价,这些算法的性能会大大下降.基于这样的观察,提出了一种基于区域划分的结构连接算法.该算法基于任务分解的思想,利用区域编码的特点对输入集合进行划分.给出了详细的算法设计,并对算法的I/O复杂性进行了分析.大量的实验结果显示,该算法具有良好的 性能,在输入数据无序或没有索引的情况下优于现有的排序合并算法,可以为查询计划提供更多的选择. 相似文献
16.
Dunren Che Karl Aberer M. Tamer Özsu 《The VLDB Journal The International Journal on Very Large Data Bases》2006,15(3):263-289
While the information published in the form of XML-compliant documents keeps fast mounting up, efficient and effective query
processing and optimization for XML have now become more important than ever. This article reports our recent advances in
XML structured-document query optimization. In this article, we elaborate on a novel approach and the techniques developed
for XML query optimization. Our approach performs heuristic-based algebraic transformations on XPath queries, represented
as PAT algebraic expressions, to achieve query optimization. This article first presents a comprehensive set of general equivalences
with regard to XML documents and XML queries. Based on these equivalences, we developed a large set of deterministic algebraic
transformation rules for XML query optimization. Our approach is unique, in that it performs exclusively deterministic transformations
on queries for fast optimization. The deterministic nature of the proposed approach straightforwardly renders high optimization
efficiency and simplicity in implementation. Our approach is a logical-level one, which is independent of any particular storage
model. Therefore, the optimizers developed based on our approach can be easily adapted to a broad range of XML data/information
servers to achieve fast query optimization. Experimental study confirms the validity and effectiveness of the proposed approach. 相似文献
17.
The security of published XML data receives exceptional attention due to its sensitive nature in many applications. This paper proposes an XML view publishing method called XFlat. Compared with other methods, XFlat focuses on query performance over the published XML view while simultaneously protecting the sensitive data via encryption techniques. XFlat decomposes an XML tree into a set of sub-trees, in each of which multiple users have the same accessibility to all nodes, and may encrypt and store each sub-tree in a flat, sequential manner. This storage strategy can avoid the nested encryption cost in view construction and the nested decryption cost in query evaluation. In addition, we discuss how to generate a user-specific schema and how to minimize the total space cost of the published XML view when considering the overhead of the relationships among the sub-trees. We also propose an XML schema index to enhance query performance over the final XML view. The experimental results demonstrate the effectiveness and efficiency of the proposed XFlat method. 相似文献
18.
Indexing and querying XML using extended Dewey labeling scheme 总被引:1,自引:0,他引:1
Jiaheng LuAuthor Vitae Xiaofeng MengAuthor VitaeTok Wang LingAuthor Vitae 《Data & Knowledge Engineering》2011,70(1):35-59
Finding all the occurrences of a tree pattern in an XML database is a core operation for efficient evaluation of XML queries. The Dewey labeling scheme is commonly used to label an XML document to facilitate XML query processing by recording information on the path of an element. In order to improve the efficiency of XML tree pattern matching, we introduce a novel labeling scheme, called extended Dewey, which effectively extends the existing Dewey labeling scheme to combine the types and identifiers of elements in a label, and to avoid the scan of labels for internal query nodes to accelerate query processing (in I/O cost). Based on extended Dewey, we propose a series of holistic XML tree pattern matching algorithms. We first present TJFast to answer an XML twig pattern query. To efficiently answer a generalized XML tree pattern, we then propose GTJFast, an optimization that exploits the non-output nodes. In addition, we propose TJFastTL and GTJFastTL based on the tag + level data partition scheme to further reduce I/O costs by level pruning. Finally, we report our comprehensive experimental results to show that our set of XML tree pattern matching algorithms are superior to existing approaches in terms of the number of elements scanned, the size of intermediate results and query performance. 相似文献