首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 29 毫秒
1.
XPath求值性能极大关系到XML应用程序的整体性能.提出的M2(Matrix Match)方法基于节点关系矩阵查询.根据XML区间编码构建关系矩阵,通过查询原语序列的执行实现XPath表达式求值.M2的导航式特点使得XPath的各种查询语义容易实现,包括反向轴操作和以谓词表达的分支查询.对应相同XML数据的关系矩阵可...  相似文献   

2.
XPath是XML的基本查询语言,XPath查询最小化对于提高XML数据库的查询性能具有重要意义.但是,由于XPath查询最小化是一个coNP完备问题,大部分已有的算法局限于处理简单的XPath片段.本文从一个新的角度入手,综合考虑完备性和高效性,提出了一个新的查询最小化框架,与已有算法"面向结点",即逐个删除冗余结点的解决思路不同,本文提出"面向树模式"的方式,即通过计算树模式的自同态映射,寻找目标结点集最小的自同态映射,进而求解最小等价查询树的方法.该方法具有较高的效率,而且在--Z..情况下是完备的,尤其是可以进一步扩展到更复杂的XPath片段.本文以此框架为基础,给出一个可以计算复杂查询模式的算法.  相似文献   

3.
一种基于DTD的XPath逻辑优化方法   总被引:12,自引:1,他引:12  
高军  杨冬青  唐世渭  王腾蛟 《软件学报》2004,15(12):1860-1868
Xpath成为XML数据查询的基本机制.Xpath中表达节点之间的祖孙关系的‘//'和任意匹配字符的‘*'等非确定操作符,增强了Xpath表达方式的灵活性,但同时引入了Xpath处理的复杂性.如何利用DTD减少Xpath中的不确定操作符,从而提高Xpath的执行效率成为一个基本的研究问题.传统方法主要侧重于特定受限Xpath的确定化重写.利用树自动机在一个框架中表达Xpath和DTD,提出了一种新的Xpath树自动机和DTD树自动机的乘积运算,并证明了乘积的结果就是基于DTD的Xpath优化形式,在多项式时间内基于代价获取了Xpath的优化结果.实验数据表明,基于提出的Xpath的逻辑优化方法,能够有效地提高Xpath执行器的执行效率.  相似文献   

4.
In this paper, we address the problem of cardinality estimation of XPath queries over XML data stored in a distributed, Internet-scale environment such as a large-scale, data sharing system designed to foster innovations in biomedical and health informatics. The cardinality estimate of XPath expressions is useful in XQuery optimization, designing IR-style relevance ranking schemes, and statistical hypothesis testing. We present a novel gossip algorithm called XGossip, which given an XPath query estimates the number of XML documents in the network that contain a match for the query. XGossip is designed to be scalable, decentralized, and robust to failures—properties that are desirable in a large-scale distributed system. XGossip employs a novel divide-and-conquer strategy for load balancing and reducing the bandwidth consumption. We conduct theoretical analysis of XGossip in terms of accuracy of cardinality estimation, message complexity, and bandwidth consumption. We present a comprehensive performance evaluation of XGossip on Amazon EC2 using a heterogeneous collection of XML documents.  相似文献   

5.
The XML stream filtering is gaining widespread attention from the research community in recent years. There have been many efforts to improve the performance of the XML filtering system by utilizing XML schema information. In this paper, we design and implement an XML stream filtering system, SFilter, which uses DTD or XML schema information for improving the performance. We propose the simplification and two kinds of optimization, one is static and the other is dynamic optimization. The Simplification and static optimization transform the XPath queries to make automata as an index structure for the filtering. The dynamic optimization are done in runtime at the filtering time. We developed five kinds of static optimization and two kinds of dynamic optimization. We present the novel filtering algorithm for the resulting transformed XPath queries and runtime optimizing. The experimental result shows that our system filters the XML streams efficiently.  相似文献   

6.
介绍了一种能够统一地索引全文数据与XML树型结构数据的模型——互关联后继树,提出了后继模式树的概念,使用后继模式树有效地处理了XPath中的正则查询问题。在后继模式树的基础上提出了XPath中节点与文本的联合查询方法。结果表明该方法能够有效地提高XPath中节点与文本联合查询的效率。  相似文献   

7.
The important challenge of evaluating XPath queries over XML streams has sparked much interest in the past few years. A number of algorithms have been proposed, supporting wider fragments of the query language, and exhibiting better performance and memory utilization. Nevertheless, all the algorithms known to date use a prohibitively large amount of memory for certain types of queries. A natural question then is whether this memory bottleneck is inherent or just an artifact of the proposed algorithms.In this paper we initiate the first systematic and theoretical study of lower bounds on the amount of memory required to evaluate XPath queries over XML streams. We present a general lower bound technique, which given a query, specifies the minimum amount of memory that any algorithm evaluating the query on a stream would need to incur. The lower bounds are stated in terms of new graph-theoretic properties of queries. The proofs are based on tools from communication complexity.We then exploit insights learned from the lower bounds to obtain a new algorithm for XPath evaluation on streams. The algorithm uses space close to the optimum. Our algorithm deviates from the standard paradigm of using automata or transducers, thereby avoiding the need to store large transition tables.  相似文献   

8.
Providing efficient access to XML documents becomes crucial in XML database systems. More and more concurrency control protocols for XML database systems were proposed in the past few years. Being an important language for addressing data in XML documents, XPath expressions are the basis of several query languages, such as XQurey and XSLT. In this paper, we propose a lock-based concurrency control protocol, called XLP, for transactions accessing XML data by the XPath model. XLP is based on the XPath model and has the features of rich lock modes, low lock conflict and lock conversion. XLP is also proved to ensure conflict serializability. In sum, there are three major contributions in this paper. The proposed XLP supports most XPath axes, rather than simple path expressions only. Conflict conditions and rules in the XPath model are analyzed and derived. Moreover, a lightweighted lock mode, P-lock, is invented and integrated into XLP for better concurrency.  相似文献   

9.
如何在XML数据流上高效地执行XPath查询,是XML数据流管理的关键问题。DTD结构信息对提高XML查询效率有很大帮助,已有的大部分算法没有利用这一资源。提出了一种使用DTD进行XML数据流查询处理的方法,具有以下特征:利用树自动机表示XPath;通过XPath树自动机与DTD树匹配,预先标识不匹配查询结构的DTD节点;给出一种利用DTD的XML流索引方法DBXSI;执行查询时,根据流索引信息直接跳过某些与查询不匹配的节点及子树。实验结果表明:该方法可有效支持Xpath查询,效率优于传统算法。  相似文献   

10.
XQuery是由W3C提出的XML查询语言标准。XPath路径表达式是XQuery的基础,其查询处理的效能在很大程度上决定着XQuery的处理效率。提出了一种基于代价估算的XPath优化技术,基于层次编码对结构连接的执行代价进行估算,并用动态规划方法求解对于该路径表达式最佳的执行顺序;根据计算结果对查询计划树进行重构得到优化的查询计划。对此算法进行了实验测试和分析,结果表明该算法能得到较佳的查询优化效果。  相似文献   

11.
通过分析XQuery查询与XPath查询的区别与联系,定义了扩展的基本XSIEQ机E-XSIEQ,它是一种被索引化、基于栈的自动机。提出用变量表来收集XPath查询结果,并将这些中间结果组织成原子表集合,结果构造时能够根据原子表元组之间的上下文关系,对原子表集合快速地进行连接。描述了XQuery查询过程中的结构化连接算法,给出了结果构造的时间性能分析。  相似文献   

12.
XPath is ubiquitous in XML applications for navigating XML trees and selecting a set of element nodes.In XPath query processing,one of the most important issues is how to effciently check containment relationship between two XPath expressions.To get out of the intricacy and complexity caused by numerous XPath features,we investigate this issue on a frequently used fragment of XPath expressions that consists of node tests,the child axis(/),the descendant axis(//),branches([])and label wildcards(*).Prior work has shown that homomorphism technology can be used for containment checking.However,homomorphism is the sufficient but not necessary condition for containment.For special classes of this fragment,the homomorphism algorithm returns false negatives.To address this problem,this paper proposes two containment techniques,conditioned homomorphism and hidden conditioned homomorphism,and then presents sound algorithms for checking containment.Experimental results confirm the practicability and efficiency of the proposed algorithms.  相似文献   

13.
基于Object-Z的XPath形式化语义   总被引:1,自引:1,他引:0  
本文描述了XPath语言的形式化语义。一个统一的面向对象的语义视角用于建模所有XPath语言构造。语义的表示采用形式化规范语言Object-Z的符号系统。这种高度结构化的语义模型具有简洁、可组合性和可复用性的特点。  相似文献   

14.
为解决XML管理安全问题,提出了DTD可选的XML访问控制系统OD-XACS(XML Access Control System with Optional DTD),并给出了安全性分析.OD-XACS支持访问控制规则中带有{//,*,[ ]}的复杂XPath式.有DTD时,ODXACS利用XPath式对DTD的可满足性验证访问控制规则的有效性,并对由规则中XPath式构造的不确定有限自动机进行具体化,消除了这些XPath式中的冗余.实验表明,访问控制规则的验证和具体化可以极大地减轻XML查询引擎的负担.  相似文献   

15.
沈洁  印桂生  王向辉 《计算机科学》2010,37(11):172-174
提出了一种算法来分析用正则树表示的XML数据中对XPath的决策问题,并用该方法检查XPath静态类型的数据。此外根据对有限顺序树的带逆操作的逻辑的判定性定理,证明了算法时间复杂度为简单的一个表达式的指数级大小。提出了一套实用的、有效的数学模型来解决XPath表达式中的可满足性问题。并通过对一些决策问题,例如带或者不带类型约束的XPath的空、包含、重叠和覆盖问题的实验对算法进行了证实,实验证明该系统能够有效用于对操作XPath表达式和XML类型注释的程序语言的静态分析器中。  相似文献   

16.
对XPath模式定位能力的扩充   总被引:5,自引:0,他引:5  
基于一般结构模式的定位手段对一个完备的面向XML文档内容的定位机制来说是重要的,而XPath在这方面还有所欠缺。首先通过实例分析阐明了XPath提供基本一般结构模式的定位手段的必要性,随后通过一些扩充定义来改善它的表示能力。扩充从概念、语法描述和执行机制3个方面展开,概念上,引入了结点序列模式和结点正则表达式的定义;语法描述上,为XPath扩充了数据类型NRegexType和结点序列模式匹配函数match;执行机制上,定义了一个用于分析结点序列模式匹配过程的形式化自动机。  相似文献   

17.
含XPath的表达式的解析与应用   总被引:5,自引:0,他引:5  
提出了在应用XML数据中的一类普遍性问题,指出通过定义、处理含XPath的规则式可有效地提高软件的可扩展性.阐述如何运用JavaCC实现对XPath规则式的解析处理,以及扩展的思路。  相似文献   

18.
Existing encoding schemes and index structures proposed for XML query processing primarily target the containment relationship, specifically the parent–child and ancestor–descendant relationship. The presence of preceding-sibling and following-sibling location steps in the XPath specification, which is the de facto query language for XML, makes the horizontal navigation, besides the vertical navigation, among nodes of XML documents a necessity for efficient evaluation of XML queries. Our work enhances the existing range-based and prefix-based encoding schemes such that all structural relationships between XML nodes can be determined from their codes alone. Furthermore, an external-memory index structure based on the traditional B+-tree, XL+-tree(XML Location+-tree), is introduced to index element sets such that all defined location steps in the XPath language, vertical and horizontal, top-down and bottom-up, can be processed efficiently. The XL+-trees under the range or prefix encoding scheme actually share the same structure; but various search operations upon them may be slightly different as a result of the richer information provided by the prefix encoding scheme. Finally, experiments are conducted to validate the efficiency of the XL+-tree approach. We compare the query performance of XL+-tree with that of R-tree, which is capable of handling comprehensive XPath location steps and has been empirically shown to outperform other indexing approaches.  相似文献   

19.
提出一种将XML数据解析为DOM节点后,把DOM节点映射为LDAP目录项,使用LDAP服务器储存XML数据的方案;在LDAP目录服务上实现了对XML数据的XPath查询。实验数据说明了方案的可行性。对比实验表明方案除了具有较快的储存速度,在LDAP上实现的XPath查询效率高于传统的基于DOM的XPath查询效率。  相似文献   

20.
One of the primary issues confronting XML message brokers is the difficulty associated with processing a large set of continuous XPath queries over incoming XML streams. This paper proposes a novel system designed to present an effective solution to this problem. The proposed system transforms multiple XPath queries before their run-time into a new data structure, called an XP-table, by sharing their common constraints. An XP-table is matched with a stream relation (SR) transformed from a target XML stream by a SAX parser. This arrangement is intended to minimize the run-time workload of continuous query processing. In addition, an early-query-termination strategy is proposed as an improved alternative to the basic approach. It optimizes query processing by arranging the evaluation sequence of the member-lists (m-lists) of an XP-table adaptively and offers increased efficiency, especially in cases of low selectivity. System performance is estimated and verified through a variety of experiments, including comparisons with previous approaches such as YFilter and LazyDFA. The proposed system is practically linear-scalable and stable for evaluating a set of XPath queries in a continuous and timely fashion.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号