首页 | 本学科首页   官方微博 | 高级检索  
     

连续概率XML数据查询处理技术
引用本文:张晓琳,郑珍珍,刘立新,李玉峰.连续概率XML数据查询处理技术[J].计算机工程与科学,2012,34(12):134-139.
作者姓名:张晓琳  郑珍珍  刘立新  李玉峰
作者单位:内蒙古科技大学信息工程学院,内蒙古包头,014010
基金项目:国家自然科学基金资助项目,内蒙古自然科学基金重点项目
摘    要:目前查询连续概率XML数据多采用离散化方法,需要处理大量直方图分段,查询效率较低。本文提出了一种基于p-文档模型的连续概率XML数据查询处理技术,首先利用cont节点扩展p-文档模型支持任意的连续分布,在cont节点中编码概率密度函数以及他们的参数;其次采用twig模式匹配找到符合用户要求的路径;然后根据要查询的连续分布类型确定概率查询应该使用符号表示法、积分法或直方图近似法:标准连续分布通过符号表示法中的参数或复杂的累积分布函数计算查询结果,满足积分条件的非标准连续分布采用积分法,其它情况采用直方图近似法。实验结果表明,该方法在概率查询的精确度以及响应时间上比现有方法更高效。

关 键 词:p-文档模型  概率XML  连续分布  查询处理

Query Processing Technology on Continuous Probabilistic XML
ZHANG Xiao-lin , ZHENG Zhen-zhen , LIU Li-xin , LI Yu-feng.Query Processing Technology on Continuous Probabilistic XML[J].Computer Engineering & Science,2012,34(12):134-139.
Authors:ZHANG Xiao-lin  ZHENG Zhen-zhen  LIU Li-xin  LI Yu-feng
Affiliation:(School of Information Engineering,Inner Mongolia University of Science and Technology,Baotou 014010,China)
Abstract:At present,most methods of querying the continuous probabilistic XML are discretized.They are not very efficient because the query operators have to process a large number of histogram segments during the query execution.A continuous probabilistic XML query processing technology based on the p-document model is proposed.Firstly,the p-document model is expanded to support any continuous distribution by cont node,and the probability density functions and their parameters are encoded in cont node. Secondly, the path that meet user's requirements is found by using the twig pattern match,and then whether a probability query should be executed is decided by using the symbolic form,histograms or using integrals according to the type of continuous distributions to be queried. Standard continuous distributions use the parameters of the symbolic representation in conjunction with some sophisticated functions to compute a query answer,non-standard continuous distributions that meet integral condition adopt the integral method,and other distributions use the histograms approximating. Experimental results show that this approach has a higher efficiency on both accuracy and response time than the existing approach.
Keywords:p-document model  probabilistic XML  continuous distribution  query process
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《计算机工程与科学》浏览原始摘要信息
点击此处可从《计算机工程与科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号