首页 | 本学科首页   官方微博 | 高级检索  
     

半结构化数据查询的处理和优化
引用本文:陈 滢,王能斌.半结构化数据查询的处理和优化[J].软件学报,1999,10(8):883-890.
作者姓名:陈 滢  王能斌
作者单位:东南大学计算机科学与工程系,南京,210096;东南大学计算机科学与工程系,南京,210096
基金项目:本文研究得到国家自然科学基金资助.
摘    要:半结构化数据的特点是数据的结构不规则或不完整,其模型都基于带根有向图,因此,查询处理过程本质上是对图的搜索过程.另外,通配路径使查询处理更加复杂化.文章详细介绍了异构数据源集成系统Versatile中采取的半结构数据OIM(model for object integration)对象的查询和优化策略,包括查询计划的生成、路径扩展和路径索引、层次索引和基于数据源知识这3种查询优化方法.文章介绍的方法同样适用于其他的半结构化数据模型.

关 键 词:半结构化数据  查询处理  优化.
收稿时间:6/2/1998 12:00:00 AM
修稿时间:9/1/1998 12:00:00 AM

Querying and Optimizing Semistructured Data
CHEN Ying and WANG Neng-bin.Querying and Optimizing Semistructured Data[J].Journal of Software,1999,10(8):883-890.
Authors:CHEN Ying and WANG Neng-bin
Affiliation:Department of Computer Science and Engineering Southeast University Nanjing 210096
Abstract:Semistructured data has irregular or incomplete structure. In recent research on semistructured data sources and integration for heterogeneous data sources, models for semistructured data are based on direct graph with root vertex, so querying semistructured data is equivalent with searching in graph. In addition, path with wildcard characters brings more complexity in query processing. In this paper, the authors present the strategies deployed in querying and optimizing OIM (model for object integrating) data in Versatile-a system for integrating heterogeneous data sources. Algorithms for generating query plan and extending path are dis-cussed in detail and three optimization methods, path index (Pindex), level index(Lvindex) and knowledge of data source are introduced. Also the approach can be applicable to other graph-based semistructured data easily.
Keywords:Semistructured data  query processing  optimization  
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《软件学报》浏览原始摘要信息
点击此处可从《软件学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号