首页 | 本学科首页   官方微博 | 高级检索  
     


An algebraic transformation framework for multidatabase queries
Authors:Ee-Peng Lim  Jaideep Srivastava  San-Yih Hwang
Affiliation:(1) School of Applied Science, Nanyang Technological University, Nanyang Avenue, 2263, Singapore;(2) Computer Science Department, University of Minnesota, 55455 Minneapolis, MN;(3) Distributed Computing System Department, Computer and Communication Research Laboratories, Industrial Technology Research Institute (ITRI), K200, Bldg. 14, 195 Sec. 4, Chung-Hsing Rd., Chu-Tung, Hsin-Chu, Taiwan 31015 ROC
Abstract:Existence of semantic conflicts between component databases severely impacts query processing in a multidatabase system. In this paper, we describe two types of semantic conflicts that have to be dealt with in the integration of databases modeling information about related sets of real-world entities. These are the entityidentification problem and theattribute value conflict problem. While thetwo-way outerjoin operation has been commonly used for resolving entity identification problem between two component relations, outerjoins using regular equality comparisons between component relation keys is shown to produce counter-intuitive entity identification result. We remedy this by defining a newkey-equality comparator in place of regular equality comparator, for outerjoins. For the attribute value conflict problem, we define aGeneralized Attribute Derivation (GAD) operation which allows user-defined attribute derivation functions to be used to compute new attributes from the component relations' attributes. By adding two-way outerjoin andGAD to the set of relational operations, the traditional algebraic transformation framework for relational queries is no longer adequate for multidatabase query processing and optimization. As a result, we introduceconstrained query tree as the multidatabase query representation. We show that some knowledge about query predicates and attribute derivation functions can be used to simplify queries. Such knowledge is modeled as an outerjoin graph attached to every outerjoin operation in the query tree. Based on this, we further extend the traditional algebraic transformation framework to include two-way outerjoins andGAD operations. Our framework demonstrates that properties of selection/join predicates and attribute derivation functions can be used to provide interesting transformation alternatives. This framework also serves as a formal ground for developing optimization strategies for multidatabase queries.Recommended by: Clement Yu
Keywords:multidatabase query  integration operation  algebraic transformation  constrained query tree  outerjoin graph
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号