MapMerge: correlating independent schema mappings |
| |
Authors: | Bogdan Alexe Mauricio Hernández Lucian Popa Wang-Chiew Tan |
| |
Affiliation: | 1. IBM Research-Almaden, San Jose, CA, USA 2. UC Santa Cruz, Santa Cruz, CA, USA
|
| |
Abstract: | One of the main steps toward integration or exchange of data is to design the mappings that describe the (often complex) relationships
between the source schemas or formats and the desired target schema. In this paper, we introduce a new operator, called MapMerge,
that can be used to correlate multiple, independently designed schema mappings of smaller scope into larger schema mappings.
This allows a more modular construction of complex mappings from various types of smaller mappings such as schema correspondences
produced by a schema matcher or pre-existing mappings that were designed by either a human user or via mapping tools. In particular,
the new operator also enables a new “divide-and-merge” paradigm for mapping creation, where the design is divided (on purpose)
into smaller components that are easier to create and understand and where MapMerge is used to automatically generate a meaningful
overall mapping. We describe our MapMerge algorithm and demonstrate the feasibility of our implementation on several real
and synthetic mapping scenarios. In our experiments, we make use of a novel similarity measure between two database instances
with different schemas that quantifies the preservation of data associations. We show experimentally that MapMerge improves
the quality of the schema mappings, by significantly increasing the similarity between the input source instance and the generated
target instance. Finally, we provide a new algorithm that combines MapMerge with schema mapping composition to correlate flows
of schema mappings. |
| |
Keywords: | |
本文献已被 SpringerLink 等数据库收录! |
|