首页 | 本学科首页   官方微博 | 高级检索  
     


Instance-based attribute identification in database integration
Authors:Email author" target="_blank">Cecil?Eng?H?ChuaEmail author  Roger?H?L?Chiang  Ee-Peng?Lim
Affiliation:(1) J. Mack Robinson College of Business, Georgia State University,;(2) College of Business Administration, University of Cincinnati,;(3) School of Computer Engineering, Nanyang Technological University,
Abstract:Most research on attribute identification in database integration has focused on integrating attributes using schema and summary information derived from the attribute values. No research has attempted to fully explore the use of attribute values to perform attribute identification. We propose an attribute identification method that employs schema and summary instance information as well as properties of attributes derived from their instances. Unlike other attribute identification methods that match only single attributes, our method matches attribute groups for integration. Because our attribute identification method fully explores data instances, it can identify corresponding attributes to be integrated even when schema information is misleading. Three experiments were performed to validate our attribute identification method. In the first experiment, the heuristic rules derived for attribute classification were evaluated on 119 attributes from nine public domain data sets. The second was a controlled experiment validating the robustness of the proposed attribute identification method by introducing erroneous data. The third experiment evaluated the proposed attribute identification method on five data sets extracted from online music stores. The results demonstrated the viability of the proposed method.Received: 30 August 2001, Accepted: 31 August 2002, Published online: 31 July 2003Edited by L. Raschid
Keywords:Attribute identification  Database integration  Measures of association
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号