首页 | 本学科首页   官方微博 | 高级检索  
     


ETL workflow reparation by means of case-based reasoning
Authors:Artur?Wojciechowski  author-information"  >  author-information__contact u-icon-before"  >  mailto:artur.wojciechowski@cs.put.poznan.pl"   title="  artur.wojciechowski@cs.put.poznan.pl"   itemprop="  email"   data-track="  click"   data-track-action="  Email author"   data-track-label="  "  >Email author  author-information__orcid u-icon-before icon--orcid u-icon-no-repeat"  >  http://orcid.org/---"   itemprop="  url"   title="  View OrcID profile"   target="  _blank"   rel="  noopener"   data-track="  click"   data-track-action="  OrcID"   data-track-label="  "  >View author&#  s OrcID profile
Affiliation:1.Institute of Computing Science,Poznan University of Technology,Poznan,Poland
Abstract:Data sources (DSs) being integrated in a data warehouse frequently change their structures/schemas. As a consequence, in many cases, an already deployed ETL workflow stops its execution, yielding errors. Since in big companies the number of ETL workflows may reach dozens of thousands and since structural changes of DSs are frequent, an automatic repair of an ETL workflow after such changes is of high practical importance. In our approach, we developed a framework, called E-ETL, for handling the evolution of an ETL layer. In the framework, an ETL workflow is semi-automatically or automatically (depending on a case) repaired as the result of structural changes in DSs, so that it works with the changed DSs. E-ETL supports two different repair methods, namely: (1) user defined rules, (2) and Case-Based Reasoning. In this paper, we present how Case-Based Reasoning may be applied to repairing ETL workflows. In particular, we contribute an algorithm for selecting the most suitable case for a given ETL evolution problem. The algorithm applies a technique for reducing cases in order to make them more universal and capable of solving more problems. The algorithm has been implemented in prototype E-ETL and evaluated experimentally. The obtained results are also discussed in this paper.
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号