首页 | 本学科首页   官方微博 | 高级检索  
     

一种基于Agent和活动优先度的ETL过程并行方法
引用本文:陈刚,杜鑫霖,曾司凤,安宝冉.一种基于Agent和活动优先度的ETL过程并行方法[J].计算机工程与科学,2017,39(9):1594-1601.
作者姓名:陈刚  杜鑫霖  曾司凤  安宝冉
作者单位:;1.中国工程物理研究院计算机应用研究所
基金项目:国家863计划(2007AA1236);中国工程物理研究院发展基金(14-FZJJ-0442)
摘    要:ETL是数据仓库获得高质量数据的关键环节,在数据仓库的构建和实施中占有重要地位。针对传统ETL串行执行方式的不足,提出一种基于Agent和活动优先度相结合的ETL并行执行方法。该方法计算ETL执行过程中各个活动的优先度,利用Agent理论和多线程并行计算技术实现并行执行具有相同优先度且相互间没有依赖关系的ETL活动。实验结果表明,该方法在数据量较大时具有较好的加速比,提高了ETL过程的执行效率。

关 键 词:Agent  活动优先度  ETL  并行
收稿时间:2015-11-10
修稿时间:2017-09-25

A parallel method for ETL process based on agent and activity priority
CHEN Gang,DU Xin-lin,ZENG Si-feng,AN Bao-ran.A parallel method for ETL process based on agent and activity priority[J].Computer Engineering & Science,2017,39(9):1594-1601.
Authors:CHEN Gang  DU Xin-lin  ZENG Si-feng  AN Bao-ran
Affiliation:(Institute of Computer Application,China Academy of Engineering Physics,Mianyang 621900,China)
Abstract:ETL is the essential step to obtain high-quality data for data warehouse, and plays an important role in the construction and implementation of data warehouse. Aiming at the deficiency of traditional serial ETL process, we propose a parallel method for ETL based on agent and activity priority. This method first calculates the priority of each ETL activity and then utilizes the agent theory and multi-thread computing techniques to achieve parallel execution of independent ETL activities with the same priority. Experimental results show that this method achieves high speedup when the data volume is large and improves the efficiency of ETL process.
Keywords:agent  activity priority  ETL  parallel  
点击此处可从《计算机工程与科学》浏览原始摘要信息
点击此处可从《计算机工程与科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号