An Evaluation of the Cost and Performance of Scientific Workflows on Amazon EC2
Authors: Gideon Juve, Ewa Deelman, G. Bruce Berriman, Benjamin P. Berman, Philip Maechling
Affiliations: 1. USC Information Sciences Institute, Marina del Rey, CA, 90292, USA
2. NASA Exoplanet Science Institute, Infrared Processing and Analysis Center, Caltech, Pasadena, CA, USA
3. USC Epigenome Center, Los Angeles, CA, USA
4. Southern California Earthquake Center, Los Angeles, CA, USA
Abstract: Workflows are used to orchestrate data-intensive applications in many different scientific domains. Workflow applications typically communicate data between processing steps using intermediate files. When tasks are distributed, these files are either transferred from one computational node to another, or accessed through a shared storage system. As a result, the efficient management of data is a key factor in achieving good performance for workflow applications in distributed environments. In this paper we investigate some of the ways in which data can be managed for workflows in the cloud. We ran experiments using three typical workflow applications on Amazon’s EC2 cloud computing platform. We discuss the various storage and file systems we used, describe the issues and problems we encountered deploying them on EC2, and analyze the resulting performance and cost of the workflows.
This article is indexed in SpringerLink and other databases.