首页 | 本学科首页   官方微博 | 高级检索  
     


Workflow performance improvement using model-based scheduling over multiple clusters and clouds
Affiliation:1. Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL, 60439, USA;2. Leadership Computing Facility Division, Argonne National Laboratory, Argonne, IL, 60439, USA;1. Institute of Computer Science, University of Innsbruck, Austria;2. Departamento de Engenharia Informática, Universidade do Porto, Portugal;1. ITIC, Universidad Nacional de Cuyo. Mendoza, Argentina;2. Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Argentina;3. ISISTAN-UNICEN-CONICET, Tandil, Buenos Aires, Argentina;4. Departamento de Lenguajes y Ciencias de la Computación, Universidad de Málaga, Spain;1. Department of Computing, Macquarie University, Australia;2. Department of Computer Science, Dongduk Women’s University, Republic of Korea;3. School of Information Technologies, The University of Sydney, Australia;4. T-Systems International, USA
Abstract:In recent years, a variety of computational sites and resources have emerged, and users often have access to multiple resources that are distributed. These sites are heterogeneous in nature and performance of different tasks in a workflow varies from one site to another. Additionally, users typically have a limited resource allocation at each site capped by administrative policies. In such cases, judicious scheduling strategy is required in order to map tasks in the workflow to resources so that the workload is balanced among sites and the overhead is minimized in data transfer. Most existing systems either run the entire workflow in a single site or use naïve approaches to distribute the tasks across sites or leave it to the user to optimize the allocation of tasks to distributed resources. This results in a significant loss in productivity. We propose a multi-site workflow scheduling technique that uses performance models to predict the execution time on resources and dynamic probes to identify the achievable network throughput between sites. We evaluate our approach using real world applications using the Swift parallel and distributed execution framework. We use two distinct computational environments-geographically distributed multiple clusters and multiple clouds. We show that our approach improves the resource utilization and reduces execution time when compared to the default schedule.
Keywords:System modeling  Workflow  Optimization  Swift  Clouds
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号