A spatiotemporal compression based approach for efficient big data processing on Cloud |
| |
Authors: | Chi Yang Xuyun Zhang Changmin Zhong Chang Liu Jian Pei Kotagiri Ramamohanarao Jinjun Chen |
| |
Affiliation: | 1. Faculty of Engineering and Information Technology, University of Technology Sydney, PO Box 123, Broadway, NSW 2007, Australia;2. Joowing Australia Pty Ltd., 26 Entally Drive, Wheelers Hill, VIC 3150, Australia;3. School of Computing Science, Simon Fraser University, 8888 University Drive, Burnaby, BC V5A 1S6, Canada;4. Department of Computing and Information Systems, The University of Melbourne, VIC 3110, Australia |
| |
Abstract: | It is well known that processing big graph data can be costly on Cloud. Processing big graph data introduces complex and multiple iterations that raise challenges such as parallel memory bottlenecks, deadlocks, and inefficiency. To tackle the challenges, we propose a novel technique for effectively processing big graph data on Cloud. Specifically, the big data will be compressed with its spatiotemporal features on Cloud. By exploring spatial data correlation, we partition a graph data set into clusters. In a cluster, the workload can be shared by the inference based on time series similarity. By exploiting temporal correlation, in each time series or a single graph edge, temporal data compression is conducted. A novel data driven scheduling is also developed for data processing optimisation. The experiment results demonstrate that the spatiotemporal compression and scheduling achieve significant performance gains in terms of data size and data fidelity loss. |
| |
Keywords: | Big data Graph data Spatiotemporal compression Cloud computing Scheduling |
本文献已被 ScienceDirect 等数据库收录! |
|