A parallel computing framework for big data |
| |
Authors: | Guoliang Chen Rui Mao Kezhong Lu |
| |
Affiliation: | 1.Guangdong Province Key Laboratory of Popular High Performance Computers,Shenzhen,China;2.College of Computer Science and Software Engineering,Shenzhen University,Shenzhen,China |
| |
Abstract: | Big data has received great attention in research and application. However, most of the current efforts focus on system and application to handle the challenges of “volume” and “velocity”, and not much has been done on the theoretical foundation and to handle the challenge of “variety”. Based on metric-space indexing and computationalcomplexity theory, we propose a parallel computing framework for big data. This framework consists of three components, i.e., universal representation of big data by abstracting various data types into metric space, partitioning of big data based on pair-wise distances in metric space, and parallel computing of big data with the NC-class computing theory. |
| |
Keywords: | |
本文献已被 SpringerLink 等数据库收录! |
|