Scalable processing and autocovariance computation of big functional data |
| |
Authors: | Nieves R. Brisaboa Ricardo Cao José R. Paramá Fernando Silva‐Coira |
| |
Affiliation: | 1. LBD Group, Computer Science Department, Facultade de Informática, Universidade da Coru?a, CITIC, Campus de Elvi?a, A Coru?a, Spain;2. MODES Group, Mathematics Departament, Facultade de Informática, CITIC, ITMATI, Campus de Elvi?a, Universidade da Coru?a, A Coru?a, Spain |
| |
Abstract: | This paper presents 2 main contributions. The first is a compact representation of huge sets of functional data or trajectories of continuous‐time stochastic processes, which allows keeping the data always compressed even during the processing in main memory. It is oriented to facilitate the efficient computation of the sample autocovariance function without a previous decompression of the data set, by using only partial local decoding. The second contribution is a new memory‐efficient algorithm to compute the sample autocovariance function. The combination of the compact representation and the new memory‐efficient algorithm obtained in our experiments the following benefits. The compressed data occupy in the disk 75% of the space needed by the original data. The computation of the autocovariance function used up to 13 times less main memory, and run 65% faster than the classical method implemented, for example, in the R package. |
| |
Keywords: | autocovariance function big data compact representation efficient computation functional data R package |
|
|