Efficient retrieval of replicated data |
| |
Authors: | Ali Şaman Tosun |
| |
Affiliation: | (1) Department of Computer Science, University of Texas at San Antonio, San Antonio, TX, 78249 |
| |
Abstract: | Declustering is a common technique used to reduce query response times. Data is declustered over multiple disks and query
retrieval can be parallelized. Most of the research on declustering is targeted at spatial range queries and investigates
schemes with low additive error. Recently, declustering using replication has been proposed to reduce the additive overhead.
Replication significantly reduces retrieval cost of arbitrary queries. In this paper, we propose a disk allocation and retrieval
mechanism for arbitrary queries based on design theory. Using the proposed c-copy replicated declustering scheme,
buckets can be retrieved using at most k disk accesses. Retrieval algorithm is very efficient and is asymptotically optimal with
complexity for a query Q. In addition to the deterministic worst-case bound and efficient retrieval, proposed algorithm handles nonuniform data, high
dimensions, supports incremental declustering and has good fault-tolerance property. Experimental results show the feasibility
of the algorithm.
Recommended by: Sunil Prabhakar |
| |
Keywords: | Declustering Parallel I/O Design theory |
本文献已被 SpringerLink 等数据库收录! |
|