View selection for real conjunctive queries |
| |
Authors: | Foto Afrati Rada Chirkova Manolis Gergatsoulis Vassia Pavlaki |
| |
Affiliation: | (1) Department of Electrical and Computing Engineering, National Technical University of Athens (NTUA), 15773 Athens, Greece;(2) Computer Science Department, North Carolina State University, Campus Box 8206, Raleigh, NC 27695-8206, USA;(3) Department of Archive and Library Sciences, Ionian University, Palea Anaktora, Plateia Eleftherias, 49100 Corfu, Greece |
| |
Abstract: | Given a query workload, a database and a set of constraints, the view-selection problem is to select views to materialize
so that the constraints are satisfied and the views can be used to compute the queries in the workload efficiently. A typical
constraint, which we consider in the present work, is to require that the views can be stored in a given amount of disk space.
Depending on features of SQL queries (e.g., the DISTINCT keyword) and on whether the database relations on which the queries
are applied are sets or bags, the queries may be computed under set semantics, bag-set semantics, or bag semantics. In this
paper we study the complexity of the view-selection problem for conjunctive queries and views under these semantics. We show
that bag semantics is the “easiest to handle” (we show that in this case the decision version of view selection is in NP),
whereas under set and bag-set semantics we assume further restrictions on the query workload (we only allow queries without
self-joins in the workload) to achieve the same complexity. Moreover, while under bag and bag-set semantics filtering views
(i.e., subgoals that can be dropped from the rewriting without impacting equivalence to the query) are practically not needed,
under set semantics filtering views can reduce significantly the query-evaluation costs. We show that under set semantics
the decision version of the view-selection problem remains in NP only if filtering views are not allowed in the rewritings.
Finally, we investigate whether the cgalg algorithm for view selection introduced in Chirkova and Genesereth (Linearly bounded reformulations of conjunctive databases,
pp. 987–1001, 2000) is suitable in our setting. We prove that this algorithm is sound for all cases we examine here, and that
it is complete under bag semantics for workloads of arbitrary conjunctive queries and under bag-set semantics for workloads
of conjunctive queries without self-joins.
Rada Chirkova’s work on this material has been supported by the National Science Foundation under Grant No. 0307072.
The project is co-funded by the European Social Fund (75%) and National Resources (25%)- Operational Program for Educational
and Vocational Training II (EPEAEK II) and particularly the program PYTHAGORAS.
A preliminary version of this paper appears in F. Afrati, R. Chirkova, M. Gergatsoulis, V. Pavlaki. Designing Views to Efficiently
Answer Real SQL Queries. In Proc. of SARA 2005, LNAI Vol. 3607, pages 332-346, Springer-Verlag, 2005. |
| |
Keywords: | |
本文献已被 SpringerLink 等数据库收录! |
|