共查询到20条相似文献,搜索用时 15 毫秒
1.
2.
James Bailey Guozhu Dong Mukesh Mohania X. Sean Wang 《Distributed and Parallel Databases》1998,6(3):287-309
The incremental view maintenance problem deals with the efficient updating of materialized views in response to updates to base relations. This paper considers the problem in a distributed database environment, with communication cost minimization as the primary objective. The views considered are defined based on the relational join operation. The approach is to use yes/no tags as auxiliary data on tuples in the base relations to indicate whether the tuples participate in joins. These tags will help avoid sending irrelevant data over the network and thus reduce the communication cost. Two basic view maintenance algorithms are proposed using the tags. In addition to reducing communication costs, an important feature of these two basic algorithms is that they derive the exact change to views without looking at the old views. This feature allows us to maintain certain aggregates on views without actually materializing the views themselves; this feature is useful in applications such as active databases where many conditions or constraints must be tested whenever updates occur, since a condition is true exactly when some corresponding view has nonzero number of tuples. The paper then combines the use of tags with the counting algorithm to derive a tagged counting algorithm that further reduces the communication cost. The paper illustrates the algorithms by examples and studies their performance via a statistical analysis. The illustrating examples and the performance analysis show that, under uniform distribution with reasonable join participation rates, the use of tags significantly improves the efficiency of view maintenance over similar algorithms without tags. The performance analysis also identifies the situations where a particular algorithm is superior to others. The use of tags for memoing values of subexpressions in a view definition is also explored in the paper. 相似文献
3.
数据仓库中的信息是由各个独立分布的数据源的数据汇集而成,数据仓库中实化视图的联机维护是数据仓库系统维护的一项关键技术。本文通过例子说明了视图维护中会产生的数据不一致性问题,并详细分析了基于补偿、基于维护库和基于版本控制的三种视图维护方法。 相似文献
4.
当前数据库并行处理已经引起了很大的关注,而利用主动触发规则导出数据库数据(如视图)的研究也很多见,但在物化视图增量保持领域这方面的研究还不多见,尤其是利用并行处理能力来解决物化视图增量保持的多触发问题的研究。文章提出了并行环境下物化视图增量保持的主动多触发规则处理机制。 相似文献
5.
数据仓库中实体化视图的重计算问题实际上就是由视图的结构发生变化而引起的。对基本关系的每个可能的Schema模式变化,必须对视图重计算而得到新视图,这种重计算过程是需要付出代价的,为了使视图的重计算代价最小化,不应该对新视图中的所有数据都重新计算一遍,而应该通过一定的算法保留旧视图中的数据,只通过重计算而获取新数据,这样就可以使视图的重计算代价最小化。 相似文献
6.
7.
为了加快对大量数据的查询处理速度,通常在数据仓库以实视图方式存储数据,当基础数据发生变化时,这些实视图也必须随着更新,因而视图自维护和一致性维护成为数据仓库的重要问题。本文提出利用视图计算的中间结果创建辅助视图,在数据仓库中进行实体化,采用有效的增量维护算法计算实视图的精确变化,实现数据仓库视图自维护。 相似文献
8.
首先给出了一种面向对象的实视图模型———对象视图模型,可以在数据仓库中描述复杂对象,并有利于前端工具OLAP的实现。然后在此基础上提出了对象视图模型增量维护算法———OMVIMA,该算法能够有效地实现从数据源到数据仓库中对象(数据)的增量抽取和加载等维护工作。最后给出了算法的性能和效率分析,并通过应用实例证明了该算法比基于纯关系实视图的实现方法具有更高的效率。 相似文献
9.
View materialization is an important way of improving the performance of query
processing. When an update occurs to the source data from which a materialized view is
derived, the materialized view has to be updated so that it is consistent with the source
data. This update process is called view maintenance. The incremental method of view
maintenance, which computes the new view using the old view and the update to the
source data, is widely preferred to full view recomputation when the update is small in
size. In this paper we investigate how to incrementally maintain views in object-relational
(OR) databases. The investigation focuses on maintaining views defined in OR-SQL, a
language containing the features of object referencing, inheritance, collection, and aggregate
functions including user-defined set aggregate functions. We propose an architecture and
algorithms for incremental OR viewmaintenance. We implement all algorithms and analyze
the performance of them in comparison with full view recomputation. The analysis shows
that the algorithms significantly reduce the cost of updating a vieww hen the size of an
update to the source data is relatively small.
Received 23 May 2000 / Revised 27 March 2001 / Accepted in revised form 30 April 2001
Correspondence and offprint requests to: Jixue Liu, School of Computer and Information Science, University of South Australia, Mawson Lakes, Adelaide SA5084, Australia.
Email: jixue.liu@unisa.edu.auau 相似文献
10.
Designing data warehouses 总被引:9,自引:0,他引:9
A Data Warehouse (DW) is a database that collects and stores data from multiple remote and heterogeneous information sources. When a query is posed, it is evaluated locally, without accessing the original information sources. In this paper we deal with the issue of designing a DW, in the context of the relational model, by selecting a set of views to materialize in the DW. First, we briefly present a theoretical framework for the DW design problem, which concerns the selection of a set of views that (a) fit in the space allocated to the DW, (b) answer all the queries of interest, and (c) minimize the total query evaluation and view maintenance cost. We then formalize the DW design problem as a state space search problem by taking into account multiquery optimization over the maintenance queries (i.e., queries that compute changes to the materialized views) and the use of auxiliary views for reducing the view maintenance cost. Finally, incremental algorithms and heuristics for pruning the search space are presented. 相似文献
11.
Incremental maintenance of data warehouses has attracted a lot of research attention for the past few years. Nevertheless, most of the previous work is confined to the relational setting. Recently, object-oriented data warehouses have been regarded as a better means to integrate data from modern heterogeneous data sources. However, existing approaches to incremental maintenance of data warehouses do not directly apply to object-oriented data warehouses. In this paper, therefore, we propose an approach to incremental maintenance of object-oriented data warehouses. We focus on two primary issues specifically. First, we identify six categories of potential updates to an object-oriented view and propose an algorithm to find potential updates from the definition of the view. Second, we propose an incremental view maintenance algorithm for maintaining object-oriented data warehouses. We have implemented a prototype system for incremental maintenance of object-oriented data warehouses. Performance evaluation has been conducted, which indicates that our approach is correct and efficient. 相似文献
12.
13.
14.
Schema Evolution in Data Warehouses 总被引:2,自引:0,他引:2
Zohra Bellahsene 《Knowledge and Information Systems》2002,4(3):283-304
In this paper, we address the issues related to the evolution and maintenance of data warehousing systems, when underlying
data sources change their schema capabilities. These changes can invalidate views at the data warehousing system. We present
an approach for dynamically adapting views according to schema changes arising on source relations. This type of maintenance
concerns both the schema and the data of the data warehouse. The main issue is to avoid the view recomputation from scratch
especially when views are defined from multiple sources. The data of the data warehouse is used primarily in organizational
decision-making and may be strategic. Therefore, the schema of the data warehouse can evolve for modeling new requirements
resulting from analysis or data-mining processing. Our approach provides means to support schema evolution of the data warehouse
independently of the data sources.
Received 20 March 2000 / Revised 5 January 2001 / Accepted in revised form 20 April 2001 相似文献
15.
For the materialized views in the fast LAN or computing grid environment, it is a very important problem that how to refresh them efficiently when data sources have changed. In this paper, we take the update frequencies and the size of source relations into account and present a partition strategy and an efficient algorithm by creating auxiliary views. Our algorithm may decrease the cost of join operation and communication on network as low aspossible. 相似文献
16.
提出数据仓库动态增量维护算法和模型.文中阐述了动态增量维护算法、模型以及利用该算法对数据仓库视图的维护技术,并以基于网络的数据仓库为例,描述了动态增量维护算法在数据仓库系统中的实现技术.本算法与技术对数据仓库技术的发展及应用有着重要的理论意义和实用价值。 相似文献
17.
实化视图维护是指在数据源的原始数据发生改变时,有效地将这种变化反映到数据仓库中,使相应的实化视图得到及时更新.当前的视图维护方法主要以C/S结构为基础,当更新频繁时将会导致数据仓库超载而崩溃.针对这种现状,提出基于维护查询任务进行分解的实化视图并行增量维护P3Sweep算法,算法修正了Sweep算法只能顺序处理的限制,赋予其并行处理能力.对于单个更新,P3Sweep算法通过对维护查询任务进行分解,并行执行左右扫描过程来完成维护演算;对于并发更新,算法通过递归分解方式进行细粒度并行演算.性能计算和实验测试结果表明,该方法增加了系统对更新的吞吐能力,减少了维护的延迟,从维护查询任务本身实施对实化视图维护的优化. 相似文献
18.
View selection for designing the global data warehouse 总被引:1,自引:0,他引:1
A global data warehouse (DW) integrates data from multiple distributed heterogeneous databases and other information sources. A global DW can be abstractly seen as a set of materialized views. The selection of views for materialization in a DW is an important decision in the design of a DW. Current commercial products do not provide tools for automatic DW design. We provide a general method that, given a set of select-project-join queries to be satisfied by the DW, generates sets of materialized views that satisfy all the input queries. This process is complex since ‘common subexpressions' between the queries need to be detected and exploited. Our method is then applied to solve the problem of selecting such a materialized view set that fits in the space allocated to the DW for materialization and minimizes the combined overall query evaluation and view maintenance cost. We design algorithms which are implemented and we report on their experimental evaluation. 相似文献
19.
数据仓库中实体化视图的一致性维护问题的研究 总被引:2,自引:0,他引:2
实现实体化视图是提高系统响应时间的一个关键技术和有效的解决方案,但在具体的实现过程中面临着实体化视图的一致性维护问题。由于源数据的不稳定性,其结构和数据的变化必须及时传播到实体化视图中,以保持实体化视图与源数据的变化一致性,否则会降低实体化视图中数据的新鲜度,并影响OLAP查询结果的真实性和有效性。为此本文针对视图的一致性维护问题,介绍了视图中数据的维护方法,同时还就实体化视图结构的维护,提出了使视图重计算代价最小化的解决方案和基本算法。 相似文献
20.
A warehouse is a data repository containing integrated information for efficient querying and analysis. Maintaining the consistency of warehouse data is challenging, especially if the data sources are autonomous and views of the data at the warehouse span multiple sources. Transactions containing multiple updates at one or more sources, e.g., batch updates, complicate the consistency problem. In this paper we identify and discuss three fundamental transaction processing scenarios for data warehousing. We define four levels of consistency for warehouse data and present a new family of algorithms, the Strobe family, that maintain consistency as the warehouse is updated, under the various warehousing scenarios. All of the algorithms are incremental and can handle a continuous and overlapping stream of updates from the sources. Our implementation shows that the algorithms are practical and realistic choices for a wide variety of update scenarios. 相似文献