共查询到20条相似文献,搜索用时 46 毫秒
1.
如何排列多连接表达式的连接顺序,是优化并行查询和提高数据库性能的关键技术之一.但多连接最优顺序是NPC问题。本文章研究了数据库多连接的特征,提出一种结合启发式和穷举搜索的Ghme算法,结合启发式和穷举搜索处理多连接查询,并从实验和理论证明其有效性。 相似文献
2.
褚龙现 《计算机技术与发展》2012,(10)
查询优化是分布式数据库研究和关注的核心问题之一,为提高分布式数据库的查询效率,通过分析分布式数据库管理系统的特点,找出了影响分布式数据库管理系统查询效率的关键因素,并讨论了直接连接查询的常见策略和半连接查询的原理、实现方法以及所花费的传输代价,同时针对分布式数据库应用中存在大量重复查询操作的特点,提出了一种改进的半连接查询策略.最后结合应用实例对比传统半连接算法与改进后的算法,表明改进后的算法降低了数据传输量,缩短了查询处理时间,提高了查询效率. 相似文献
3.
一种分布式数据库多元连接查询优化算法及改进 总被引:18,自引:1,他引:18
论文对于分布式数据库中的多元连接查询提出了一种优化算法。该算法产生的操作序列具有全局优化特征。在远程网中,采用最小生成树算法,生成最佳的连接序对能使得总的代价达到最小。而在局域网中,采用改进的最小生成树算法,生成尽可能多的并行连接序对,最大程度地应用并行性,使得响应时间最小。 相似文献
4.
多连接查询优化是提高数据库性能的关键问题之一。Chiang Lee提出了一种启发式多连接查询优化算法MVP,分析发现该算法并没有考虑减小执行计划的计算代价。该文结合哈希过滤的特点提出一种改进的多连接查询优化算法,与MVP算法相比该算法降低了执行计划的计算代从,从而使查询响应时间更短。 相似文献
5.
6.
查询操作是数据库中最常用的操作,由于分布式数据库的数据分布性和冗余性,使得查询优化处理成为分布式数据库研究的核心问题之一。为了提高分布式数据库查询效率,分析讨论了基于直接连接的常见执行策略和查询优化算法,同时针对分布式数据库应用中多表连接时存在多连接属性,提出一种改进的直接连接查询优化策略。改进后的算法提高了查询执行的并行性,缩短了查询处理时间,提高了查询效率。 相似文献
7.
首先对分布式数据库查询执行代价模型进行分析,然后对直接连接中的连接运算的方法、连接关系的传输方法和执行场地等问题进行研究,并计算所有评估方法的执行代价,从中选择出最小执行代价的执行方法,最终确定了执行的场地、连接的方法和传输方法. 相似文献
8.
9.
分布式数据库中多元连接查询优化的研究 总被引:1,自引:0,他引:1
论文对分布式数据库中多元连接查询操作次序的确定问题提出了优化,通过引入收益代价比的概念,提出了一基于贪心算法的选择模型。通过该模型,可以得到理想的连接次序的选取方案。 相似文献
10.
11.
Dataflow query execution in a parallel main-memory environment 总被引:2,自引:0,他引:2
In this paper, the performance and characteristics of the execution of various join-trees on a parallel DBMS are studied. The results of this study are a step into the direction of the design of a query optimization strategy that is fit for parallel execution of complex queries.Among others, synchronization issues are identified to limit the performance gain from parallelism. A new hash-join algorithm is introduced that has fewer synchronization constraints than the known hash-join algorithms. Also, the behavior of individual join operations in a join-tree is studied in a simulation experiment. The results show that the introduced Pipelining hash-join algorithm yields a better performance for multi-join queries. The format of the optimal join-tree appears to depend on the size of the operands of the join: A multi-join between small operands performs best with a bushy schedule; larger operands are better off with a linear schedule. The results from the simulation study are confirmed with an analytic model for dataflow query execution. 相似文献
12.
本文提出了一种基于LBT(Linear-Bushy-Tree)树的查询优化方法,它是对基于浓密树(Bushy-Tree)查询优化方法的一种改进。这种优化方法大大地缩减了查询执行计划空间,确保了并行查询执行计划的优化性。这种优化方法已经在我们自主研制的并行数据库管理系统PDBMS^[1,4]中得到实现。 相似文献
13.
本文分析了关系型数据库的查询优化问题,针对多连接查询提出将遗传算法和爬山法结合,从而构造了关系型数据库多连接查询优化问题的混合遗传算法,并进行了实验计算。结果表明,用混合遗传算法解决多连接查询优化问题,可以发挥遗传算法和爬山法的不同优势,从而得到较满意的查询优化性能。 相似文献
14.
15.
基于机群系统的数据存储分布是并行数据库领域的一个重要问题。已有的研究工作多集中在基于单个关系的存储分布,不能有效支持复杂多连接查询处理。文章提出了多个关系整体分布方法,给出分布属性选择和处理机分配算法。实验结果表明,算法具有良好的性能,有助于提高并行多连接查询效率。 相似文献
16.
Many applications often require finding sets of entities of interest that meet certain constraints. Such set-based queries (SQs) can be broadly classified into two types: optimization SQs that involve some optimization constraint and enumerative SQs that do not have any optimization constraint. While there has been much research on the evaluation of optimization SQs, there is very little work on the evaluation of enumerative SQs, which represent the most fundamental fragment of set-based queries. In this paper, we address the problem of evaluating enumerative SQs using RDBMS. While enumerative SQs can be expressed using SQL, existing relational engines, unfortunately, are not able to efficiently evaluate such queries due to their complexity. In this paper, we propose a novel evaluation approach for enumerative SQs. Our experimental results on PostgreSQL demonstrate that our proposed approach outperforms the conventional approach by up to three orders of magnitude. 相似文献
17.
18.
Parallel database systems will very probably be the future for high-performance data-intensive applications. In the past decade, many parallel database systems have been developed, together with many languages and approaches to specify operations in these systems. A common background is still missing, however. This paper proposes an extended relational algebra for this purpose, based on the well-known standard relational algebra. The extended algebra provides both complete database manipulation language features, and data distribution and process allocation primitives to describe parallelism. It is defined in terms of multi-sets of tuples to allow handling of duplicates and to obtain a close connection to the world of high-performance data processing. Due to its algebraic nature, the language is well suited for optimization and parallelization through expression rewriting. The proposed language can be used as a database manipulation language on its own, as has been done in the PRISMA parallel database project, or as a formal basis for other languages, like SQL.Recommended by: Patrick Valduriez 相似文献
19.
基于多重加权树的并行数据库查询优化方法 总被引:1,自引:0,他引:1
本文提出了一种基于多重加权树的查询优化方法,包括多重加权树并行查询计划模型、并行查询计划的复杂性模型和查询优化处工法。 相似文献
20.
Optimization of parallel query execution plans in XPRS 总被引:1,自引:0,他引:1
In this paper, we describe our approach to optimization of query execution plans in XPRS, a multiuser parallel database system based on a shared memory multiprocessor and a disk array. The main difficulties in this optimization problem are the compile-time unknown parameters such as available buffer size and number of free processors, and the enormous search space of possible parallel plans. We deal with these problems with a novel two phase optimization strategy which dramatically reduces the search space and allows run time parameters without significantly compromising plan optimality. In this paper we present our two phase optimization strategy and give experimental evidence from XPRS benchmarks that indicate that it almost always produces optimal or close to optimal plans. 相似文献