共查询到20条相似文献,搜索用时 0 毫秒
1.
Alberto Herrn Jos M. Colmenar Rafael Martí Abraham Duarte 《International Transactions in Operational Research》2020,27(1):336-360
The obnoxious p‐median problem consists of selecting p locations, considered facilities, in a way that the sum of the distances from each nonfacility location, called customers, to its nearest facility is maximized. This is an ‐hard problem that can be formulated as an integer linear program. In this paper, we propose the application of a variable neighborhood search (VNS) method to effectively tackle this problem. First, we develop new and fast local search procedures to be integrated into the basic VNS methodology. Then, some parameters of the algorithm are tuned in order to improve its performance. The best VNS variant is parallelized and compared with the best previous methods, namely branch and cut, tabu search, and GRASP over a wide set of instances. Experimental results show that the proposed VNS outperforms previous methods in the state of the art. This fact is finally confirmed by conducting nonparametric statistical tests. 相似文献
2.
3.
Azzedine Boukerche Alba Cristina Magalhaes Alves de Melo Mauricio Ayala-Rincón Maria Emilia Machado Telles Walter 《Journal of Parallel and Distributed Computing》2007
Recently, many organisms have had their DNA entirely sequenced. This reality presents the need for comparing long DNA sequences, which is a challenging task due to its high demands for computational power and memory. Sequence comparison is a basic operation in DNA sequencing projects, and most sequence comparison methods currently in use are based on heuristics, which are faster but offer no guarantees of producing the best alignments possible. In order to alleviate this problem, Smith–Waterman proposed an algorithm. This algorithm obtains the best local alignments but at the expense of very high computing power and huge memory requirements. In this article, we present and evaluate our experiments involving three strategies to run the Smith–Waterman algorithm in a cluster of workstations using a Distributed Shared Memory System. Our results on an eight-machine cluster presented very good speed-up and indicate that impressive improvements can be achieved depending on the strategy used. In addition, we present a number of theoretical remarks concerning how to reduce the amount of memory used. 相似文献
4.
5.
6.
Backtracking branch-and-prune (BP) algorithms and their variants are exhaustive search tree techniques widely employed to solve optimization problems in many scientific areas. However, they characteristically often demand significant amounts of computing power for problem sizes representative of real-world scenarios. Given that their search domains can often be partitioned, these algorithms are frequently designed to execute in parallel by harnessing distributed computing systems. However, to achieve efficient parallel execution times, an effective strategy is required to balance the nonuniform partition workloads across the available resources. Furthermore, with the increasing integration of servers with heterogeneous resources and the adoption of resource sharing, balancing workloads is becoming complex. This paper proposes a strategy to execute parallel BP algorithms more efficiently on even shared or heterogeneous distributed systems. The approach integrates a self-adjusting dynamic partitioning method in the BP algorithm with a dynamic scheduler, provided by an application middleware, which manages the parallel execution while addressing any issues of imbalance. Empirical results indicate better scalability with efficiencies above 90% for instances of an application case study for the discretizable molecular distance geometry problem (DMDGP). Improvements of up to 38% were obtained in execution speed-ups compared to a more traditional parallel BP implementation for DMDGP. 相似文献
7.
The problem of optimization of communications during the execution of a program on a parallel computer with distributed memory
is investigated. Statements are formulated that make it possible to determine the possibility of organization of data broadcast
and translation. The conditions proposed are represented in the form suitable for practical application and can be used for
automated parallelization of programs.
This work was done within the framework of the State Program of Fundamental Studies of the Republic of Belarus (under the
code name “Mathematical structures 21”) with the partial support of the Foundation for Fundamental Studies of the Republic
of Belarus (grant F03-062).
__________
Translated from Kibernetika i Sistemnyi Analiz, No. 2, pp. 166–182, March–April 2006. 相似文献
8.
开发并行程序要比开发单机串行程序更难。PVM开发环境是应用比较广的环境之一,适合于开发粗粒度的工程科学计算并行程序,而这些工程计算问题一般是一些数值计算问题的集合。编写这些数值计算并行程序有一定的难度和复杂度,并且现在没有很好支持开发PVM并行程序的成熟开发环境。针对这个问题,构造一个基于PVM的并行程序开发环境。开发环境包括一个并行算法库和一个嵌入到Visual Studio的可视化程序开发插件。通过开发平台进行并行程序开发将更加简单、高效。 相似文献
9.
《Optimization methods & software》2012,27(6):1231-1250
ABSTRACTWe propose a novel algorithm portfolio model that incorporates time series forecasting techniques to predict online the performance of its constituent algorithms. The predictions are used to allocate computational resources to the algorithms, accordingly. The proposed model is demonstrated on parallel algorithm portfolios consisting of three popular metaheuristics, namely tabu search, variable neighbourhood search, and multistart local search. Moving average and exponential smoothing techniques are employed for forecasting purposes. A challenging combinatorial problem, namely the detection of circulant weighing matrices, is selected as the testbed for the analysis of the proposed approach. Experimental evidence and statistical analysis provide insight on the performance of the proposed algorithms and reveal the benefits of using forecasting techniques for resource allocation in algorithm portfolios. 相似文献
10.
In this paper we present deterministic parallel algorithms for the coarse-grained multicomputer (CGM) and bulk synchronous parallel (BSP) models for solving the following well-known graph problems: (1) list ranking, (2) Euler tour construction in a tree,
(3) computing the connected components and spanning forest, (4) lowest common ancestor preprocessing, (5) tree contraction
and expression tree evaluation, (6) computing an ear decomposition or open ear decomposition, and (7) 2-edge connectivity
and biconnectivity (testing and component computation). The algorithms require O(log p) communication rounds with linear sequential work per round (p = no. processors, N = total input size). Each processor creates, during the entire algorithm, messages of total size O(log (p) (N/p)) .
The algorithms assume that the local memory per processor (i.e., N/p ) is larger than p
ε
, for some fixed ε > 0 . Our results imply BSP algorithms with O(log p) supersteps, O(g log (p) (N/p)) communication time, and O(log (p) (N/p)) local computation time.
It is important to observe that the number of communication rounds/ supersteps obtained in this paper is independent of the problem size, and grows only logarithmically with respect to p . With growing problem size, only the sizes of the messages grow but the total number of messages remains unchanged. Due
to the considerable protocol overhead associated with each message transmission, this is an important property. The result
for Problem (1) is a considerable improvement over those previously reported. The algorithms for Problems (2)—(7) are the
first practically relevant parallel algorithms for these standard graph problems.
Received July 5, 2000; revised April 16, 2001. 相似文献
11.
本文首先介绍了计算几何的基本概念,论述了计算几何的四个基本问题,即几何搜索问题、相交问题、邻接问题及凸壳问题。然后重点分析了凸壳构造问题,介绍了其最佳串行算法、及相应的并行算法。接着对一些计算几何的串行及并行算法进行了分析比较。最后提出了笔者对新一代并行计算机系统上设计计算几何并行算法的看法。 相似文献
12.
Paola Festa Mauricio G. C. Resende 《International Transactions in Operational Research》2009,16(1):1-24
A greedy randomized adaptive search procedure (GRASP) is a metaheuristic for combinatorial optimization. It is a multi-start or iterative process, in which each GRASP iteration consists of two phases, a construction phase, in which a feasible solution is produced, and a local search phase, in which a local optimum in the neighborhood of the constructed solution is sought. Since 1989, numerous papers on the basic aspects of GRASP, as well as enhancements to the basic metaheuristic have appeared in the literature. GRASP has been applied to a wide range of combinatorial optimization problems, ranging from scheduling and routing to drawing and turbine balancing. This is the first of two papers with an annotated bibliography of the GRASP literature from 1989 to 2008. This paper covers algorithmic aspects of GRASP. 相似文献
13.
《International Journal of Parallel, Emergent and Distributed Systems》2012,27(1-2):105-109
We consider the special case of the two functions coarsest partitioning problem, where one of the functions is a cyclic permutation and the other arbitrary. Here we present a parallel algorithm on a CRCW PRAM that solves the above partitioning problem in O(α(n)log(β(n))) time using O(n) processors, where n is the set cardinality, β(n) is the number of distinct prime factors of n, and α(n) is the sum of the exponents of the primes in the factorization of n. In almost all cases the algorithm runs in O(loglognlogloglogn) time with n processors. 相似文献
14.
Scheduling is a capital problem when using distributed heterogeneous computing (HC) and grid environments to solve complex problems. The scheduling problem in heterogeneous environments is NP‐hard, so a significant effort has been made to develop efficient methods for solving the problem. However, few works have faced realistic grid‐sized problem instances. This work presents a parallel CHC (pCHC) evolutionary algorithm codified over MALLBA, a general‐purpose library for combinatorial optimization, for solving the scheduling problem in HC and grid environments. Efficient numerical results are reported in the experimental analysis performed on both a standard benchmark and a set of large‐sized problem instances specially designed in this work. The comparative study shows that pCHC is able to achieve high problem solving efficacy, significantly improving over traditional deterministic scheduling methods, while also showing a good scalability behavior when solving large problem instances. 相似文献
15.
16.
采取 CPU 分发图像滤波任务和回收滤波结果、将多个图像数据划分分配给多个 GPU 及其线程块、GPU 调用核函数库对图像进行傅里叶变换和反傅里叶变换的方法,设计实现了 CPU 和 GPU 协同计算的多图像同态滤波并行算法。实验结果表明,给出的多图像同态滤波并行算法高效,与单 GPU 计算的并行算法相比,多 GPU 协同计算的并行算法显著缩短了多个图像同态滤波处理所需的时间。 相似文献
17.
《International Journal of Parallel, Emergent and Distributed Systems》2012,27(4):303-316
In this work, a parallel scheduling algorithm for scheduling a set of n partially ordered tasks on an m-processor parallel computing system is studied. The method is based on a conventional list scheduling, in particular, an earliest-task-first approach. Otherwise, the algorithm developed is an original algorithm. It is designed for a hypercube type system and is tested on a Transputer based environment, The time complexity of the algorithm is O(n(log n + log m)). The parallel scheduling algorithm produces the same schdules as its sequential counterpart whose complexity is O(mn 2(log n + log m)), but in a shorter time. 相似文献
18.
19.
在当前并行计算环境中的通信网络中,MPICH-1并行系统可以使用Internet和Myrinet千兆位包交换网络,而MPICH-2并行系统只能使用Internet,由于通信时间的限制而影响了整个系统性能.对MPICH-1和MPICH-2中的作业提交模式进行了研究,给出了一种在MPICH-2中使用Myrinet网络来提交作业的应用,从而达到减少了通信时间的目的. 相似文献
20.
网络计算环境下并行算法及其可扩放性分析 总被引:4,自引:2,他引:4
并行算法的可扩放性是提其有效利用计算节点的能力,它可以预测算法在处理机数目变化时的性能,在网络环境下用PVM实现了并行矩阵乘法及PSRS算法,分析了在网络计算环境下这两个算法的可扩放性,并利用试验数据进行了验证。 相似文献