1.

Heterogeneous computing scheduling with evolutionary algorithms

Sergio Nesmachnow Héctor Cancela Enrique Alba 《Soft Computing - A Fusion of Foundations, Methodologies and Applications》2010,15(4):685-701

This work presents sequential and parallel evolutionary algorithms (EAs) applied to the scheduling problem in heterogeneous computing environments, an NP-hard problem of great relevance in distributed computing. These methods have been specifically designed to provide accurate and efficient solutions by using simple operators that allow them to be later extended for solving realistic problem instances arising in distributed heterogeneous computing (HC) and grid systems. The EAs were implemented on top of MALLBA, a general-purpose library for combinatorial optimization. Efficient numerical results are reported in the experimental analysis performed on well-known problem instances. The comparative study of scheduling methods shows that the parallel versions of the implemented evolutionary algorithms achieve high problem-solving efficacy, outperforming traditional scheduling heuristics and also improving on results previously reported in the related literature.
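For context on the kind of baseline such studies compare against, here is a minimal sketch of Min-Min, a classic list heuristic for heterogeneous scheduling. The abstract does not say which traditional heuristics were used, so Min-Min is chosen here purely as an illustration. The `etc` matrix (expected time to compute each task on each machine) and the function name are assumptions.

```python
def min_min(etc):
    """Min-Min heuristic (illustrative baseline, not the paper's EA).

    etc[t][m] is the assumed expected time to compute task t on machine m.
    Returns (schedule, makespan), where schedule maps task -> machine.
    """
    num_machines = len(etc[0])
    ready = [0.0] * num_machines          # time at which each machine becomes free
    unscheduled = set(range(len(etc)))
    schedule = {}
    while unscheduled:
        # Pick the (task, machine) pair with the smallest completion time.
        # Ties are broken by task index, then machine index.
        finish, task, machine = min(
            (ready[m] + etc[t][m], t, m)
            for t in unscheduled
            for m in range(num_machines)
        )
        schedule[task] = machine
        ready[machine] = finish
        unscheduled.remove(task)
    return schedule, max(ready)
```

The makespan returned by a heuristic like this is the usual yardstick against which a metaheuristic's schedules are measured.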

2.

An improved parallel algorithm for mining association rules

佘春东孙世新范植华胡世祥车著明《小型微型计算机系统》2004,25(12):2139-2142

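Discovering association rules is an important task in data mining. This paper briefly reviews several sequential and parallel algorithms for discovering association rules. For IDD and HD, two algorithms with relatively good efficiency and scalability, it introduces the online LPT scheduling algorithm. This effectively solves the key problem in IDD and HD of partitioning the candidate itemsets among the processor nodes, balancing the load across nodes as far as possible and thereby improving the efficiency of the algorithms.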
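The load-balancing idea behind LPT (longest processing time first) can be sketched as follows. This is the classical offline greedy rule, not the paper's online variant, whose details are not given in the abstract. The item ids, the cost estimates, and the function name are hypothetical.

```python
import heapq

def lpt_partition(weights, num_nodes):
    """Greedy LPT: assign each item, heaviest first, to the least-loaded node.

    weights maps a (hypothetical) candidate-itemset id to an estimated cost.
    Returns (assignment, loads); assignment[node] lists the items on that node.
    """
    heap = [(0, node) for node in range(num_nodes)]   # (current load, node id)
    heapq.heapify(heap)
    assignment = [[] for _ in range(num_nodes)]
    # Sort by descending weight; ties are broken by item id for determinism.
    for item, w in sorted(weights.items(), key=lambda kv: (-kv[1], kv[0])):
        load, node = heapq.heappop(heap)              # least-loaded node so far
        assignment[node].append(item)
        heapq.heappush(heap, (load + w, node))
    loads = [0] * num_nodes
    for load, node in heap:
        loads[node] = load
    return assignment, loads
```

Placing heavy items first leaves the small ones to even out the remaining imbalance, which is why the rule tends to give well-balanced partitions.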

3.

PARALLEL ALGORITHMS TO COMPUTE THE EIGENVALUES AND EIGENVECTORS OF SYMMETRIC TOEPLITZ MATRICES

《International Journal of Parallel, Emergent and Distributed Systems》2012,27(1):75-93

Abstract

In this paper we present two parallel versions of the bisection method to compute the spectrum of symmetric Toeplitz matrices. Both parallel algorithms have been implemented and analysed on a virtual shared memory multiprocessor using a portable message-passing environment. The algorithms parallelize the sequential method very efficiently, and applying a dynamic strategy to distribute the computations produces better results than a static method. We also improve the performance of the original sequential algorithm by applying Newton's method for the final approximation of the eigenvalues. However, the poor performance of the original sequential algorithm means that speedups are low when the parallel methods are compared with the best available sequential algorithm.
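To illustrate the bisection idea, here is a minimal sketch restricted to the tridiagonal symmetric Toeplitz case (diagonal `a`, off-diagonal `b`), where the eigenvalue count below a point follows from the signs of the LDL^T pivots. This restriction is an assumption made to keep the example short. The paper treats general symmetric Toeplitz matrices, and its counting procedure is not reproduced here. The function names are hypothetical.

```python
import math

def count_below(a, b, n, x, tiny=1e-30):
    """Count eigenvalues below x for the n x n matrix tridiag(b, a, b).

    Uses the signs of the pivots of the LDL^T factorization of (T - x*I).
    """
    count = 0
    d = 1.0
    for i in range(n):
        coupling = b * b if i > 0 else 0.0
        d = (a - x) - coupling / d
        if abs(d) < tiny:      # guard against a zero pivot
            d = -tiny
        if d < 0.0:
            count += 1
    return count

def kth_eigenvalue(a, b, n, k, tol=1e-12):
    """Bisection for the k-th smallest eigenvalue (k is 1-based)."""
    lo = a - 2.0 * abs(b) - 1.0    # safely below the Gershgorin interval
    hi = a + 2.0 * abs(b) + 1.0    # safely above it
    for _ in range(200):           # hard cap so the loop always terminates
        if hi - lo <= tol:
            break
        mid = 0.5 * (lo + hi)
        if count_below(a, b, n, mid) >= k:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)
```

In this sketch each index `k` is searched independently of the others, so the loop over `k` is a natural unit of work to hand out to processors, either statically or from a shared queue.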

4.

On the use of diagnostic dependence-analysis tools in parallel programming: Experiences using PTOOL

Leslie A. Henderson Robert E. Hiromoto Olaf M. Lubeck Margaret L. Simmons 《The Journal of supercomputing》1990,4(1):83-96

Although considerable technology has been developed for debugging and developing sequential programs, producing verifiably correct parallel code is a much harder task. In view of the large number of possible scheduling sequences, exhaustive testing is not a feasible method for determining whether a given parallel program is correct; nor have there been sufficient theoretical developments to allow the automatic verification of parallel programs. PTOOL, a tool being developed at Rice University in collaboration with users at Los Alamos National Laboratory, provides an alternative mechanism for producing correct parallel code. PTOOL is a semi-automatic tool for detecting implicit parallelism in sequential Fortran code. It uses vectorizing compiler techniques to identify dependences preventing the parallelization of sequential regions. According to the model supported by PTOOL, a programmer should first implement and test his program using traditional sequential debugging techniques. Then, using PTOOL, he can select loop bodies that can be safely executed in parallel. At Los Alamos, we have been interested in examining the role of dependence-analysis tools in the parallel programming process. Therefore, we have used PTOOL as a static debugging tool to analyze parallel Fortran programs. Our experiences using PTOOL lead us to conclude that dependence-analysis tools are useful to today's parallel programmers. Dependence-analysis is particularly useful in the development of asynchronous parallel code. With a tool like PTOOL, a programmer can guarantee that processor scheduling cannot affect the results of his parallel program. If a programmer wishes to implement a partially parallelized region through the use of synchronization primitives, however, he will find that dependence analysis is less useful. 
While a dependence-analysis tool can greatly simplify the task of writing synchronization code, the ultimate responsibility for correctness is left to the programmer. This work was performed under the auspices of the U.S. Department of Energy.

5.

Parallel and sequential transformations on digital images

Masafumi Yamashita 《Pattern recognition》1985,18(1):31-41

In this paper, the relation between parallel and sequential algorithms is discussed. We regard algorithms as definitions of transformations and investigate the relation between the sets of transformations defined by parallel and sequential algorithms. Three main problems are treated. The problems and their results may be summarized as follows. (1) Characterization of transformations which are both parallel and sequential: a necessary and sufficient condition for a transformation to be both parallel and sequential has been established. (2) Equivalence problems: the equivalence problem for two algorithms, one of which is parallel, is decidable, whereas the equivalence problem for two sequential algorithms is undecidable. That is, an algorithm for deciding whether or not two given algorithms, one of which is parallel, define the same transformation has been presented, but we have shown that there is no algorithm for deciding whether or not two given sequential algorithms define the same transformation. (3) Translation problems: an algorithm for translating a parallel (sequential) algorithm into an equivalent sequential (parallel) algorithm has been presented.

6.

Sequential and parallel cellular automata-based scheduling algorithms

Seredynski F. Zomaya A.Y. 《Parallel and Distributed Systems, IEEE Transactions on》2002,13(10):1009-1023

We present an approach to designing cellular automata-based multiprocessor scheduling algorithms in which knowledge about the scheduling process is extracted. We consider the simplest case, in which the multiprocessor system is limited to two processors. To design cellular automata corresponding to a given program graph, we propose a generic definition of program graph neighborhood, transparent to the various kinds, sizes, and shapes of program graphs. The cellular automata-based scheduler works in two modes: learning mode and operation mode. Discovered rules are typically suitable for sequential cellular automata working as a scheduler, while the most interesting and promising feature of cellular automata is their massive parallelism. To overcome difficulties in evolving parallel cellular automata rules, we propose using a coevolutionary genetic algorithm. Rules discovered this way enable us to design effective parallel schedulers. We present a number of experimental results for both sequential and parallel scheduling algorithms discovered in the context of a cellular automata-based scheduling system.

7.

A parallel algorithm of simulated annealing for multiprocessor scheduling

A. V. Kalashnikov V. A. Kostenko 《Journal of Computer and Systems Sciences International》2008,47(3):455-463

Simulated annealing algorithms for multiprocessor scheduling problems are considered, and an approach to their parallelization is proposed. Results are presented comparing the classical sequential algorithm with sequential and parallel simulated annealing algorithms that partition the initial solution space into regions.
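As a point of reference, a classical sequential simulated annealing loop for a simplified scheduling problem might look like the sketch below. It assumes independent tasks on identical processors with makespan as the cost, which is a simplification chosen for brevity. The paper's problem model, its region partitioning, and its parallel scheme are not reproduced. All names and parameter values are hypothetical.

```python
import math
import random

def makespan(assign, durations, num_procs):
    """Largest total load over all processors for a task -> processor list."""
    loads = [0.0] * num_procs
    for task, proc in enumerate(assign):
        loads[proc] += durations[task]
    return max(loads)

def anneal(durations, num_procs, steps=5000, t0=10.0, cooling=0.999, seed=0):
    """Classical sequential simulated annealing over task assignments."""
    rng = random.Random(seed)
    current = [rng.randrange(num_procs) for _ in durations]
    cur_cost = makespan(current, durations, num_procs)
    best, best_cost = current[:], cur_cost
    temp = t0
    for _ in range(steps):
        # Neighbor move: reassign one randomly chosen task.
        task = rng.randrange(len(durations))
        old = current[task]
        current[task] = rng.randrange(num_procs)
        new_cost = makespan(current, durations, num_procs)
        delta = new_cost - cur_cost
        # Accept improvements always, and worse moves with Boltzmann probability.
        if delta <= 0 or rng.random() < math.exp(-delta / temp):
            cur_cost = new_cost
            if cur_cost < best_cost:
                best, best_cost = current[:], cur_cost
        else:
            current[task] = old    # reject the move and restore the state
        temp *= cooling            # geometric cooling schedule
    return best, best_cost
```

Several such runs started in disjoint parts of the solution space are independent of one another, which is what makes this family of methods amenable to parallelization.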

8.

A new algorithm for scheduling Fork-Join task graphs (cited by: 16 total; 1 self-citation; 16 by others)

刘振英方滨兴姜誉张毅赵宏《软件学报》2002,13(4):693-697

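Task scheduling is one of the key factors affecting the efficiency of networks of workstations. Fork-Join task graphs can represent many parallel structures, but existing algorithms for scheduling them ignore the fact that communications cannot proceed in parallel in a network of workstations that is not fully connected, and some efficient algorithms do not consider economizing on the number of processors. Therefore, specifically for this kind of task graph, a task-duplication-based static scheduling algorithm, TSA_FJ, is proposed that jointly considers schedule length, non-parallel communication, and the number of processors used. Multiple Fork-Join task graphs were generated with random task execution times and communication times, and were scheduled with TSA_FJ and with other scheduling algorithms. The results show that …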

9.

Design and implementation of a simulation tool for static parallel task scheduling

许海燕吴兰陟何立卢伟《计算机工程》2005,31(6):90-91,122

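This paper presents TSAS, a simulator comprising 12 static parallel task scheduling algorithms, and VTGG, a visual task graph generator, and reports performance data for the various task scheduling algorithms obtained with these tools.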

10.

Block scheduling of iterative algorithms and graph-level priority scheduling in a simulated data-flow multiprocessor

Evripidou P. Gaudiot J.-L. 《Parallel and Distributed Systems, IEEE Transactions on》1993,4(4):398-413

Iterative methods for solving linear systems are discussed. Although these methods are inherently highly sequential, it is shown that much parallelism could be exploited in a data-flow system by scheduling the iterative part of the algorithms in blocks and by looking ahead across several iterations. This approach is general and will apply to other iterative and loop-based problems. It is also demonstrated by simulation that relying solely on data-driven scheduling of parallel and unrolled loops results in low resource utilization and poor performance. A graph-level priority scheduling mechanism has been developed that greatly improves resource utilization and yields higher performance.

11.

Irregular Personalized Communication on Distributed Memory Machines

《Journal of Parallel and Distributed Computing》1995,25(1):58-71

In this paper, we present several algorithms for performing all-to-many personalized communication on distributed memory parallel machines. We assume that each processor sends a different message (of potentially different size) to a subset of all the processors involved in the collective communication. The algorithms are based on decomposing the communication matrix into a set of partial permutations. We study the effectiveness of our algorithms from both the view of static scheduling and runtime scheduling.
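The decomposition idea can be sketched with a simple greedy rule: in each phase every processor sends at most one message and receives at most one message, so each phase is a partial permutation. This greedy rule is an illustrative assumption, not one of the paper's algorithms. The message representation and the function name are hypothetical.

```python
def decompose(messages):
    """Split a communication pattern into phases that are partial permutations.

    messages maps (src, dst) -> message size. Larger messages are placed
    first. Returns a list of phases, each a list of (src, dst) pairs.
    """
    remaining = sorted(messages, key=lambda m: (-messages[m], m))
    phases = []
    while remaining:
        senders, receivers = set(), set()
        phase, deferred = [], []
        for src, dst in remaining:
            if src in senders or dst in receivers:
                deferred.append((src, dst))   # conflicts; try a later phase
            else:
                phase.append((src, dst))
                senders.add(src)
                receivers.add(dst)
        phases.append(phase)
        remaining = deferred
    return phases
```

Within a phase no two messages share a sender or a receiver, so no node has to handle more than one send and one receive at a time.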

12.

An optimized scheduling algorithm for parallel task graphs

李于锋莫则尧肖永浩熊敏《计算机工程与科学》2019,41(6):955-962

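Many complex application problems in scientific and engineering computing require scientific workflow technology. Scientific workflows in supercomputing are often modeled as parallel task graphs, and scheduling these graphs effectively is important for efficient application execution. This paper presents a scheduling model for parallel task graphs under resource constraints, gives several optimal scheduling results for Fork-Join parallel task graphs, and proposes a new scheduling algorithm for general parallel task graphs. The algorithm accounts for the effect of data communication overhead on resource allocation and scheduling performance, and improves on the existing CPA algorithm in specific cases. Experimental comparison with the commonly used CPR and CPA algorithms confirms that the new algorithm achieves good scheduling results. The proposed scheduling algorithm and the optimal scheduling results can inform the development of high-performance scheduling functions in workflow application systems.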

13.

Experiences with a parallel algorithm for data flow analysis

Yong-Fong Lee Barbara G. Ryder Thomas J. Marlowe 《The Journal of supercomputing》1991,5(2-3):163-188

We have designed a family of parallel data flow analysis algorithms for execution on distributed-memory MIMD machines, based on general-purpose, hybrid algorithms for data flow analysis [Marlowe and Ryder 1990]. We exploit a natural partitioning of the hybrid algorithms and explore a static mapping, dynamic scheduling strategy. Alternative mapping-scheduling choices and refinements of the flow graph condensation used are discussed. Our parallel hybrid algorithm family is illustrated on Reaching Definitions, although parallel algorithms also exist for many interprocedural (e.g., Aliasing) and intraprocedural (e.g., Available Expressions) problems [Marlowe 1989]. We have implemented the parallel hybrid algorithm for Reaching Definitions on an Intel iPSC/2. Our empirical results suggest the practicality of parallel hybrid algorithms. An earlier version of this paper was presented at Supercomputing '90. The research reported here was supported, in part, by the New Jersey Commission on Science and Technology and the CAIP Center's Industrial Members, by Siemens Research Corporation and by National Science Foundation grant CCR-8920078.

14.

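A parallel data flow approach to optimizing and processing parallel database queries (cited by: 1 total; 0 self-citations; 1 by others)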

李建中《软件学报》1998,9(3):174-180

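This paper studies methods for optimizing and processing parallel database queries using parallel data flow techniques, proposes a complete set of related algorithms, and gives a complete design of a parallel database query optimizer and processor based on the parallel data flow approach. These algorithms and the corresponding query optimizer and processor have been used in a prototype parallel database management system designed by the author. Practice shows that the parallel data flow approach not only enables a parallel database management system to be implemented quickly and effectively, but also supports effective optimization and processing of parallel database queries.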

15.

Efficient parallel hierarchical clustering algorithms (cited by: 3 total; 0 self-citations; 3 by others)

Rajasekaran S. 《Parallel and Distributed Systems, IEEE Transactions on》2005,16(6):497-502

Clustering of data has numerous applications and has been studied extensively. Though most of the algorithms in the literature are sequential, many parallel algorithms have also been designed. In this paper, we present parallel algorithms with better performance than known algorithms. We consider algorithms that work well in the worst case as well as algorithms with good expected performance.

16.

Downlink data transmission scheduling algorithms in wireless networks

Keqin Li 《Simulation Modelling Practice and Theory》2011,19(6):1427-1444

The problem of downlink data transmission scheduling in wireless networks is studied. It is pointed out that every downlink data transmission scheduling algorithm must have two components to solve the two subproblems of power assignment and transmission scheduling. Two types of downlink data transmission scheduling algorithms are proposed. In the first type, power assignment is performed before transmission scheduling. In the second type, power assignment is performed after transmission scheduling. The performance of two algorithms of the first type, which use the equal power allocation method, is analyzed. It is shown that both algorithms exhibit excellent worst-case performance and asymptotically optimal average-case performance under the condition that the total transmission power is equally allocated to the channels. In general, both algorithms exhibit excellent average-case performance. It is demonstrated that two algorithms of the second type perform better than the two algorithms of the first type due to the equal time power allocation method. Furthermore, the performance of our algorithms is very close to the optimal, and the room for further performance improvement is very limited. It is shown that all the above algorithms can be extended to schedule downlink data transmissions with parallel channels. It is also shown that the simple sequential scheduling algorithm is optimal if the total transmission power is equally allocated to the channels. As an extra contribution, an M/G/1 queueing model for the FCFS queueing discipline is established, and it is observed that increasing the number of channels has more impact on the reduction of the average response time than increasing the total transmission power.
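For readers unfamiliar with the M/G/1 model mentioned above, the standard Pollaczek-Khinchine result gives the mean response time under FCFS. The symbols below are assumptions introduced here, not the paper's notation: $\lambda$ is the arrival rate, $S$ the service time, $\rho$ the utilization, $W_q$ the mean waiting time in the queue, and $T$ the mean response time.

```latex
\rho = \lambda\,\mathbb{E}[S] < 1, \qquad
W_q = \frac{\lambda\,\mathbb{E}[S^2]}{2\,(1-\rho)}, \qquad
T = \mathbb{E}[S] + W_q
```

Any change that shortens service times lowers $T$ through both terms, since it reduces $\mathbb{E}[S]$ directly and also reduces $\mathbb{E}[S^2]$ and $\rho$ in the waiting-time term.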

17.

A general virtual channel scheduling algorithm based on dynamic windows

饶爱水李永刚周锦标汪毅李清梅《计算机测量与控制》2019,27(6):147-152

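To address the general design problem of virtual channel scheduling algorithms, six algorithms were designed: two fully synchronous algorithms (exclusive round-robin and sequential round-robin), two fully asynchronous algorithms (preemptive priority and non-preemptive priority), and two hybrid synchronous/asynchronous algorithms (exclusive hybrid and sequential hybrid). On this basis, a general virtual channel scheduling algorithm based on dynamic windows was implemented. The general algorithm uses a two-layer scheduling model and supports eight scheduling policies through parameter configuration. Practice shows that the general algorithm meets the fixed time-slot requirements of synchronous data, adapts to the dynamic adjustment requirements of asynchronous data, and satisfies the timely transmission requirements of emergency data. Window boundaries and channel boundaries can be adjusted dynamically, which reduces wasted channel resources and gives the algorithm broad generality and good adaptability.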

18.

A toolkit for parallel functional programming

Pieter H. Hartel Rutger F. H. Hofman Koen G. Langendoen Henk L. Muller Willem G. Vree L. O. Hertzberger 《Concurrency and Computation》1995,7(8):765-793

Our toolkit for the design and implementation of parallel functional programs supports the stepwise development of parallel programs from a high level sequential specification to an optimised parallel implementation. The toolkit is used as follows:

(1) The algorithm to be implemented is specified in a functional language. The program is debugged and tested using an interpreter.
(2) The program is compiled for a sequential machine. Its performance is analysed and improved.
(3) Annotation-driven transformations are applied to the program to indicate parallel tasks. Simulations at task level, basic block level and bus transaction level make it possible to analyse the parallel performance of the program at three levels of detail.
(4) When the performance is optimised using the simulators, the program is executed on a genuine parallel machine.

Several programs have been developed with the toolkit. A program that simulates tidal flow in an estuary of the North Sea is presented as a case study to demonstrate the merits of the toolkit when developing complex parallel programs. The toolkit not only supports the design of parallel applications, it also allows the study of important concepts in parallel computer architecture. These include the behaviour of cached memory systems, bus protocols, scheduling algorithms and memory management algorithms.

19.

Amit Kumar Bhardwaj Yuvraj Gajpal Chirag Surti Sukhpal Singh Gill 《Software》2020,50(12):2231-2251

Cloud computing is becoming a profitable technology because it offers cost-effective IT solutions globally. A well-designed task scheduling algorithm ensures the optimal utilization of cloud resources and reduces execution time dynamically. This research article deals with the task scheduling of inter-dependent subtasks on unrelated parallel computing machines in a cloud computing environment. This article considers two variants of the problem, based on two different objective functions. The first variant minimizes the total completion time, while the second variant minimizes the makespan. Heuristic and meta-heuristic (HEART) based algorithms are proposed to solve the task scheduling problems. These algorithms utilize the properties of the list scheduling algorithm for the unrelated parallel machine scheduling problem. A mixed integer linear programming (MILP) formulation is provided for the two variants of the problem. The optimal solution is obtained by solving the MILP formulation using A Mathematical Programming Language (AMPL) software. Extensive numerical experiments have been performed to evaluate the performance of the proposed algorithms. The solutions obtained by the proposed algorithms are found to outperform those of the existing algorithms. The proposed algorithms can be used by cloud computing service providers (CCSPs) to enhance their resource utilization and reduce their operating cost.

20.

Unified parallel encoding and decoding algorithms for Dandelion-like codes

Saverio Caminiti Rossella Petreschi 《Journal of Parallel and Distributed Computing》2010

The Dandelion-like codes are eight bijections between labeled trees and strings of node labels. The literature contains optimal sequential algorithms for these bijections, but no parallel algorithms have been reported. In this paper the first parallel encoding and decoding algorithms for Dandelion-like codes are presented. Namely, a single encoding algorithm and a single decoding algorithm are designed which, when properly parameterized, can be used for all Dandelion-like codes. These algorithms are optimal in the sequential setting. The encoding algorithm implementation on an EREW PRAM is optimal, while the efficient implementation of the decoding algorithm requires concurrent reading.