期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Scheduling precedence constrained task graphs with non-negligibleintertask communication onto multiprocessors

Selvakumar S. Siva Ram Murthy C. 《Parallel and Distributed Systems, IEEE Transactions on》1994,5(3):328-336

The multiprocessor scheduling problem is the problem of scheduling the tasks of a precedence constrained task graph (representing a parallel program) onto the processors of a multiprocessor in a way that minimizes the completion time. Since this problem is known to be NP-hard in the strong sense in all but a few very restricted eases, heuristic algorithms are being developed which obtain near optimal schedules in a reasonable amount of computation time. We present an efficient heuristic algorithm for scheduling precedence constrained task graphs with nonnegligible intertask communication onto multiprocessors taking contention in the communication channels into consideration. Our algorithm for obtaining satisfactory suboptimal schedules is based on the classical list scheduling strategy. It simultaneously exploits the schedule-holes generated in the processors and in the communication channels during the scheduling process in order to produce better schedules. We demonstrate the effectiveness of our algorithm by comparing with two competing heuristic algorithms available in the literature 相似文献

2.

基于通信竞争的Fork-Join任务图的调度算法

张建军杨峰瞿勇《计算机工程与设计》2009,30(23)

Fork-Join任务图是一种并行处理的基本结构,目前已有的Fork-Join任务图的调度算法大多没有考虑实际应用中通信链路的竞争及延迟以及节省处理机的问题,导致算法在具体应用中效率较低.因此,针对Fork-Join任务图,提出一个基于通信竞争的贪心调度算法,该算法具有高的加速比和总体效率,时间复杂度为O(vlogv),其中v表示任务集中任务的个数.实验结果表明,该算法相比其它算法具有较短的调度长度、较短的完成时间,使用的处理机数较少,具有更强的实用性. 相似文献

3.

A self-stabilizing algorithm for optimally efficient sets in graphs

Sandra M. Hedetniemi Stephen T. Hedetniemi Hao Jiang K.E. Kennedy Alice A. McRae 《Information Processing Letters》2012,112(16):621-623

相似文献

4.

Scheduling multiple task graphs with end-to-end deadlines in distributed real-time systems utilizing imprecise computations

Georgios L. Stavrinides^{Author Vitae} Helen D. Karatza Author Vitae 《Journal of Systems and Software》2010,83(6):1004-1014

In order to meet the inherent need of real-time applications for high quality results within strict timing constraints, the employment of effective scheduling techniques is crucial in distributed real-time systems. In this paper, we evaluate by simulation the performance of strategies for the dynamic scheduling of composite jobs in a homogeneous distributed real-time system. Each job that arrives in the system is a directed acyclic graph of component tasks and has an end-to-end deadline. For each scheduling policy, we provide an alternative version which allows imprecise computations, taking into account the effects of input error on the processing time of the component tasks of a job. The simulation results show that the alternative versions of the algorithms outperform their respective counterparts. To our knowledge, an imprecise computations approach for the dynamic scheduling of multiple task graphs with end-to-end deadlines and input error has never been discussed in the literature before. 相似文献

5.

Scheduling multiple task graphs in heterogeneous distributed real-time systems by exploiting schedule holes with bin packing techniques

Georgios L. Stavrinides Helen D. Karatza 《Simulation Modelling Practice and Theory》2011,19(1):540-552

The most crucial aspect of distributed real-time systems is the scheduling algorithm, which must guarantee that every job in the system will meet its deadline. In this paper, we evaluate by simulation the performance of strategies for the dynamic scheduling of composite jobs in a heterogeneous distributed real-time system. Each job that arrives in the system is a directed acyclic graph of component tasks and has an end-to-end deadline. For each scheduling policy, we provide alternative versions which allow the insertion of tasks into idle time slots, using various bin packing techniques. The comparison study, based on different workloads and system heterogeneity levels, shows that the alternative versions of the algorithms outperform their respective counterparts. 相似文献

6.

Searching graphs with A*: applications to job sequencing

Sen A.K. Bagchi A. Ramaswamy R. 《IEEE transactions on systems, man, and cybernetics. Part A, Systems and humans : a publication of the IEEE Systems, Man, and Cybernetics Society》1996,26(1):168-173

The best-first search algorithm A* allows search graphs that are trees, directed acyclic graphs or directed graphs with cycles. In real life applications of A* the search graph is generally implemented as a tree. It is shown here that for certain well known one-machine job sequencing problems that arise in job shops, graph search is much faster than best-first tree search when problem instances are of small and medium size. Moreover, graph search uses less memory and so is able to solve larger problems. Depth-first search needs little memory, and is therefore capable in principle of solving problems of arbitrary size, but is slower than graph search by orders of magnitude for the examples that were studied 相似文献

7.

Scheduling of pipelined operator graphs

Hans L. Bodlaender Petra Schuurman Gerhard J. Woeginger 《Journal of Scheduling》2012,15(3):323-332

We investigate a class of scheduling problems that arise in the optimization of SQL queries for parallel machines (Chekuri et al. in PODS??95, pp.?255?C265, 1995). In these problems, an undirected graph is used to represent communication and inter-operator parallelism. The goal is to minimize the global response time of the system. We provide a polynomial time approximation scheme for the special cases where the operator graph is a tree, thereby improving on a polynomial time 2.87-approximation algorithm by Chekuri et al. The approximation scheme is generalized to the case where the operator graph has treewidth bounded by a constant. We analyze instances with small response times for general operator graphs: Deciding whether a response time of four time units can be reached is easy, but deciding whether a response time of six time units can be reached is NP-hard. Finally, we prove that for general operator graphs the existence of a polynomial time approximation algorithm with worst case performance guarantee better than 4/3 would imply P=NP. 相似文献

8.

Scheduling two agent task chains with a central selection mechanism

Alessandro Agnetis Gaia Nicosia Andrea Pacifici Ulrich Pferschy 《Journal of Scheduling》2015,18(3):243-261

相似文献

9.

Scheduling data-flow graphs via retiming and unfolding

Liang-Fang Chao Hsing-Mean Sha E. 《Parallel and Distributed Systems, IEEE Transactions on》1997,8(12):1259-1267

Loop scheduling is an important problem in parallel processing. The retiming technique reorganizes an iteration; the unfolding technique schedules several iterations together. We combine these two techniques to obtain a static schedule with a reduced average computation time per iteration. We first prove that the order of retiming and unfolding is immaterial for scheduling a data-flow graph (DFG). From this nice property, we present a polynomial-time algorithm on the original DFG, before unfolding, to find the minimum-rate static schedule for a given unfolding factor. For the case of a unit-time DFG, efficient checking and retiming algorithms are presented 相似文献

10.

调度Fork-Join任务图的贪心算法 总被引：1，自引：2，他引：1

YANG Bin 张建军 YANG Feng 《计算机工程与设计》2008,29(15)

任务调度算法的目标是把组成并行程序的一组任务分配到多个处理器以使得程序的完成时间最短,这是一个NP完全问题.虽然许多算法在任务满足某些条件时能产生最优调度,但大多都忽略了节省处理器个数和最小化程序总的完成时间等问题.Fork-Join结构是一种并行处理的基本结构.因此,专门针对Fork-Join任务图,提出了一个能产生最优调度的新的贪心调度算法,该算法具有高的加速比和总体效率,时间复杂度为O(v2),其中,v表示任务集中任务的个数.实验结果表明,相比其它算法,该算法具有较短的调度长度、较短的完成时间,使用的处理器数较少. 相似文献

11.

A mutual exclusion algorithm with optimally bounded bypasses

K. Alagarsamy 《Information Processing Letters》2005,96(1):36-40

Peterson's algorithm [G.L. Peterson, Myths about the mutual exclusion problem, Inform. Process. Lett. 12 (3) (1981) 115-116] for mutual exclusion has been widely studied for its elegance and simplicity. In Peterson's algorithm, each process has to cross n−1 stages to access the shared resource irrespective of the contention for the shared resource at that time, and allows unbounded bypasses. In [K. Block, T.-K. Woo, A more efficient generalization of Peterson's mutual exclusion algorithm, Inform. Process. Lett. 35 (1990) 219-222], Block and Woo proposed a modified algorithm that transforms the number stages to be crossed from fixed n−1 to t, where 1?t?n, and bounds the number of possible bypasses by n(n−1)/2. This paper proposes a simple modification that reduces the bound on the number of possible bypasses to optimal n−1. 相似文献

12.

Allocating task interaction graphs to processors in heterogeneousnetworks

Chi-Chung Hui Chanson S.T. 《Parallel and Distributed Systems, IEEE Transactions on》1997,8(9):908-925

The problem of allocating task interaction graphs (TIGs) to heterogeneous computing systems to minimize job completion time is investigated. The only restriction is that the interprocessor communication cost is the same for any pair of processors. This is suitable for local area network based systems, such as Ethernet, as well as fully interconnected multiprocessor systems. An optimal polynomial solution exists if sufficient homogeneous processors and communication capacity are available. This solution is generalized to obtain two faster heuristics, one for the case of homogeneous processors and the other for heterogeneous processors. The heuristics were tested extensively with 60,900 systematically generated random TIGs and shown to be stable independent of the size of the TIG. A performance model is also proposed to predict the performance of the heuristic algorithms, and it is successful in explaining the experimental results qualitatively 相似文献

13.

基于QoS的任务分类调度算法

徐洪智张彬连覃遵跃《计算机应用》2008,28(Z2)

分析了QoS Guided Min-Min及其改进算法,指出了这些算法调度不均衡的缺点.在此基础上设计出基于QoS的任务分类调度算法.先对特殊任务使用Min-Min方法进行调度,然后采用贪心策略对一般任务进行调度,转移或交换最大负载和最小负载机器上的一般任务,使各机器上的任务快速达到均衡.实验表明该算法在保证应用QoS的前提下,相对于以往算法负载分布更加均衡,任务总的完成时间更短. 相似文献

14.

Scheduling DAG-based applications in multicluster environments with background workload using task duplication

《国际计算机数学杂志》2012,89(11):2387-2397

Grids or multicluster computing environments are becoming increasingly popular to both scientific and commercial applications. Process scheduling remains a central issue to be effectively resolved in order to exploit the full potential that the grid or multicluster environment can offer. We use a directed acyclic graph (DAG) to model a process or an application where the nodes of the DAG represent the tasks of the process. Prior to the execution of a process in a multicluster environment, the tasks are required to be mapped onto the clusters. In this article, it is shown that the algorithm developed by He et al. [L. He, S.A. Jarvis, D.P. Spooner, D. Bacigalupo, G. Tan, and G.R. Nudd, Mapping DAG-based applications to multiclusters with background workload, Proceedings of the 2005 IEEE International Symposium on Cluster Computing and the Grid, Cardiff, 2005, pp 855–862.] for the multicluster DAG mapping problem can be significantly improved by incorporating the task duplication strategy. The proposed process scheduling algorithm has a time complexity O(| V|²(r+d+1)), where |V| represents the number of tasks; r, the number of clusters; and d, the maximum in-degree of tasks. 相似文献

15.

On cluster resource allocation for multiple parallel task graphs 总被引：1，自引：0，他引：1

Henri Casanova Frédéric Desprez Frédéric Suter 《Journal of Parallel and Distributed Computing》2010

Many scientific applications can be structured as parallel task graphs (PTGs), that is, graphs of data-parallel tasks. Adding data parallelism to a task-parallel application provides opportunities for higher performance and scalability, but poses additional scheduling challenges. In this paper, we study the off-line scheduling of multiple PTGs on a single, homogeneous cluster. The objective is to optimize performance without compromising fairness among the PTGs. We consider the range of previously proposed scheduling algorithms applicable to this problem, from both the applied and the theoretical literature, and we propose minor improvements when possible. Our main contribution is an extensive evaluation of these algorithms in simulation, using both synthetic and real-world application configurations, using two different metrics for performance and one metric for fairness. We identify a handful of algorithms that provide good trade-offs when considering all these metrics. The best algorithm overall is one that structures the schedule as a sequence of phases of increasing duration based on a makespan guarantee produced by an approximation algorithm. 相似文献

16.

Generating synthetic task graphs for simulating stream computing systems

Deepak Ajwani Shoukat Ali Kostas Katrinis Cheng-Hong Li Alfred J. Park John P. Morrison Eugen Schenfeld 《Journal of Parallel and Distributed Computing》2013

Stream-computing is an emerging computational model for performing complex operations on and across multi-source, high-volume data flows. The pool of mature publicly available applications employing this model is fairly small, and therefore the availability of workloads for various types of applications is scarce. Thus, there is a need for synthetic generation of large-scale workloads to drive simulations and estimate the performance of stream-computing applications at scale. We identify the key properties shared by most task graphs of stream-computing applications and use them to extend known random graph generation concepts with stream computing specific features, providing researchers with realistic input stream graphs. Our graph generation techniques serve the purpose of covering a disparity of potential applications and user input. Our first “domain-specific” framework exhibits high user-controlled configurability while the second “application-agnostic” framework focuses solely on emulating the key properties of general stream-computing systems, at the loss of domain-specific fine-tuning. 相似文献

17.

Scheduling of computational processes in real-time distributed systems with uncertain task execution times

N. V. Kolesov M. V. Tolmacheva P. V. Yukhta 《Journal of Computer and Systems Sciences International》2012,51(5):636-647

An approach to scheduling computational processes in real-time distributed computing systems is considered. It is assumed that the task execution time is inexactly; more precisely, it is assumed to belog to a certain time interval. The problem is formulated as the scheduling of jobs of which each is characterized by its priority and consists of a set of tasks (with respect to the number of processors) executing on different processors and associated by a hierarchical precedence relationship. The proposed approach is based on algorithms with low computational complexity for suboptimal scheduling of equal-priority tasks. 相似文献

18.

VxWorks下基于多任务调度的分析和研究 总被引：3，自引：0，他引：3

李慧李涛杨占华万玮《微机发展》2005,15(6):30-32

VxWorks操作系统是一个功能强大、而且独立于处理器的实时操作系统,它具有真正微内核的相当小的层次结构。稳定、可靠、高性能的内核以及友好的用户开发环境等优点使得VxWorks被广泛应用于高精尖以及实时性要求极高的领域。文中在分析VxWorks内核的多任务调度以及相应的调度算法的基础上,提出了基于优先级的多任务资源共享问题的解决方案,并介绍了一个解决优先级倒置的方案实例。相似文献

19.

VxWorks下基于多任务调度的分析和研究

李慧李涛杨占华万玮《计算机技术与发展》2005,15(6)

VxWorks操作系统是一个功能强大、而且独立于处理器的实时操作系统,它具有真正微内核的相当小的层次结构.稳定、可靠、高性能的内核以及友好的用户开发环境等优点使得VxWorks被广泛应用于高精尖以及实时性要求极高的领域.文中在分析VxWorks内核的多任务调度以及相应的调度算法的基础上,提出了基于优先级的多任务资源共享问题的解决方案,并介绍了一个解决优先级倒置的方案实例. 相似文献

20.

Cello: A Disk Scheduling Framework for Next Generation Operating Systems*

Shenoy Prashant Vin Harrick M. 《Real-Time Systems》2002,22(1-2):9-48

In this paper, we present the Cello disk scheduling framework for meeting the diverse service requirements of applications. Cello employs a two-level disk scheduling architecture, consisting of a class-independent scheduler and a set of class-specific schedulers. The two levels of the framework allocate disk bandwidth at two time-scales: the class-independent scheduler governs the coarse-grain allocation of bandwidth to application classes, while the class-specific schedulers control the fine-grain interleaving of requests. The two levels of the architecture separate application-independent mechanisms from application-specific scheduling policies, and thereby facilitate the co-existence of multiple class-specific schedulers. We demonstrate that Cello is suitable for next generation operating systems since: (i) it aligns the service provided with the application requirements, (ii) it protects application classes from one another, (iii) it is work-conserving and can adapt to changes in work-load, (iv) it minimizes the seek time and rotational latency overhead incurred during access, and (v) it is computationally efficient. 相似文献