Similar Literature
20 similar documents retrieved.
1.
The network of workstations (NOW) has become a widely accepted platform for high-performance parallel computing. As in conventional multicomputers, parallel programs running on such a platform are often written in SPMD form to exploit data parallelism. Each workstation in a NOW is treated much like a processing element in a multicomputer system, yet workstations are far more powerful and flexible than those processing elements. In this paper, we discuss how the workstations in a NOW can be used to exploit more parallelism in an SPMD program, especially parallelism induced by concurrent activities.

2.
A. S. Grimshaw, Computer, 1993, 26(5):39-51
Mentat, an object-oriented parallel processing system designed to directly address the difficulty of developing architecture-independent parallel programs, is discussed. The Mentat system consists of two components: the Mentat programming language and the Mentat runtime system. The Mentat programming language, which is based on C++, is described. Performance results from implementing the Mentat runtime system on a network of Sun 3 and 4 workstations, the Silicon Graphics Iris, the Intel iPSC/2, and the Intel iPSC/860 are presented.

3.
Coordinating Parallel Processes on Networks of Workstations
The network of workstations (NOW) we consider for scheduling is heterogeneous and nondedicated: computing power varies among the workstations, and local and parallel jobs may interact with each other during execution. An effective NOW scheduling scheme needs sufficient information about system heterogeneity and job interactions. We use the measured power weight of each workstation to quantify the differences in computing capability across the system. Without a processing-power usage agreement between parallel jobs and local user jobs on a workstation, job interactions are unpredictable and the performance of neither type of job can be guaranteed. Using this quantified, deterministic system information, we design a scheduling scheme called self-coordinated local scheduling for a heterogeneous NOW. Based on a power usage agreement between local and parallel jobs, the scheme coordinates parallel processes independently on each workstation following the coscheduling principle. We discuss its implementation on Unix System V Release 4 (SVR4). Our simulation results on a heterogeneous NOW show the effectiveness of the self-coordinated local scheduling scheme.
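The abstract does not give the power-weight formula; the sketch below (Python) only illustrates the idea, under the assumption that weights are normalized measured speeds and that each workstation grants parallel work an agreed CPU share. The host names, numbers, and functions are hypothetical, not the paper's implementation.

```python
# Illustrative sketch (not the paper's code): quantify heterogeneity with
# normalized "power weights" and split an SPMD workload accordingly,
# assuming each workstation grants parallel jobs an agreed CPU share.

measured_speed = {"ws1": 120.0, "ws2": 60.0, "ws3": 90.0}  # e.g. benchmark scores (hypothetical numbers)
parallel_share = {"ws1": 0.5, "ws2": 0.8, "ws3": 0.6}      # CPU fraction granted to parallel jobs by the agreement

total = sum(measured_speed.values())
power_weight = {w: s / total for w, s in measured_speed.items()}

def partition_work(total_units):
    """Assign work units proportionally to effective power (weight x agreed share)."""
    effective = {w: power_weight[w] * parallel_share[w] for w in power_weight}
    norm = sum(effective.values())
    return {w: total_units * effective[w] / norm for w in effective}

print(partition_work(10_000))
```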

4.
Today's requirements for computational power are still not satisfied. One answer to this demand is expensive supercomputers; another is to pool the computational power available in a network, with connected workstations cooperating on a parallel computation. Our approach to harvesting unused resources in workstation clusters enables dynamic distribution of computational load over the network. Each computer individually determines how much of its resources it shares as a server with others in the network. The architecture is based on the client/server model: clients divide potentially hard problems into subproblems and forward them to different servers in the network. Using standardized remote procedure calls (RPC) as the basic mechanism for transmitting data between the workstations allows the concept to be extended to heterogeneous environments. This article discusses our implementation for Windows NT and presents the latest benchmarks with two parallel applications.
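As a rough illustration of the client/server division described above (the original targets Windows NT with standardized RPC; this sketch substitutes Python's standard-library XML-RPC, and the host names and the `solve` procedure are hypothetical):

```python
# Sketch only: a client splits a problem into subproblems and forwards them
# to RPC servers on other workstations. Python's stdlib XML-RPC stands in for
# the Windows NT RPC mechanism described in the abstract.
import xmlrpc.client

SERVERS = ["http://ws1:8000", "http://ws2:8000"]   # hypothetical workstation endpoints

def solve_distributed(data):
    chunks = [data[i::len(SERVERS)] for i in range(len(SERVERS))]
    proxies = [xmlrpc.client.ServerProxy(url) for url in SERVERS]
    # Each server is assumed to export a 'solve' procedure for one subproblem;
    # calls are issued sequentially here, a real client would overlap them.
    partial = [p.solve(chunk) for p, chunk in zip(proxies, chunks)]
    return sum(partial, [])  # merge partial results (order handling omitted)

# Server side (run on each workstation):
#   from xmlrpc.server import SimpleXMLRPCServer
#   srv = SimpleXMLRPCServer(("0.0.0.0", 8000))
#   srv.register_function(lambda chunk: [x * x for x in chunk], "solve")
#   srv.serve_forever()
```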

5.
The low cost and availability of clusters of workstations have led researchers to re-explore distributed computing using independent workstations. This approach may provide better cost/performance than tightly coupled multiprocessors. In practice, it often uses otherwise wasted cycles to run parallel jobs. In this paper we address the feasibility and limitations of such a nondedicated parallel processing environment, assuming that workstation processes have priority over parallel tasks. We develop a simple analytical model to predict parallel job response times. The model provides insight into how significantly workstation owner interference degrades parallel program performance, and it forms a foundation for task partitioning and scheduling in a nondedicated network environment. We introduce a new term, the task ratio, which relates the parallel task demand to the mean service demand of nonparallel workstation processes, and propose it as a useful metric for deciding how a parallel application should be partitioned and scheduled to make efficient use of a nondedicated distributed system.
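The paper's analytical model is not reproduced in the abstract; the toy sketch below (Python) only illustrates the task-ratio metric itself and a crude partitioning rule built on it. The threshold and decision rule are assumptions for illustration, not the paper's model.

```python
# Toy illustration of the "task ratio": the parallel task's CPU demand relative
# to the mean service demand of local (owner) processes. The target ratio and
# the partitioning rule below are illustrative assumptions.

def task_ratio(parallel_task_demand, mean_local_demand):
    return parallel_task_demand / mean_local_demand

def suggest_partitioning(total_work, workstations, mean_local_demand, target_ratio=5.0):
    """Split work so each task stays coarse relative to typical owner jobs
    (to amortize interference), but never wider than the available machines."""
    min_task = target_ratio * mean_local_demand
    tasks = max(1, min(workstations, int(total_work // min_task)))
    return tasks, total_work / tasks

print(suggest_partitioning(total_work=3600.0, workstations=16, mean_local_demand=30.0))
```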

6.
Like many architectural techniques that originated with mainframes, the use of multiple processors in a single computer is becoming popular in workstations and even personal computers. Multiprocessors constitute a significant percentage of recent workstation sales, and highly affordable multiprocessor personal computers are available in local computer stores. Once again, we find ourselves in a familiar situation: hardware is ahead of software. Because of the complexity of parallel programming, multiprocessors today are rarely used to speed up individual applications. Instead, they usually function as cycle-servers that achieve increased system throughput by running multiple tasks simultaneously. Automatic parallelization by a compiler is a particularly attractive approach to software development for multiprocessors, as it enables ordinary sequential programs to take advantage of the multiprocessor hardware without user involvement. This article looks to the future by examining some of the latest research results in automatic parallelization technology.

7.
Research on Grid-Oriented Visualization Systems
Scientific visualization is a data- and compute-intensive application that has traditionally been handled with parallel processing. With improvements in commodity hardware and advances in networking, the mainstream platform for parallel visualization has shifted from high-end parallel machines to PC clusters and networked PCs, which increasingly obtain resources and provide services over the Internet. The grid, as an Internet-oriented form of distributed computing, extends the range of resources a visualization application can draw on and of services it can provide, giving rise to a new research topic in the visualization field: grid-oriented visualization. This paper surveys the research issues and current state of grid-oriented visualization and presents GVis, a Java-based prototype of a grid-oriented visualization system.

8.
Workstation clusters provide significant aggregate resources, including processing power and main memory. In this paper we explore the collective use of main memory in a workstation cluster to boost the performance of applications that require more memory than a single workstation can provide. We describe the design, simulation, implementation, and evaluation of a pager that uses the main memory of remote workstations in a cluster as a faster-than-disk paging device, provides reliability in the case of single workstation failures, and adapts to variations in network and disk load. Our pager has been implemented as a block device driver linked to the Digital UNIX operating system, without any modifications to the kernel code. Using several test applications we measure the performance of remote memory paging over an Ethernet interconnection network and find it to be up to twice as fast as traditional disk paging. We also evaluate the performance of various reliability policies and demonstrate their feasibility even over low-bandwidth networks such as Ethernet. We conclude that the benefits of reliable remote memory paging in workstation clusters are significant today and are likely to increase in the near future.

9.
The performance and proliferation of workstations continues to increase at a rapid rate. However, the practical utilization of workstation networks for parallel computing is still in its infancy. This is due to the relative immaturity of programming tools, low-bandwidth networks such as Ethernet, and high message latencies. However, programming tools are becoming more mature and network bandwidths are increasing rapidly. Hence, networks of commodity workstations may prove to be practical for certain classes of parallel applications. This paper describes our experiences with two applications parallelized on a network of Sun workstations. The first application is from Shell's petroleum engineering department. This program quantitatively derives rock and porefill composition from well-log data, using a compute-intensive iterative optimization procedure. The second application is time filtering, which is a fundamental operation performed on seismic traces. Through our experiments we identify the limits of networked parallel computing based on the current state of network technology. We also provide a discussion on the possible impact of future high speed networks on networked parallel computing.

10.
The authors have designed and implemented an open, market-based computational system called Spawn. The Spawn system utilizes idle computational resources in a distributed network of heterogeneous computer workstations. It supports both coarse-grain concurrent applications and the remote execution of many independent tasks. Using concurrent Monte Carlo simulations as prototypical applications, the authors explore issues of fairness in resource distribution, currency as a form of priority, price equilibria, the dynamics of transients, and scaling to large systems. In addition to serving the practical goal of harnessing idle processor time in a computer network, Spawn has proven to be a valuable experimental workbench for studying computational markets and their dynamics.

11.
Computers & Chemistry, 1996, 20(4):431-438
Sophisticated software packages put increasing demands on computer hardware. In local area networks, computationally intensive programs can lower the performance of individual workstations to an unacceptable level. However, utilizing the computing power of all hosts in such a network in a coarse-grained fashion offers the potential for considerable improvements in execution speed within reasonable cost limits. Since conventional workstations are not designed to be used in a parallel configuration, the program HYDRA was developed to control and synchronize parallel processing in a local area network. Part I of this paper focuses on the technical aspects of HYDRA, i.e. its configuration and implementation. The second and third parts describe two applications of the HYDRA package in the field of chemistry: using parallel genetic algorithms for the conformational analysis of nucleic acids, and parallel cross-validation of artificial neural networks.

12.
Parallel processing systems using networks of workstations are being used to provide an alternative to expensive parallel processors. Scheduling of tasks on these networks is an important and practical problem that must be addressed. Although CPU load is an important parameter to many of the proposed scheduling schemes, no quantitative analysis of CPU load and its precise relation to the run time of application programs has to date been presented. The work in this paper describes the experimental analysis of one common load measure, the UNIX load average, and its relationship to the run time of computation-bound parallel programs. Data was gathered using a test application program designed to mimic common applications, performing long bursts of computation with occasional interprocess data exchange over the network. The resulting execution times and measured load averages were then analyzed using regression analysis to detect load/run-time trends. This paper describes the test program and the experiments, then details the results of the data analysis. A technique is then presented for the evaluation of the load/run-time relationship for a computation-bound program on a network of workstations.
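A minimal sketch of the kind of regression the abstract describes, fitting run time against the UNIX load average (Python/NumPy; the measurements below are synthetic placeholders for illustration, not the paper's data):

```python
# Sketch of a load-average vs. run-time regression. The data points are
# synthetic placeholders, not measurements from the paper.
import numpy as np

load_avg = np.array([0.1, 0.5, 1.0, 1.5, 2.0, 3.0])        # 1-minute load averages (placeholder)
run_time = np.array([102., 118., 151., 187., 220., 290.])  # test-program run times in seconds (placeholder)

slope, intercept = np.polyfit(load_avg, run_time, deg=1)   # least-squares linear fit
predict = lambda load: intercept + slope * load

print(f"run_time ~ {intercept:.1f} + {slope:.1f} * load_avg")
print("predicted run time at load 2.5:", round(predict(2.5), 1), "s")
```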

13.
Mounir Hamdi, Yi Pan, B. Hamidzadeh, F. M. Lim, The Journal of Supercomputing, 1999, 13(2):111-132
Parallel computing on clusters of workstations is receiving much attention from the research community. Unfortunately, many aspects of parallel computing over this kind of computing engine are not well understood, including the workstation architectures, the network protocols, the communication-to-computation ratio, the load-balancing strategies, and the data-partitioning schemes. The aim of this paper is to assess the strengths and limitations of a cluster of workstations by capturing the effects of these issues. This is achieved by evaluating the performance of this computing environment in the execution of a parallel ray-tracing application through analytical modeling and extensive experimentation. We illustrate the effect of the major factors on the performance and scalability of a cluster of workstations connected by an Ethernet network. Moreover, our analytical model is accurate enough to agree closely with the experimental results. We therefore feel that such an investigation is helpful in understanding the strengths and weaknesses of an Ethernet cluster of workstations in the execution of parallel applications.
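The paper's analytical model is not given in the abstract; the sketch below uses a generic fixed-communication-overhead speedup model to illustrate why scalability flattens on a shared Ethernet. It is an illustration only, not the paper's model.

```python
# Generic speedup model in the spirit of the factors the abstract lists
# (communication-to-computation ratio, number of workstations), showing why
# speedup on a shared network saturates. Not the paper's analytical model.

def predicted_speedup(p, comm_to_comp):
    """p workstations; comm_to_comp = fraction of the sequential work that
    turns into serialized communication on the shared network."""
    t_seq = 1.0
    t_par = t_seq / p + comm_to_comp * t_seq   # computation shrinks with p, shared-network traffic does not
    return t_seq / t_par

for p in (2, 4, 8, 16, 32):
    print(p, round(predicted_speedup(p, comm_to_comp=0.05), 2))
```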

14.
C. R. Snow, H. Whitfield, Software, 1994, 24(5):437-447
This paper describes a simple authentication protocol for use with terminal emulation programs running on small workstations, and communicating with a host computer over a (potentially) insecure network. The technique has the advantage that it requires no more network traffic than would be necessary using a standard user-id/password challenge, and also requires only a small amount of additional intelligence at the workstation. Some management issues are also discussed. The system has been implemented on two host systems and two workstation types.

15.
A Processor Allocation Algorithm for Homogeneous Cluster Systems
The distributed computing environment of cluster systems brings new research and application problems to parallel processing and is becoming a hot topic in parallel computing. How parallel tasks are mapped onto the nodes of a cluster directly affects execution performance. This paper analyzes the execution-overhead factors that affect system efficiency and proposes a heuristic processor allocation algorithm.

16.
In a network of high-performance workstations, many workstations are underutilized by their owners. The problem of using these idle cycles to solve computationally intensive tasks by executing a large task on many workstations has been addressed before, and algorithms with O(N²) time and O(N) space for choosing the optimal subset of workstations out of N workstations were presented. We improve these algorithms to reduce the running time to O(N log N) while keeping the space requirement the same. The proposed algorithms are particularly useful for SPMD parallelism, where the computation is the same on all workstations and the data space is partitioned between them.
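The abstract states the complexity but not the algorithm. One common way such an O(N log N) bound arises is to sort workstations by speed and scan prefixes of the sorted list, as in the sketch below; the cost model (per-workstation startup overhead plus work split in proportion to speed) is an assumption for illustration, not necessarily the one used in the paper.

```python
# Illustrative O(N log N) subset selection for an SPMD job: sort workstations
# by speed, then scan prefixes of the sorted list and keep the prefix with the
# best predicted finish time under an assumed cost model.

def best_subset(speeds, work, overhead_per_ws):
    order = sorted(range(len(speeds)), key=lambda i: speeds[i], reverse=True)  # O(N log N)
    best_time, best_k, total_speed = float("inf"), 0, 0.0
    for k, i in enumerate(order, start=1):                                     # O(N) scan
        total_speed += speeds[i]
        t = work / total_speed + overhead_per_ws * k   # compute time + startup cost per extra workstation
        if t < best_time:
            best_time, best_k = t, k
    return order[:best_k], best_time

subset, t = best_subset(speeds=[10.0, 40.0, 25.0, 5.0, 30.0], work=1000.0, overhead_per_ws=1.5)
print(subset, round(t, 2))
```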

17.
Directive Extensions Suited to Cluster OpenMP Systems
OpenMP has become the programming standard for shared-memory architectures thanks to its ease of use and support for incremental parallelization. Cluster OpenMP systems implement an OpenMP computing environment on clusters, combining the programmability of OpenMP with the scalability of clusters, which makes them attractive. OpenMP programs are typically written in either loop-level or SPMD style: loop-level code is easy to write, SPMD code is not, yet achieving high-performance OpenMP programs on a cluster OpenMP system requires the SPMD style. This paper describes a small set of OpenMP directive extensions suited to cluster OpenMP systems (including data-distribution directives and loop-scheduling modes) and implements them in the cluster OpenMP system OpenMP/JIAJIA. Application tests show that programming with these extensions retains the ease of loop-level programming while delivering performance comparable to the SPMD style, making it an effective programming approach.

18.
This paper discusses how to make effective use of idle workstations to solve the compute-intensive task of matrix multiplication on a platform of high-performance heterogeneous workstations connected by a high-speed local area network. To obtain good parallel performance, a model and an algorithm for scheduling tasks across a group of heterogeneous workstations are presented. The algorithm accounts for the communication time among cooperating tasks, the data-loading time, the result-collection time, and the computation time on each heterogeneous workstation. With this model, the most suitable subset of all available workstations can be selected, yielding the shortest execution time.
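A rough sketch of the subset-selection idea (Python): estimate per-workstation load, compute, and collect times for each candidate subset and keep the subset with the shortest estimated execution time. The cost model, the rates, and the brute-force search are illustrative assumptions; the paper presents its own model and algorithm.

```python
# Illustration only: choose the subset of heterogeneous workstations that
# minimizes the estimated time of a distributed matrix multiplication,
# modeled as data-loading + computation + result-collection per workstation.
from itertools import combinations

def exec_time(subset, n, speed, load_rate, collect_rate):
    rows = {w: n * speed[w] / sum(speed[x] for x in subset) for w in subset}  # rows of C assigned per workstation
    per_ws = [rows[w] * n / load_rate[w]          # load its slice of the input
              + rows[w] * n * n / speed[w]        # multiply-accumulate work
              + rows[w] * n / collect_rate[w]     # return its slice of the result
              for w in subset]
    return max(per_ws)                            # job finishes when the slowest workstation finishes

def best_subset(workstations, n, speed, load_rate, collect_rate):
    candidates = (s for k in range(1, len(workstations) + 1)
                  for s in combinations(workstations, k))
    return min(candidates, key=lambda s: exec_time(s, n, speed, load_rate, collect_rate))

ws = ["a", "b", "c"]
print(best_subset(ws, n=1024,
                  speed={"a": 2e8, "b": 1e8, "c": 5e7},        # hypothetical operation rates
                  load_rate={"a": 1e7, "b": 1e7, "c": 1e7},    # hypothetical elements/s over the LAN
                  collect_rate={"a": 1e7, "b": 1e7, "c": 1e7}))
```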

19.
The performance of parallel computing in a network of Apollo workstations whose processes use the remote procedure call (RPC) mechanism for communication is addressed. The speedup in such systems cannot be accurately estimated without taking into account the relatively large communication overheads; moreover, it decreases with increasing parallelism once the latter exceeds a certain limit. To estimate the speedup and determine the optimum degree of parallelism, the author characterizes the parallelization and communication overheads in the system considered. Parallel applications are then modeled and their execution times expressed for the general case of nonidentical tasks and workstations. The general case allows the structural constraints of the applications to be taken into account by permitting their partitioning into heterogeneous tasks. A simple expression for the optimum degree of parallelism is obtained for identical tasks when the inherent constraints are neglected. The fact that the theoretical maximum speedup is bounded by half of the optimum degree of parallelism shows the importance of this measure.
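The observation that the maximum speedup is bounded by half the optimum degree of parallelism is exactly what a simple linear-overhead model yields. The derivation below is a reconstruction for illustration, assuming total communication overhead grows linearly with the number of processes; it is not necessarily the author's exact formulation.

```latex
% Illustrative reconstruction: sequential time T_1, per-process overhead c, n processes.
T(n) = \frac{T_1}{n} + c\,n, \qquad
\frac{dT}{dn} = -\frac{T_1}{n^2} + c = 0
\;\Rightarrow\; n_{\text{opt}} = \sqrt{T_1/c},
\qquad
S(n_{\text{opt}}) = \frac{T_1}{T(n_{\text{opt}})}
                  = \frac{T_1}{2\sqrt{T_1 c}}
                  = \frac{1}{2}\sqrt{\frac{T_1}{c}}
                  = \frac{n_{\text{opt}}}{2}.
```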

20.
We introduce a middleware infrastructure that provides software services for developing and deploying high-performance parallel programming models and distributed applications on clusters and networked heterogeneous systems. This middleware infrastructure utilizes distributed agents residing on the participating machines and communicating with one another to perform the required functions. An intensive study of the parallel programming models in Java has helped identify the common requirements for a runtime support environment, which we used to define the middleware functionality. A Java-based prototype, based on this architecture, has been developed along with a Java object-passing interface (JOPI) class library. Since this system is written completely in Java, it is portable and allows executing programs in parallel across multiple heterogeneous platforms. With the middleware infrastructure, users need not deal with the mechanisms of deploying and loading user classes on the heterogeneous system. Moreover, details of scheduling, controlling, monitoring, and executing user jobs are hidden, while the management of system resources is made transparent to the user. Such uniform services are essential for facilitating the development and deployment of scalable high-performance Java applications on clusters and heterogeneous systems. An initial deployment of a parallel Java programming model over a heterogeneous, distributed system shows good performance results. In addition, a framework for the agents' startup mechanism and organization is introduced to provide scalable deployment and communication among the agents.
