期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

廖祥文刘美华古志民《计算机应用》2003,23(11):79-81

为了在Web机群代理中有效定位缓存对象在节点机中的位置,必须实现单一缓存映像,以隐藏机群的分布和异构特性,为客户提供一致的缓存资源。为解决该问题,文中首先提出一个缓存摘要管理器体系结构,该结构基于各个缓存节点机的摘要、结合负载平衡信息,可有效定位缓存节点机,然后给出了机群代理中缓存节点机与缓存摘要管理器的实现方法。相似文献

2.

单一系统映象在机群管理中的实现 总被引：2，自引：0，他引：2

朱璇郑纬民汪东升杨广文《计算机工程与应用》2002,38(7):86-88

单一系统映象是机群系统的关键技术。文章讨论了机群单一系统映象的含义,提出了一种用于并行计算机群管理的单一系统映象模型,实现了机群单一控制和单一作业管理,解决了机群系统难于管理、管理方式可视化不强等问题。相似文献

3.

机群并行系统的设计--机群并行系统设计中需考虑的因素

章文李代平罗清磊张信一《现代计算机》2003,(2):16-18

机群并行计算机成为并行计算研究的热点。本文介绍了在设计机群并行系统时需要考虑的几点因素。相似文献

4.

曙光机群资源管理的设计与实现 总被引：5，自引：0，他引：5

熊劲孙凝晖《计算机学报》2002,25(12):1357-1363

机群资源管理负责机群系统内计算所需资源的管理,该文介绍机群资源管理的主要功能和重要的研究问题,着重介绍曙光3000机群系统的资源管理的设计与实现,包括分区管理,任务管理和通信管理功能及其对高可用性,可扩展性,动态特性的支持;并给出相应的性能分析结果,此外,还介绍了相关系统的资源管理和对未来该领域研究的方法。相似文献

5.

基于SSI的远程集群管理系统

童端韩忠愿苏杭丽《计算机工程》2008,34(20):34-36

受集群系统结构的固有特性的影响,集群系统的管理问题日益突出。早期集群系统通过命令行方式进行管理,存在功能不完善、结构单一、可用性差、不支持远程管理等缺点。该文分析了集群管理软件的功能需求和相关技术,设计和实现了一套基于SSI的远程集群管理系统。该系统采用标准化模块设计方法,其功能可灵活组态,扩展性较好,并实现比较完整的单一系统映像,可提供简单、高效的管理功能。对系统进行了测试和评价,并提出该系统未来的研究方向。相似文献

6.

计算机机群系统分析

《计算机光盘软件与应用》2013,(2)

计算机系统具有可以同时进行运算或操作的特性,它包括同时性和并发性,计算机系统的机群系统由一组完整的分布在不同地理位置的计算机系统通过高性能的专业网络或局域网系统互连而成的系统,是并行计算机系统或计算机网络系统的一种类型,它作为一个单独的统一计算机资源来使用,既可以做科学运算,也可供商业运用。相似文献

7.

基于PBS的机群任务调度管理中间件的设计与实现 总被引：1，自引：0，他引：1

尚尔凡何川都志辉《小型微型计算机系统》2004,25(6):1048-1051

介绍了基于PBS的机群任务调度管理中间件，是基于现在比较流行的调度系统PBS进行开发和封装，对上层提供了统一的接口．同时在该中间件中还封装了用户管理模块、配置模块和任务调度模块，采用基于XML的通信协议．该中间件具有很强的扩展性，可以扩展应用于LSF等多种流行的机群任务调度系统．最后，在该中间件之上，使用Java开发了前台图形界面，实现了一整套完备的具有实时监控和远程管理能力的机群任务管理系统．相似文献

8.

基于工作站机群的PVM系统的序列比对 总被引：1，自引：0，他引：1

刘寿强潘春华桂兵祥吕国斌墙芳躅《计算机工程》2002,28(5):89-90,96

序列比对是分子生物学研究领域的一个重要的工具。在DNA数据量急剧增加的今天,高效的序列比对算法在研究新发现的次序中显得非常重要。通过Smith和Waterman法用PVM系统在工作站机群上已完成了分布式序列比对法。也同样在Inter iPSC/860高效性能并行计算机上获得了成功。这个分布式Smith-Waterman算法在Internet GRAIL和GENQUEST上充当搜索工具。该文论述了此算法的实现和性能指标。相似文献

9.

基于多级副本分布的机群服务器动态扩展研究

沈海华鞠大鹏沈美明《计算机工程》2006,32(13):100-102

升级和扩展是机群服务器系统需要经常面对的问题。该文提出了基于多级副本分布的机群服务器系统重构和动态扩展模型及结点动态添加算法,采用优化的局部平衡文件迁移策略,有效减少系统动态添加新结点时的文件迁移开销,支持机群系统的动态扩展。相似文献

10.

基于统一接口的机群中交换机监控系统的设计

陈然杜晓黎《计算机工程》2005,31(16):225-227

提出了一种基于统一接口的多种类型交换机的监控方法，并应用于机群当中。该方法符合程序模块化要求，有效地提高了监控系统的可管理性、可扩展性和可用性。相似文献

11.

NAS集群中单一系统映像的关键技术

鲁宏伟李悦《计算机应用研究》2003,20(7):108-109,112

NAS(Network Attached Storage)和SAN(Storage Area Network)是目前数据存储领域的主流技术,但都存在缺陷。介绍了构建NAS集群的设想和动向,并就其单一系统映像的关键技术进行了探讨。相似文献

12.

单一系统映象集群管理系统的研究与实现

白欣宋博左继章向建军《计算机工程与应用》2003,39(29):25-27,137

单一系统映象集群管理系统以传统的GUI图形界面完成与用户的人机交互,有效地屏蔽了集群的系统结构,方便了用户的使用和管理,为实现集群的高性能、高可靠性、高可用性提供了良好的系统软件支持。文章所设计的集群管理系统还提供了丰富的辅助工具,从而极大地简化了系统管理,使系统管理和操作对用户更加透明直观。目前此集群管理系统已应用于某实时测控集群系统。相似文献

13.

On cluster resource allocation for multiple parallel task graphs 总被引：1，自引：0，他引：1

Henri Casanova Frédéric Desprez Frédéric Suter 《Journal of Parallel and Distributed Computing》2010

Many scientific applications can be structured as parallel task graphs (PTGs), that is, graphs of data-parallel tasks. Adding data parallelism to a task-parallel application provides opportunities for higher performance and scalability, but poses additional scheduling challenges. In this paper, we study the off-line scheduling of multiple PTGs on a single, homogeneous cluster. The objective is to optimize performance without compromising fairness among the PTGs. We consider the range of previously proposed scheduling algorithms applicable to this problem, from both the applied and the theoretical literature, and we propose minor improvements when possible. Our main contribution is an extensive evaluation of these algorithms in simulation, using both synthetic and real-world application configurations, using two different metrics for performance and one metric for fairness. We identify a handful of algorithms that provide good trade-offs when considering all these metrics. The best algorithm overall is one that structures the schedule as a sequence of phases of increasing duration based on a makespan guarantee produced by an approximation algorithm. 相似文献

14.

Predictive resource management for meta-applications

N. Floros A. J. G. Hey K. E. Meacham J. Papay M. Surridge 《Future Generation Computer Systems》1999,15(5-6):723-734

This paper defines meta-applications as large, related collections of computational tasks, designed to achieve a specific overall result, running on a (possibly geographically) distributed, non-dedicated meta-computing platform. To carry out such applications in an industrial context, one requires resource management and job scheduling facilities (including capacity planning), to ensure that the application is feasible using the available resources, that each component job will be sent to an appropriate resource, and that everything will finish before the computing resources are needed for other purposes.

This requirement has been addressed by the PAC in three major European collaborative projects: PROMENVIR, TOOLSHED and HPC-VAO, leading to the creation of job scheduling software, in which scheduling is brought together with performance modelling of applications and systems, to provide meta-applications management facilities. This software is described, focusing on the performance modelling approach which was needed to support it.

Early results from this approach are discussed, raising some new issues in performance modelling and software deployment for meta-applications. An indication is given about ongoing work at the PAC designed to overcome current limitations and address these outstanding issues. 相似文献

15.

Modeling a resource contention in the management of virtual organizations

Tien Van Do 《Information Sciences》2010,180(17):3108-3116

A virtual organization provides a cost-efficient method allowing different autonomous entities, such as organizations, departments and individuals, to extend service offerings in a virtual marketplace. To support cost-efficient service provisioning, a suitable procedure must be applied to determine the amount of resources necessary for the operation of virtual organizations.We propose a new mathematical model for a quantitative performance evaluation of resource management in virtual organizations. We present an efficient algorithm to determine the steady state probabilities and the performance measures of the system. A comparison with a detailed simulation model and other numerical approaches shows that the proposed algorithm is fast and accurate. This algorithm can therefore be used for resource dimensioning to support the cost-efficient operation of virtual organizations. 相似文献

16.

Virtualized Web server cluster self-configuration to optimize resource and power use

Andre Felipe Monteiro Marcus Vinicius Azevedo Alexandre Sztajnberg 《Journal of Systems and Software》2013

This work proposes a reusable architecture that enables the self-configuration of a supporting infrastructure for Web server clusters using virtual machines. The goal of the architecture is to ensure service quality, evaluating how broadly it complies with the application's operating restrictions and proportionally acting on the configuration of physical servers (hosts) or virtual machines. In addition, through the rational use of resources, the proposal aims at saving energy. A prototype of the architecture was developed and a performance evaluation carried out with two different resource management approaches. This evaluation shows how fully functional and advantageous the proposal is in terms of using resources, avoiding waste, yet maintaining the application's quality of service within acceptable levels. The architecture also shows to be flexible enough to accept, with a reasonable amount of effort, different resource self-configuration policies. 相似文献

17.

Failure-aware resource management for high-availability computing clusters with distributed virtual machines 总被引：1，自引：0，他引：1

Song Fu 《Journal of Parallel and Distributed Computing》2010

In large-scale networked computing systems, component failures become norms instead of exceptions. Failure-aware resource management is crucial for enhancing system availability and achieving high performance. In this paper, we study how to efficiently utilize system resources for high-availability computing with the support of virtual machine (VM) technology. We design a reconfigurable distributed virtual machine (RDVM) infrastructure for networked computing systems. We propose failure-aware node selection strategies for the construction and reconfiguration of RDVMs. We leverage the proactive failure management techniques in calculating nodes’ reliability states. We consider both the performance and reliability status of compute nodes in making selection decisions. We define a capacity–reliability metric to combine the effects of both factors in node selection, and propose Best-fit algorithms with optimistic and pessimistic selection strategies to find the best qualified nodes on which to instantiate VMs to run user jobs. We have conducted experiments using failure traces from production systems and the NAS Parallel Benchmark programs on a real-world cluster system. The results show the enhancement of system productivity by using the proposed strategies with practically achievable accuracy of failure prediction. With the Best-fit strategies, the job completion rate is increased by 17.6% compared with that achieved in the current LANL HPC cluster. The task completion rate reaches 91.7% with 83.6% utilization of relatively unreliable nodes. 相似文献

18.

Maximizing stochastic robustness of static resource allocations in a periodic sensor driven cluster

《Future Generation Computer Systems》2014

This research investigates the problem of robust static resource allocation for distributed computing systems operating under imposed Quality of Service (QoS) constraints. Often, such systems are expected to function in an environment where uncertainty in system parameters is common. In such an environment, the amount of processing required to complete a task may fluctuate substantially. Determining a resource allocation that accounts for this uncertainty—in a way that can provide a probability that a given level of QoS is achieved—is an important area of research. We have designed novel techniques for maximizing the probability that a given level of QoS is achieved. These techniques feature a unique application of both path relinking and local search within a Genetic Algorithm. In addition, we define a new methodology for finding resource allocations that are guaranteed to have a non-zero probability of addressing the timing constraints of the system. We demonstrate the use of this methodology within two unique steady-state genetic algorithms designed to maximize the robustness of resource allocations. The performance results for our techniques are presented for a simulated environment that models a heterogeneous cluster-based radar data processing center. 相似文献

19.

Grid resource management based on economic mechanisms

Chuliang Weng Minglu Li Xinda Lu 《The Journal of supercomputing》2007,42(2):181-199

Resources in the grid context belong to different control organizations with different interest, therefore the economic interest of each grid participant should be considered. The economic mechanism can guarantee the interest of participants in the grid with fairness and efficiency. In this paper, an economic-based resource management framework is put forward for grid computing, and then how to determine the price of resources with the economic mechanism is studied. A general equilibrium method is presented for general resources and a double auction method is proposed for special resources in the grid environment, respectively. Simulations are performed and experimental results indicate that the two methods are effective for corresponding application scenarios. Expanded version of a paper presented at CCGrid 2005. 相似文献

20.

A resource allocation matrix approach to IT management

Anders Mårtensson 《Information Technology and Management》2006,7(1):21-34

相似文献