A total of 20 similar documents were retrieved (search time: 15 ms)
1.
Complex web applications are usually served by multi-tier web clusters. With the growing cost of energy, the importance of reducing power consumption in server systems is now well known and has become a major research topic. However, most existing research has focused solely on homogeneous clusters. This paper addresses the challenge of power management in heterogeneous multi-tier web clusters. We apply Generalized Benders Decomposition (GBD) to decompose the global optimization problem into small sub-problems; the algorithm reaches the optimal solution iteratively. Evaluation results show that our algorithm conserves more energy than previous work.
2.
This work uses the lattice Boltzmann method (LBM) on graphics processing units (GPUs) to compute incompressible fluid flow in a periodic unit cell of an agglomerate composed of a stationary array of cylinders. A direct bounce-back scheme is applied at the fluid-solid interface to enforce the no-slip boundary, and the drag force on each cylinder is obtained directly by accumulating the momentum exchange. Based on the fluid velocity obtained from the LBM, the average drag coefficient for a single cylinder in the agglomerate is computed according to the energy-minimization multi-scale (EMMS) model, and the critical condition under which the agglomerate can be approximated as a uniform suspension is examined. Simulation results for 80 solid fractions at particle Reynolds numbers Re_p between 0 and 10 show that the dense-phase voidage can characterize this critical condition. At a fixed solid fraction, the critical voidage decreases as Re_p increases; at a fixed Re_p, the critical voidage decreases as the solid fraction increases.
3.
Three-dimensional Networks-on-Chip (3D NoCs) have recently been proposed to address the on-chip communication demands of future highly dense 3D multi-core systems. Homogeneous 3D NoC topologies require many Through-Silicon Vias (TSVs), which have a costly and complex manufacturing process. In addition, 3D routers use more memory and are more power-hungry than conventional 2D routers. Alternatively, heterogeneous 3D NoCs combine the area and performance benefits of 2D and 3D static router architectures by using a limited number of TSVs. To improve the performance of heterogeneous 3D NoCs, we propose an adaptive router architecture that balances the traffic in such NoCs. Experimental results show that our proposed architecture improves performance by up to 75% when 2D static routers are replaced with adaptive 2D routers in heterogeneous 3D NoCs, while keeping the maximum clock frequency, power, and energy consumption of the adaptive router at nearly the same level as the static router.
4.
To reduce environmental impact, it is essential to make data centers green by turning off servers and tuning their speeds to the instantaneous offered load, that is, by determining the dynamic configuration of a web server cluster. We model the problem of selecting which servers will be on and finding their speeds as a mixed integer program, and we also show how to combine such solutions with control theory. As a proof of concept, we implemented this dynamic configuration scheme in a web server cluster running Linux, with soft real-time requirements and QoS control, in order to guarantee both energy efficiency and a good user experience. In this paper, we compare the performance of our scheme with other schemes, compare centralized and distributed approaches for QoS control, and compare schemes for choosing server speeds.
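As a rough illustration of the configuration problem described above (not the paper's actual formulation), the sketch below enumerates per-server speed levels and picks the cheapest combination whose total capacity covers the offered load; the server table, capacities, and power numbers are invented.

```python
from itertools import product

# Hypothetical speed levels per server: (capacity in req/s, power in W); level (0, 0) = server off.
SERVERS = [
    [(0, 0), (100, 80), (150, 120)],
    [(0, 0), (120, 90), (180, 140)],
    [(0, 0), (200, 160)],
]

def cheapest_configuration(load):
    """Brute-force stand-in for the mixed integer program: choose one speed level
    per server so that total capacity covers `load` at minimum total power."""
    best = None
    for choice in product(*SERVERS):
        capacity = sum(c for c, _ in choice)
        power = sum(p for _, p in choice)
        if capacity >= load and (best is None or power < best[0]):
            best = (power, choice)
    return best

print(cheapest_configuration(250))   # e.g. serve 250 req/s with the least power
```

A real deployment would hand the integer program to a solver and re-solve it as the load estimate changes.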
5.
Executing heterogeneous workloads with different priorities, resource demands and performance objectives is one of the key operations for today's data centers to increase both resource and energy efficiency. In order to meet the performance objectives of diverse workloads, schedulers rely on evictions, even though evictions waste resources through the lost executions of evicted tasks. It is not straightforward to design priority schedulers that capture the key aspects of workloads and systems while also striking a balance between resource (in)efficiency and application performance. To explore the large design space of such schedulers, we propose a trace-driven cluster management framework that models a comprehensive set of system configurations and general priority-based scheduling policies. In particular, we focus on the impact of task evictions on resource inefficiency and on the task response times of multiple priority classes, driven by a Google production cluster trace. Moreover, we propose a system design as a use case that exploits workload heterogeneity and introduces workload-awareness into the system configuration and task assignment.
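As a toy sketch of the eviction mechanism discussed above (not the paper's framework), the snippet below models a machine with a fixed number of slots on which a higher-priority task can evict the lowest-priority running task; the slot count, priorities, and task ids are illustrative.

```python
class Machine:
    """Toy machine: higher-priority tasks may evict a running lower-priority task;
    the evicted task's work is lost, which is the resource waste being studied."""
    def __init__(self, slots):
        self.slots = slots
        self.running = []                          # list of (priority, task_id)

    def submit(self, priority, task_id):
        if len(self.running) < self.slots:
            self.running.append((priority, task_id))
            return None                            # started without eviction
        victim = min(self.running)                 # lowest-priority running task
        if victim[0] < priority:
            self.running.remove(victim)
            self.running.append((priority, task_id))
            return victim[1]                       # id of the evicted (wasted) task
        return task_id                             # new task must wait or be rejected

m = Machine(slots=2)
for prio, tid in [(1, "a"), (2, "b"), (3, "c"), (1, "d")]:
    print(tid, "->", m.submit(prio, tid))          # "c" evicts "a"; "d" cannot start
```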
6.
The Journal of Supercomputing - This article presents a set of linear regression models to predict the impact of task migration on different objectives, like performance and energy consumption. It...
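A minimal sketch of how such regression models can be fitted, assuming made-up features (memory footprint, dirty-page rate, link bandwidth) and made-up measured migration costs; this is a generic ordinary-least-squares fit, not the article's models.

```python
import numpy as np

# Toy training data: [memory footprint (MB), dirty-page rate, link bandwidth (MB/s)]
X = np.array([[512, 0.10, 100],
              [1024, 0.30, 100],
              [2048, 0.20, 1000],
              [4096, 0.50, 1000]], dtype=float)
y = np.array([5.6, 12.1, 3.4, 9.8])              # observed migration cost (s), invented

A = np.hstack([X, np.ones((X.shape[0], 1))])     # add an intercept column
coef, *_ = np.linalg.lstsq(A, y, rcond=None)     # ordinary least squares

new_task = np.array([1536, 0.25, 1000, 1.0])     # features of a candidate migration
print("predicted migration cost:", new_task @ coef)
```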
7.
The performance of a conventional parallel application is often degraded by load-imbalance on heterogeneous clusters. Although it is simple to invoke multiple processes on fast processing elements to alleviate load-imbalance, the optimal process allocation is not obvious. Kishimoto and Ichikawa presented performance models for High-Performance Linpack (HPL), with which sub-optimal configurations of heterogeneous clusters were actually estimated. Their results on HPL are encouraging, but their approach had not been verified with other applications. This study presents several enhancements of Kishimoto's scheme, which are evaluated with four typical scientific applications: computational fluid dynamics (CFD), the finite-element method (FEM), HPL (a linear algebraic system), and the fast Fourier transform (FFT). According to our experiments, our new models (NP-T models) are superior to Kishimoto's models, particularly when the non-negative least squares method is used for parameter extraction. The average errors of the derived models were 0.2% for the CFD benchmark, 2% for the FEM benchmark, 1% for HPL, and 28% for the FFT benchmark. This study also emphasizes the importance of predictability in clusters, listing practical examples derived from our study. Copyright © 2008 John Wiley & Sons, Ltd.
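For the parameter-extraction step mentioned above, non-negative least squares is available in SciPy; the sketch below fits non-negative coefficients of a performance model to made-up benchmark timings (the matrix, timings, and term names are assumptions, not the paper's data).

```python
import numpy as np
from scipy.optimize import nnls

# Rows: benchmark runs; columns: model terms (e.g. compute, memory, network), invented.
A = np.array([[1.0, 0.5, 0.2],
              [2.0, 1.0, 0.4],
              [1.5, 2.0, 0.1],
              [3.0, 1.5, 0.8]])
t = np.array([1.9, 3.8, 3.6, 5.9])   # measured execution times (s), invented

params, residual = nnls(A, t)        # coefficients are constrained to be >= 0
print("non-negative model parameters:", params, "residual norm:", residual)
```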
8.
Loop partitioning on parallel and distributed systems has been a critical problem, and it becomes even more difficult to deal with on the emerging heterogeneous PC cluster environments. In the past, several loop self-scheduling schemes have been proposed for heterogeneous cluster environments. In this paper, we propose a performance-based approach that partitions loop iterations according to the performance ratios of the cluster nodes. To verify the proposed approach, a heterogeneous cluster is built, and three types of application programs are implemented and executed on this testbed. Experimental results show that the proposed approach performs better than traditional schemes.
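A minimal sketch of the performance-ratio idea above (not the paper's exact scheme): iterations are split in proportion to each node's measured performance score, and leftovers go to the fastest nodes; the scores below are invented.

```python
def partition_iterations(n_iters, perf):
    """Split n_iters loop iterations among nodes in proportion to their performance scores."""
    total = float(sum(perf))
    shares = [int(n_iters * p / total) for p in perf]          # integer share per node
    leftover = n_iters - sum(shares)
    for i in sorted(range(len(perf)), key=lambda i: perf[i], reverse=True)[:leftover]:
        shares[i] += 1                                          # fastest nodes absorb the remainder
    chunks, start = [], 0
    for s in shares:                                            # contiguous [start, end) chunks
        chunks.append((start, start + s))
        start += s
    return chunks

print(partition_iterations(1000, [3.0, 1.5, 1.0]))   # fastest node receives the largest chunk
```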
10.
Scalable storage architectures allow digital libraries and archives to add or remove storage devices in order to increase storage capacity and bandwidth or to retire older devices. Past work in this area has mainly focused on statically scaling homogeneous storage devices. However, heterogeneous devices are quickly being adopted for storage scaling since they are usually faster, larger, more widely available, and more cost-effective. We propose BroadScale, an algorithm based on Random Disk Labeling, to dynamically scale heterogeneous storage systems by distributing data objects according to their device weights. Assuming a random placement of objects across a group of heterogeneous storage devices, our optimization objectives when scaling are to ensure a uniform distribution of objects, to redistribute a minimum number of objects, and to maintain fast data access with low computational complexity. We show through experimentation that BroadScale achieves these requirements when scaling heterogeneous storage.
11.
Data distribution and load balancing become increasingly important in large-scale distributed storage systems. This paper focuses on the problem of designing an optimal, self-adaptive strategy for the balanced distribution and reorganization of replicated objects among a dynamically changing set of heterogeneous nodes, and presents a novel decentralized algorithm, Dynamic Interval Mapping, which maps replicated objects to a scalable collection of nodes. It distributes objects to nodes optimally and redistributes a minimum number of objects when new nodes are added or existing nodes are removed, so as to maintain the balanced distribution. It supports weighted allocation and guarantees that replicas of a particular object are not placed on the same node. Its time complexity and storage requirements are superior to those of previous methods.
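As a naive illustration of weighted interval mapping (only the placement idea; the algorithm described above additionally guarantees minimal remapping and replica separation), the sketch below hashes an object id into [0, 1) and looks up the node whose weighted interval contains it; node names and weights are invented.

```python
import hashlib
from bisect import bisect_right

def build_intervals(nodes):
    """nodes: dict of name -> weight.  Returns node names and cumulative interval bounds on [0, 1)."""
    total = float(sum(nodes.values()))
    names, bounds, acc = [], [], 0.0
    for name, weight in nodes.items():
        acc += weight / total
        names.append(name)
        bounds.append(acc)
    return names, bounds

def place(obj_id, names, bounds):
    h = int(hashlib.md5(obj_id.encode()).hexdigest(), 16) / 16.0 ** 32   # hash -> [0, 1)
    return names[min(bisect_right(bounds, h), len(names) - 1)]

names, bounds = build_intervals({"node-a": 2.0, "node-b": 1.0, "node-c": 1.0})
print(place("object-42", names, bounds))   # node-a receives roughly half of all objects
```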
12.
Multi-tenancy promises high utilization of available system resources and helps maintain cost-effective operations for service providers. However, multi-tenant high-performance computing (HPC) infrastructures, like dynamic HPC clouds, bring unique challenges, both in providing performance isolation to the tenants and in achieving efficient load-balancing across the network fabric. Each tenant should experience predictable network performance, unaffected by the workload of other tenants. At the same time, it is equally important that the network links are balanced, avoiding network saturation. Network saturation can lead to unpredictable application performance and a potential loss of profit for the cloud service providers. In this paper, we present two significant extensions to our previously proposed partition-aware fat-tree routing algorithm, pFTree, for InfiniBand-based HPC systems. First, we extend pFTree to incorporate provider-defined partition-wise policies that govern how the nodes in different partitions are allowed to share network resources with each other. Second, we present a weighted version of the pFTree routing algorithm that, besides partitions, also takes node traffic characteristics into account to balance load across the network links more evenly. A comprehensive evaluation comprising both real-world experiments and simulations confirms the correctness and feasibility of the proposed extensions.
13.
This paper addresses high-level synthesis for real-time digital signal processing (DSP) architectures using heterogeneous functional units (FUs). For such special-purpose architecture synthesis, an important problem is how to assign a proper FU type to each operation of a DSP application and generate a schedule in such a way that all requirements are met and the total cost is minimized. We propose a two-phase approach to solve this problem. In the first phase, we solve the heterogeneous assignment problem, i.e., how to assign proper FU types to operations so that the total cost is minimized while the timing constraint is satisfied. In the second phase, based on the assignments obtained in the first phase, we propose a minimum-resource scheduling algorithm that generates a schedule and a feasible configuration using as few resources as possible. We prove that the heterogeneous assignment problem is NP-complete. Efficient algorithms are proposed to find an optimal solution when the given DFG is a simple path or a tree, and three other algorithms are proposed to solve the general problem. The experiments show that our algorithms can effectively reduce the total cost compared with previous work.
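For intuition on the path case mentioned above, here is a small dynamic program: each operation in a chain picks one FU type with a (time, cost) pair, and we minimize total cost under a time budget. It is a pedagogical sketch with invented numbers, not the paper's algorithm.

```python
from math import inf

def min_cost_assignment(ops, deadline):
    """ops[i] is a list of (time, cost) options, one per candidate FU type for operation i.
    dp[t] holds the minimum cost of the operations seen so far when they take total time t."""
    dp = [0.0] + [inf] * deadline
    for options in ops:
        ndp = [inf] * (deadline + 1)
        for t in range(deadline + 1):
            if dp[t] == inf:
                continue
            for dt, cost in options:
                if t + dt <= deadline and dp[t] + cost < ndp[t + dt]:
                    ndp[t + dt] = dp[t] + cost
        dp = ndp
    best = min(dp)
    return None if best == inf else best

# Three chained operations, each with a fast/expensive and a slow/cheap FU type (invented):
ops = [[(1, 9), (3, 2)], [(2, 5), (4, 1)], [(1, 8), (2, 3)]]
print(min_cost_assignment(ops, deadline=7))   # cheapest mix that still fits 7 time units -> 10
```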
14.
Wireless Sensor Networks (WSNs) are useful for a wide range of applications from different domains. Recently, new features and design trends have emerged in the WSN field, making these networks appealing not only to the scientific community but also to industry. One such trend is running different applications on heterogeneous sensor nodes deployed in multiple WSNs in order to better exploit the expensive physical network infrastructure. Another trend deals with the capability of accessing sensor-generated data from the Web, fitting WSNs into the novel paradigms of the Internet of Things (IoT) and the Web of Things (WoT). Using well-known and broadly accepted Web standards and protocols enables the interoperation of heterogeneous WSNs and the integration of their data with other Web resources, in order to provide the final user with value-added information and applications. Such emergent scenarios, where multiple networks and applications interoperate to meet high-level user requirements, will pose several challenges for the design and execution of WSN systems. One of these challenges regards the fact that applications will probably compete for the resources offered by the underlying sensor nodes through the Web. Thus, it is crucial to design mechanisms that effectively and dynamically coordinate the sharing of the available resources to optimize resource utilization while meeting application requirements. However, it is likely that the Quality of Service (QoS) requirements of different applications cannot be simultaneously met while efficiently sharing the scarce network resources, bringing the need to manage an inherent tradeoff. In this paper, we argue that a middleware platform is required to manage heterogeneous WSNs and efficiently share their resources while satisfying user needs in the emergent WoT scenarios. Such middleware should provide several services to control running applications as well as to distribute and coordinate nodes in the execution of submitted sensing tasks in an energy-efficient and QoS-enabled way. As part of the services provided by the middleware, we present the Resource Allocation in Heterogeneous WSNs (SACHSEN) algorithm. SACHSEN is a new resource allocation heuristic for systems composed of heterogeneous WSNs that effectively deals with the tradeoff between possibly conflicting QoS requirements and exploits the heterogeneity of multiple WSNs.
15.
Once broadband, high-volume data acquisition feeds into a parallel computer network, cluster computing can provide high-gain, low-latency processing of severely attenuated communication signals, enabling effective real-time interpretation of the communication data. A new dynamic heuristic scheduling algorithm, MDS, is proposed. The algorithm jointly considers task timing requirements, system throughput, and load balance. Even when task deadlines are tight, MDS maintains a high scheduling success ratio; at the same time, subject to meeting task deadlines, the system achieves high throughput and balanced load. Experiments analyze the influence of several task parameters on MDS and compare it with other algorithms. The results show that MDS outperforms the other algorithms.
16.
This paper presents a load balancing algorithm specifically designed for heterogeneous clusters composed of nodes with different computational capabilities. The method is based on a new index that takes into consideration two levels of processor heterogeneity: the number of cores per node and the computational power of each core. The experimental results show that this index allows balanced workload distributions to be achieved even on clusters where heterogeneity cannot be neglected.
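A minimal sketch of such a two-level index, assuming a node's capacity is simply (cores per node) x (relative core speed) and work is shared in proportion to it; the node names and numbers are invented, and the paper's index may differ.

```python
# name -> (cores, relative core speed); values invented for illustration.
nodes = {"n1": (8, 1.0), "n2": (4, 1.6), "n3": (16, 0.5)}

index = {name: cores * speed for name, (cores, speed) in nodes.items()}   # two-level capacity index
total = sum(index.values())

work_items = 1000
shares = {name: round(work_items * idx / total) for name, idx in index.items()}
print(index)    # {'n1': 8.0, 'n2': 6.4, 'n3': 8.0}
print(shares)   # work assigned in proportion to the index
```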
17.
Developing energy-efficient clusters not only reduces the cost of electric power but can also improve system reliability. Existing scheduling strategies developed for energy-efficient clusters conserve energy at the cost of performance, and the performance problem becomes especially apparent when cluster computing systems are heavily loaded. To address this issue, we propose in this paper a novel scheduling strategy, adaptive energy-efficient scheduling (AEES), for aperiodic and independent real-time tasks on heterogeneous clusters with dynamic voltage scaling. The AEES scheme aims to adaptively adjust voltages according to the workload conditions of a cluster, thereby making the best trade-off between energy conservation and schedulability. When the cluster is heavily loaded, AEES considers the voltage levels of both new tasks and running tasks to meet task deadlines. Under light load, AEES aggressively reduces voltage levels to conserve energy while maintaining higher guarantee ratios. We conducted extensive experiments to compare AEES with an existing algorithm (MEG) as well as two baseline algorithms (MELV and MEHV). Experimental results show that AEES significantly improves on the scheduling quality of MELV, MEHV and MEG.
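The basic trade-off behind voltage scaling can be sketched as follows: among the voltage/frequency levels that still meet a task's deadline, pick the one with the least energy. This is a generic illustration with invented operating points, not the AEES algorithm itself.

```python
# Hypothetical (voltage V, frequency Hz, power W) operating points of a node.
LEVELS = [(0.8, 1.0e9, 4.0), (1.0, 1.5e9, 7.5), (1.2, 2.0e9, 13.0)]

def pick_level(cycles, slack_s):
    """Return the least-energy level that finishes `cycles` within `slack_s` seconds."""
    feasible = [(watts * cycles / hz, v, hz) for v, hz, watts in LEVELS
                if cycles / hz <= slack_s]
    if not feasible:
        return None                       # deadline unreachable: reject or migrate the task
    energy, v, hz = min(feasible)         # least energy among the feasible settings
    return v, hz, energy

print(pick_level(cycles=3e9, slack_s=2.5))   # -> (1.0, 1500000000.0, 15.0): slowest feasible level
```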
18.
Fairness of multi-resource allocation is one of the most important metrics for evaluating the resource scheduling subsystem of a cloud computing cluster. DRF, the widely used multi-resource fair allocation algorithm, may lose fairness in heterogeneous clusters. Building on a study of the DRF multi-resource fair allocation algorithm in the Mesos framework, we design and implement meDRF, an allocation algorithm that adds a machine-performance factor: the machine performance score of each compute node enters the computation of the DRF dominant share, so that tasks have an equal opportunity to obtain high-quality and low-quality computing resources. Experiments with K-means, Bayes and PageRank jobs show that meDRF reflects the fairness of multi-resource allocation better than DRF, provides more stable allocations, and effectively improves system resource utilization.
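One plausible reading of the dominant-share computation described above, sketched minimally: plain DRF takes the maximum of used/capacity over resource types, and the meDRF variant scales that share by the node's machine performance score. The direction and form of the factor here are assumptions for illustration only.

```python
def dominant_share(used, capacity, perf_score=1.0):
    """used/capacity are per-resource lists (e.g. [CPUs, memory]); perf_score is the
    machine performance factor (1.0 reproduces classic DRF)."""
    share = max(u / c for u, c in zip(used, capacity))
    return share * perf_score

# A framework holding 9 CPUs and 18 GB on a cluster with 36 CPUs and 72 GB in total:
print(dominant_share([9, 18], [36, 72]))                  # classic DRF dominant share: 0.25
print(dominant_share([9, 18], [36, 72], perf_score=0.8))  # discounted when the resources sit on slower nodes
```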
19.
Remotely sensed hyperspectral sensors provide image data containing rich information in both the spatial and the spectral domain, and this information can be used to address detection tasks in many applications. One of the most widely used and successful algorithms for anomaly detection in hyperspectral images is the RX algorithm. Despite its wide acceptance and its high computational complexity when applied to real hyperspectral scenes, few approaches have been developed for the parallel implementation of this algorithm. In this paper, we evaluate the suitability of using a hybrid parallel implementation with a high-dimensional hyperspectral scene. A general strategy to automatically map parallel hybrid anomaly detection algorithms for hyperspectral image analysis has been developed, and parallel RX has been tested on a heterogeneous cluster using this routine. The considered approach is quantitatively evaluated using hyperspectral data collected by NASA's Airborne Visible/Infrared Imaging Spectrometer system over the World Trade Center in New York, five days after the terrorist attacks. The numerical effectiveness of the algorithms is evaluated by means of their capacity to automatically detect the thermal hot spots of fires (anomalies). The speedups achieved show that a cluster of multi-core nodes can greatly accelerate the RX algorithm.
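For reference, the serial global RX detector that such parallel implementations accelerate scores each pixel by its Mahalanobis distance from the scene's mean spectrum; the sketch below runs on a small synthetic cube (the data and sizes are invented).

```python
import numpy as np

def rx_scores(cube):
    """Global RX detector: Mahalanobis distance of every pixel spectrum from the scene mean.
    `cube` has shape (rows, cols, bands)."""
    rows, cols, bands = cube.shape
    X = cube.reshape(-1, bands).astype(float)
    mu = X.mean(axis=0)
    cov_inv = np.linalg.pinv(np.cov(X, rowvar=False))      # pseudo-inverse for numerical safety
    d = X - mu
    scores = np.einsum("ij,jk,ik->i", d, cov_inv, d)       # per-pixel d^T C^{-1} d
    return scores.reshape(rows, cols)

# Synthetic 3-band scene with one injected anomalous pixel:
scene = np.random.default_rng(0).normal(size=(50, 50, 3))
scene[10, 10] += 8.0
print(np.unravel_index(np.argmax(rx_scores(scene)), (50, 50)))   # -> (10, 10)
```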
20.
Heterogeneous cluster systems consisting of CPUs and different kinds of accelerators have become mainstream in HPC. Programming such systems is a difficult task and requires addressing manifold challenges that stem from the intricate composition of such systems and peculiarities of scientific applications.
A broad range of obstacles preventing efficient execution has to be considered and dealt with properly. In this paper, we propose a systematic approach and a framework that provides comprehensive support for running data-parallel applications in heterogeneous asymmetric clusters. Our implementation provides work partitioning and distribution that ensures workload balance across the cluster, while handling partitioning-induced communication and synchronization transparently. In our experimental section, we choose 11 representative scientific applications from different domains to evaluate our approach. Experimental results show strong speedups and workload balance for different cluster configurations.