Found 20 similar documents (search time: 187 ms)
1.
2.
3.
4.
《每周电脑报》2004,(42)
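On November 9, Intel introduced six new processors to refresh its Itanium 2 product line. The new processors deliver a 15% gain in database transaction-processing performance and up to a 35% improvement in overall integer and floating-point performance. The lineup includes the Itanium 2 processor MP 1.60 GHz with a 400 MHz front-side bus (FSB) and 9 MB of level-3 (L3) cache, aimed mainly at business-critical applications such as databases and ERP. It also includes the Itanium 2 processor MP 1.60 GHz with 6 MB of L3 cache and the Itanium 2 processor MP 1.50 GHz with 4 MB of L3 cache. The Itanium 2 processor DP 1.60 GHz, with 400 and 533 MHz front-side buses and 3 MB of L3 cache, offers excellent price/performance for high-performance computing clusters and front-end enterprise systems. This new processor…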
5.
《每周电脑报》2004,(45)
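In mid-December it was learned that SGI of the United States had released a new product in the Altix 3700 series. With twice the bandwidth and processor density, it becomes the flagship of the series. The new Altix 3700 Bx2 runs industry-standard Linux and is built around Itanium 2 processors, letting users build more powerful supercomputing systems at better price/performance in a smaller footprint. Reportedly, on October 26 NASA's Columbia system officially comprised Altix systems with 10,240 processors. To put 256 Itanium 2 processors into a single system, the Altix 3700 uses the SGI NUMAflex global shared-memory architecture to connect the new high-density CPU modules so that they deliver maximum performance in real applications. The new system uses the industry's fastest SGI NUMAlink 4…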
6.
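On May 19, HP and Intel jointly held a seminar in Beijing on their latest strategic plans, named "Unity Day" (“团结日”). At the event, Vish Mulchand, marketing director for business-critical systems servers from HP's US headquarters, and Lisa Graff, manager in the Enterprise Platforms Group from Intel's headquarters, reaffirmed the HP-Intel cooperation strategy and presented the latest results of the partnership along with its future roadmap. According to Graff, Intel's Itanium roadmap calls for an Itanium 2 processor code-named "Millington" in 2005, together with the first dual-core Itanium 2 processor, "Montecito", and the multi-core Itanium 2 processor "Dimona", followed in 2007 by the multi-core Itanium 2 processor "Tukwila". Intel's goal is that by 2007 the Itanium 2 processor…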
7.
8.
9.
10.
11.
12.
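The MM5 model, jointly developed by the US National Center for Atmospheric Research (NCAR) and Pennsylvania State University (PSU), is one of the most widely used mesoscale models in meteorology. Because of its wide use and its computational characteristics, the MM5 Benchmark has become one of the typical applications for measuring the practical performance of computers in high-performance computing. This paper describes the parallel computing method of the MM5 model and, targeting the characteristics of the currently popular Intel Itanium 2 CPU, optimizes program performance on the basis of performance analysis, covering compiler options, code optimization, domain decomposition, and processor topology. Test results show that on a Chinese-built supercomputer, sustained computing performance after optimization improved by up to about 20%.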
13.
Francisco Isidro Massetto, Liria Matsumoto Sato, Kuan-Ching Li 《The Journal of Supercomputing》2012,60(1):87-116
Breakthrough advances in microprocessor technology and efficient power management have altered the course of processor development with the emergence of multi-core technology, which brings a higher level of processing. The use of many-core technology has boosted the computing power provided by clusters of workstations or SMPs, delivering large computational power at an affordable cost using solely commodity components. Different implementations of message-passing libraries and system software (including operating systems) are installed in such cluster and multi-cluster computing systems. To guarantee correct execution of a message-passing parallel application in a computing environment other than the one for which it was originally developed, the application code must be reviewed. In this paper, a hybrid communication interfacing strategy is proposed to execute a parallel application on a group of computing nodes belonging to different clusters or multi-clusters (the computing systems may be running different operating systems and MPI implementations), interconnected with public or private IP addresses and responding interchangeably to user execution requests. Experimental results, obtained by executing benchmark parallel applications, demonstrate the feasibility and effectiveness of the proposed strategy.
14.
Cluster/distributed computing has become a popular, cost-effective alternative to high-performance parallel computers. Many parallel programming languages and related programming models have become widely accepted on clusters. However, high communication overhead is a major shortcoming of running parallel applications in cluster/distributed computing environments. To reduce the communication overhead, and thus the completion time of a parallel application, this paper introduces and evaluates an efficient Key Message (KM) approach to support parallel computing in cluster computing environments. We briefly present the model and algorithm, and then use analytical and simulation methods to evaluate the algorithm's performance. The analysis shows that when the network background load increases or the computation-to-communication ratio decreases, the KM approach yields a larger improvement in the communication of a parallel application relative to a system that does not use it.
15.
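The network communication system is an important component of a cluster and a key factor in the cluster's overall processing efficiency. As the computing power of individual nodes grows, network communication capability must increase accordingly. One way to increase it is to use multiple network interface cards (NICs) to send messages simultaneously, that is, parallel communication. Parallel communication is usually implemented on top of an RMA mechanism; for messages smaller than 17 KB, the handshake required by the RMA mechanism limits the performance gain of parallel communication. This paper proposes a parallel communication protocol supported by intelligent NICs. The protocol moves the handshake needed for message reassembly down onto the NIC, which reduces handshake overhead and extends the range of message sizes for which parallel communication is useful. Experimental data show that, compared with the RMA-based parallel protocol, the proposed protocol improves communication performance for messages of 3 KB to 17 KB. For applications such as the FT program, it reduces execution time by 9.4%, whereas the RMA-based parallel protocol reduces it by only 7.8%. Finally, the main factors limiting further improvement of parallel communication performance are analyzed.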
16.
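Design of the System Software for the Dawning 2000 Supercomputer (total citations: 10; self-citations: 3; citations by others: 7)
The Dawning 2000 supercomputer system adopts a scalable cluster architecture. It is a general-purpose parallel supercomputer that supports scientific and engineering computing, network services, and data-processing applications. This paper describes how the design of the Dawning 2000 system software follows the SUMA technical approach: manageability is reflected in the design of the communication software, the scalable file system, and the servers, while usability is reflected in the design of the single system image, the integrated parallel environment, and the foolproof interface. The paper details the design of the system software and its key technologies, including the communication system, the COSMOS scalable file system, the management software, and the user interface.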
17.
Weifeng Liu, Jie Zhou, Bin Gong, Hongjun Dai, Meng Guo 《International Journal of Parallel, Emergent and Distributed Systems》2020,35(3):246-259
As an alternative to traditional computing architectures, cloud computing is growing rapidly. In general, however, it is based on models such as cluster computing. Supercomputers are becoming more and more powerful, helping scientists gain a more in-depth understanding of the world. At the same time, clusters of commodity servers have become mainstream in the IT industry, powering not only large Internet services but also a growing number of data-intensive scientific applications, such as MPI-based deep learning applications. To reduce energy cost, more and more effort is being devoted to improving the energy consumption of HPC systems. Because I/O accesses account for a large portion of the execution time of data-intensive applications, it is critical to design energy-aware parallel I/O functions to address the challenges of HPC energy efficiency. As the de facto standard for designing parallel applications in cluster environments, the Message Passing Interface has been widely used in high-performance computing; obtaining energy consumption information for MPI applications is therefore critical for improving the energy efficiency of HPC systems. In this work we first present our energy measurement tool, a software framework that eases energy data collection in a cluster environment. We then present an approach that can optimise the energy efficiency of parallel I/O operations. The energy scheduling algorithm is evaluated on a cluster.
18.
A memory-based networking approach provides clusters of computers with up to 1,000 times the communication performance of conventional networks, with no compromise in cost or reliability. The performance gains of the Memory Channel for PCI (MC) are the result of a system design approach that exploits natural cluster constraints to define a memory-based network. MC implements a form of virtual shared memory that permits applications to bypass the operating system completely and perform cluster communication directly from the user level. The hardware's simple and powerful communication model supports error handling at almost no cost or complexity to the application; guaranteed ordering under errors is the key innovation. The end result: real-world cluster communication latency dropped by up to two orders of magnitude, and overhead by up to three orders of magnitude. These improvements elevate a lowly set of standard PCI computers running Unix into an impressive, highly available parallel computing system.
19.
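Software Approaches to Overcoming the Communication Bottleneck of Cluster Systems (total citations: 4; self-citations: 1; citations by others: 4)
Cluster systems are an emerging kind of parallel computing system that deliver high computing performance at relatively low cost, and they therefore have broad application prospects. Taken as a whole, a cluster is a loosely coupled architecture, and communication is the main bottleneck to improving its performance. This paper first gives a brief analysis of the communication problem in cluster systems. On that basis it discusses the important role of software in improving communication performance, focusing on ways to raise cluster performance through software methods. Experimental results show that, for many problems and on identical hardware, software methods can improve solution performance on a cluster by a factor of two or more.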
20.
C. Pedraza, E. Castillo, J. Castillo, J.L. Bosque, J.I. Martinez, O.D. Robles, J. Cano, P. Huerta 《Journal of Systems Architecture》2010,56(11):633-640
The main aim of the SMILE project is to build an efficient, low-cost cluster based on FPGA boards in order to take advantage of their reconfigurable capabilities. This paper presents the cluster architecture, describing the SMILE nodes, the high-speed communication network that connects them, and the software environment. Because simulating complex applications can be very hard, a SystemC model of the whole system has been designed to simplify this task and to provide error-free downloading and execution of applications on the cluster. The hardware–software co-design process involved in the architecture and the SystemC design is presented as well. The functionality of the SMILE cluster is tested by executing a real, complex Content-Based Information Retrieval (CBIR) parallel application, and the cluster is compared in time, power, and cost with a traditional cluster approach.