共查询到20条相似文献,搜索用时 62 毫秒
1.
2006年9月7日,英特尔发布了面向入门级服务器市场的双核至强3000系列处理器和与之配套的3000/3010芯片组。此次产品发布首次将中高端服务器处理器品牌——至强(Xeon)引入到入门级的服务器产品中。 相似文献
2.
3.
9月22日,英特尔在京发布7款45nm英特尔至强7400系列处理器产品。全面提升性能标尺,并为虚拟化性能表现设定了全新标准。 相似文献
4.
随着计算机处理器的速度不断刷新的同时,芯片的能耗也在不断提高。如何解决二者的矛盾,促使芯片生产厂商把更多的目光转向了“多内核”芯片产品。 相似文献
5.
有专家把“双核英特尔至强处理器5100系列”的面市,视为全球IT业世界杯中,最精彩的一记“世界波”。 相似文献
6.
宋世鹏 《电子计算机与外部设备》2010,(5):148-148
本刊讯3月31日,英特尔推出升级为Nehalem架构的至强7500系列处理器,该产品可用于构建从双路到最高256路的服务器系统,与已被广泛采用的上一代产品至强7400相比,其性能平均提升达3倍之多,从而在企业级性能基准测试中创下了更为领先的成绩。 相似文献
7.
8.
随着互联网应用的发展以及云计算的异军突起,企业级服务器对计算速度及功耗的要求也越来越高,强大的数据处理能力、复杂的数据库处理能力、繁重的科学数据计算以及各种庞大的网络服务需求都对服务器处理器的芯片技术提出了更高的要求。2007年9月,英特尔推出了至强7300系列服务器处理器芯片,推动了4核处理器的发展和应用。时隔一年,2008年9月22日,英特尔在北京发布了7款45纳米英特尔至强7400系列处理器产品, 相似文献
9.
日前,全新的英特尔至强处理器E5-2600产品家族与Intel Data PlaneDevelopment Kit(Intel DPDK)一起登场,它们将能够支持电信设备制造商将三种通信工作负载整合到单一的英特尔架构平台之上。这一能力使服务提供 相似文献
10.
《CAD/CAM与制造业信息化》2010,(Z1)
2010年3月17日,本刊讯,英特尔公司在北京发布了新一代智能服务器处理器——英特尔至强处理器5600系列。据了解,该系列处理器传承了英特尔创新的智能计算理念,主要用于 相似文献
11.
激光等离子体粒子模拟广泛用于探索极端物质状态下的科学问题。将一种基于粒子云网格方法的三维等离子体粒子模拟程序LARED P移植到Intel Xeon Phi协处理器上。在移植的过程中,综合运用了Native和Offload两种编程模式:首先运用Native模式对LARED P程序中热点计算任务进行优化研究,通过采用SIMD扩展指令使该计算任务获得了4.61倍的加速;然后运用Offload模式将程序移植到CPU-Intel Xeon Phi异构系统上,并通过使用异步数据传输和双缓冲技术分别提升了程序性能9.8%和21.8%。 相似文献
12.
Paweł Czarnul 《International journal of parallel programming》2017,45(5):1091-1107
The paper deals with parallelization of computing similarity measures between large vectors. Such computations are important components within many applications and consequently are of high importance. Rather than focusing on optimization of the algorithm itself, assuming specific measures, the paper assumes a general scheme for finding similarity measures for all pairs of vectors and investigates optimizations for scalability in a hybrid Intel Xeon/Xeon Phi system. Hybrid systems including multicore CPUs and many-core compute devices such as Intel Xeon Phi allow parallelization of such computations using vectorization but require proper load balancing and optimization techniques. The proposed implementation uses C/OpenMP with the offload mode to Xeon Phi cards. Several results are presented: execution times for various partitioning parameters such as batch sizes of vectors being compared, impact of dynamic adjustment of batch size, overlapping computations and communication. Execution times for comparison of all pairs of vectors are presented as well as those for which similarity measures account for a predefined threshold. The latter makes load balancing more difficult and is used as a benchmark for the proposed optimizations. Results are presented for the native mode on an Intel Xeon Phi, CPU only and the CPU \(+\) offload mode for a hybrid system with 2 Intel Xeons with 20 physical cores and 40 logical processors and 2 Intel Xeon Phis with a total of 120 physical cores and 480 logical processors. 相似文献
13.
Yang Chao-Tung Liu Jung-Chun Chan Yu-Wei Kristiani Endah Kuo Chan-Fu 《The Journal of supercomputing》2021,77(3):2486-2510
The Journal of Supercomputing - With the success of deep learning (DL) methods in diverse application domains, several deep learning software frameworks have been proposed to facilitate the usage... 相似文献
14.
Server-based networks have well-documented performance limitations. These limitations outline a major goal of Intel's embedded transport acceleration (ETA) project, the ability to deliver high-performance server communication and I/O over standard Ethernet and transmission control protocol/Internet protocol (TCP/IP) networks. By developing this capability, Intel hopes to take advantage of the large knowledge base and ubiquity of these standard technologies. With the advent of 10 gigabit Ethernet, these standards promise to provide the bandwidth required of the most demanding server applications. We use the term packet processing engine (PPE) as a generic term for the computing and memory resources necessary for communication-centric processing. Such PPEs have certain desirable attributes; the ETA project focuses on developing PPEs with such attributes, which include scalability, extensibility, and programmability. General-purpose processors, such as the Intel Xeon in our prototype, are extensible and programmable by definition. Our results show that software partitioning can significantly increase the overall communication performance of a standard multiprocessor server. Specifically, partitioning the packet processing onto a dedicated set of compute resources allows for optimizations that are otherwise impossible when time sharing the same compute resources with the operating system and applications. 相似文献
15.
In recent years, Intel promotes its new product Xeon Phi coprocessor, which is similar to the x86 architecture coprocessor. It has about 60 cores and can be regarded as a single computing node, with the computing power that cannot be ignored. This work aims to improve the workload balance by parallel loop self-scheduling scheme performed on Xeon Phi-based computer cluster. The proposed concept is implemented by hybrid MPI and OpenMP parallel programming in C language. Since parallel loop self-scheduling composes of static and dynamic allocation, weighting algorithm is adopted in the static part, while the well-known loop self-scheduling is adopted in dynamic part. The loop block is partitioned according to the weighting of MIC and HOST nodes. Accordingly, Xeon Phi with many-core is adopted to implement parallel loop self-scheduling. Finally, we test the performance in the experiments by four applicable problems: matrix multiplication, sparse matrix multiplication, Mandelbrot set and circuit meet. The experimental results indicate how to do the weight allocation and which scheduling method can achieve the best performance. 相似文献
16.
17.
Kang Ji-Hoon Hwang Jinyul Sung Hyung Jin Ryu Hoon 《The Journal of supercomputing》2021,77(9):9597-9614
The Journal of Supercomputing - Direct numerical simulations (DNS) of turbulent flows have increasing importance because they not only provide fundamental understanding of turbulent flows but also... 相似文献
18.
19.
Coronado-Barrientos Edoardo Indalecio Guillermo García-Loureiro Antonio 《The Journal of supercomputing》2018,74(6):2823-2840
The Journal of Supercomputing - This work is focused on the application of the new AXC format in iterative algorithms on the Intel Xeon Phi coprocessor to solve linear systems by accelerating the... 相似文献