首页 | 本学科首页   官方微博 | 高级检索  
     


Network interfaces for programmable NICs and multicore platforms
Authors:Andrés Ortiz  Julio Ortega  Antonio F. Díaz  Alberto Prieto
Affiliation:1. Department of Communications Engineering, University of Málaga, Spain;2. Department of Computer Architecture and Technology, University of Granada, Spain;1. Key Lab of Semiconductor Materials Science, Institute of Semiconductors, Chinese Academy of Sciences, PO Box 912, Beijing 100083, People''s Republic of China;2. ISCAS-XJTU Joint Laboratory of Functional Materials and Devices for Informatics, Beijing, People''s Republic of China;3. Xi''an Jiaotong University, Xi''an 710049, People''s Republic of China;1. Beijing University of Chemical Technology, Beijing, 100029, China;2. Institute of Physics, Chinese Academy of Sciences, Beijing, 100080, China;1. CESNET z. s. p. o., Zikova 4, 160 00 Praha, Czech Republic;2. Masaryk University, Botanická 68a, 602 00 Brno, Czech Republic
Abstract:The availability of multicore processors and programmable NICs, such as TOEs (TCP/IP Offloading Engines), provides new opportunities for designing efficient network interfaces to cope with the gap between the improvement rates of link bandwidths and microprocessor performance. This gap poses important challenges related with the high computational requirements associated to the traffic volumes and wider functionality that the network interface has to support. This way, taking into account the rate of link bandwidth improvement and the ever changing and increasing application demands, efficient network interface architectures require scalability and flexibility. An opportunity to reach these goals comes from the exploitation of the parallelism in the communication path by distributing the protocol processing work across processors which are available in the computer, i.e. multicore microprocessors and programmable NICs.Thus, after a brief review of the different solutions that have been previously proposed for speeding up network interfaces, this paper analyzes the onloading and offloading alternatives. Both strategies try to release host CPU cycles by taking advantage of the communication workload execution in other processors present in the node. Nevertheless, whereas onloading uses another general-purpose processor, either included in a chip multiprocessor (CMP) or in a symmetric multiprocessor (SMP), offloading takes advantage of processors in programmable network interface cards (NICs). From our experiments, implemented by using a full-system simulator, we provide a fair and more complete comparison between onloading and offloading. Thus, it is shown that the relative improvement on peak throughput offered by offloading and onloading depends on the rate of application workload to communication overhead, the message sizes, and on the characteristics of the system architecture, more specifically the bandwidth of the buses and the way the NIC is connected to the system processor and memory. In our implementations, offloading provides lower latencies than onloading, although the CPU utilization and interrupts are lower for onloading. Taking into account the conclusions of our experimental results, we propose a hybrid network interface that can take advantage of both, programmable NICs and multicore processors.
Keywords:
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号