Accelerated assessment of fine-grain AVF in NoC using a Multi-Cell Upsets considered fault injection |
| |
Affiliation: | 1. School of Micro Electronics, Shanghai Jiao Tong University, China;2. Cisco Research Center, USA;1. Laurentian University, 935 Ramsey Lake Rd, Greater Sudbury, ON P3E 2C6, Canada;2. Power Film Inc., 2337 230th Street, Ames, IA 50014, USA;3. MIRARCO, 935 Ramsey Lake Rd, Greater Sudbury, ON P3E 2C6, Canada;1. Glass Processing and Characterization Laboratory/College of Optics and Photonics, CREOL – University of Central Florida, 4304 Scorpius Street, Orlando, FL 32816, USA;2. School of Material Science & Engineering, Clemson University, COMSET, 161 Sirrine Hall, Clemson, SC 29634, USA;3. Semiconductor and Dimensional Metrology Division, National Institute of Standards and Technology (NIST), 100 Bureau Drive, Gaithersburg, MD 20899, USA;1. Purdue University, School of Materials Engineering, West Lafayette, IN 47907, United States;2. Purdue University, School of Mechanical Engineering, West Lafayette, IN 47907, United States |
| |
Abstract: | With the increasing threat of soft errors induced bits upset, Network on Chip (NoC) as the communication infrastructure in many-core systems has been proven a reliability bottleneck in a fault tolerant parallel system. The often-used metric Architecture Vulnerability Factor (AVF), measures the architecture-level soft error impacts to compromise the design cost of fault tolerant schemes and reliability well. As a complementary of existing estimation methods about standard IP like processor and Cache, this work aims at an accelerated fault injection methodology for the fine-grain AVF assessment in NoC via two components: (1) modeling the complex fault patterns of both Multi-Cell Upsets (MCU) and Single Bit Upset (SBU) in the standard Fault Injection (FI) method; (2) accelerating the estimation via classifying and exploiting the fine-grain metrics according to different error impacts. The comprehensive simulation results using the diverse configures (e.g., varying fault model, benchmark, traffic load, network size and fault list size) also demonstrate that the proposed approach (i) shrinks the estimation inaccuracy due to MCU patterns 18.89% underestimation in no protection case and 88.92% overestimation under ECC (Error Correction Coding) protection on average; (ii) achieves about 5× speedup without estimation accuracy loss via phased pre-analysis based on fine-grain classification; (iii) verifies ECC a cost-effective mechanism to protect NoC router: soft errors reduced by about 50% over the no protection case, with only less than 2% area overhead. |
| |
Keywords: | AVF assessment NoC Soft error MCU Acceleration Fault injection |
本文献已被 ScienceDirect 等数据库收录! |
|