Thread vulnerability in parallel applications
Authors: Isil Oz, Haluk Rahmi Topcuoglu, Mahmut Kandemir, Oguz Tosun
Affiliation:1. Computer Engineering Department, Bogazici University, 34342, Istanbul, Turkey;2. Computer Engineering Department, Marmara University, 34722, Istanbul, Turkey;3. Department of Computer Science and Engineering, The Pennsylvania State University, University Park, PA 16802, USA
Abstract: Continuously shrinking transistor sizes and aggressive low-power operating modes employed by modern architectures tend to increase transient error rates. Concurrently, multicore machines are dominating the architectural spectrum today in various application domains. These two trends require a fresh look at the resiliency of multithreaded applications against transient errors from a software perspective. In this paper, we propose and evaluate a new metric called the Thread Vulnerability Factor (TVF). A distinguishing characteristic of TVF is that its calculation for a given thread (which is typically one of the threads of a multithreaded application) does not depend on its code alone, but also on the codes of the threads that share resources and data with that thread. As a result, we decompose the TVF of a thread into two complementary parts: local and remote. While the former captures the TVF induced by the code of the target thread, the latter represents the vulnerability impact of the threads that interact with the target thread. We quantify the local and remote TVF values for three architectural components (register file, ALUs, and caches) using a set of ten multithreaded applications from the Parsec and Splash-2 benchmark suites. Our experimental evaluation shows that TVF values tend to increase as the number of cores increases, which means the system becomes more vulnerable as the core count rises. We further discuss how the TVF metric can be employed to explore performance–reliability tradeoffs in multicores. Reliability-based analysis of compiler optimizations and redundancy-based fault tolerance are also mentioned as potential uses of our TVF metric.
Keywords: Reliability, Thread Vulnerability, Fault tolerance, Multicores, TVF
This article is indexed in ScienceDirect and other databases.