Diversity and fault avoidance for dependable replication systems |
| |
Authors: | Sung-Hwa Lim, Jai-Hoon Kim |
| |
Affiliation: | Graduate School of Information and Communication, Ajou University, Suwon 443-749, South Korea |
| |
Abstract: | In a hot-standby replication system, the system can no longer process its tasks once all replicated nodes have failed. The surviving nodes should therefore be well protected against failure when some of the replicated nodes have failed. Design faults and system-specific weaknesses may cause chain reactions of common faults on identical replicated nodes. These can be alleviated by using diverse hardware and software across replicas. Going one step further, failures on the remaining nodes can be suppressed by predicting and preventing a fault once it has occurred on a replicated node. In this paper, we propose a fault avoidance scheme that increases system dependability by avoiding common faults on the remaining nodes when some nodes fail, and we analyze the resulting system dependability. |
| |
Keywords: | Fault tolerance; Distributed systems; Safety/security in digital systems |
This article is indexed in ScienceDirect and other databases. |
|