Design of algorithm-based fault-tolerant multiprocessor systems for concurrent error detection and fault diagnosis |
| |
Authors: | Vinnakota B, Jha NK |
| |
Affiliation: | Dept. of Electr. Eng., Minnesota Univ., Minneapolis, MN |
| |
Abstract: | Algorithm-based fault tolerance (ABFT) is a low-overhead system-level concurrent error detection and fault location scheme for multiprocessor systems. We present new methods for the design of ABFT systems. Our design procedure is applicable to a wide range of systems in which processors share data elements. A feature of our design approach is that the type of checks to be used in the final system can be controlled by the system designer. We also present some new bounds on the number of checks needed in ABFT system design.