An efficient protocol for checkpointing recovery in distributedsystems |
| |
Authors: | Kim J.L. Park T. |
| |
Affiliation: | Dept. of Comput. Sci., Texas A&M Univ., College Station, TX; |
| |
Abstract: | The authors present an efficient synchronized checkpointing protocol that exploits the dependency relation between processes in distributed systems. In this protocol, a process takes a checkpoint when it knows that all processes on which it computationally depends took their checkpoints, hence the process need not always wait for the decision made by the checkpointing coordinator as in the conventional synchronized protocols. As a result, the checkpointing coordination time is substantially reduced and the possibility of total abort of the checkpointing coordination is reduced |
| |
Keywords: | |
|
|