Use of common time base for checkpointing and rollback recovery ina distributed system |
| |
Authors: | Ramanathan P Shin KG |
| |
Affiliation: | Dept. of Electr. & Comput. Eng., Wisconsin Univ., Madison, WI; |
| |
Abstract: | An approach to checkpointing and rollback recovery in a distributed computing system using a common time base is proposed. A common time base is established in the system using a hardware clock synchronization algorithm. This common time base is coupled with the idea of pseudo-recovery points to develop a checkpointing algorithm that has the following advantages: reduced wait for commitment for establishing recovery lines, fewer messages to be exchanged, and less memory requirement. These advantages are assessed quantitatively by developing a probabilistic model |
| |
Keywords: | |
|
|