Rollback recovery in distributed systems using loosely synchronized clocks |
| |
Authors: | Tong Z, Kain RY, Tsai WT |
| |
Affiliation: | Bit 3 Comput. Corp., Minneapolis, MN |
| |
Abstract: | A rollback recovery scheme for distributed systems is proposed. State-save synchronization among processes is achieved by bounding clock drifts, so that no state-save synchronization messages are required. Since the clocks are only loosely synchronized, the synchronization overhead can be negligible in many applications. An interprocess communication protocol that encodes state-save progress information within message frames is introduced to checkpoint consistent system states. A rollback recovery algorithm that forces a minimum number of nodes to roll back after failures is developed. |
| |
Keywords: | |
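
The abstract above describes two mechanisms: state saves triggered by loosely synchronized local clocks, and state-save progress information carried inside message frames. The following is a minimal sketch of one way those two ideas could fit together. It is an illustration under stated assumptions, not the authors' protocol: the class names, the fixed checkpoint period, and the rule "save state before delivering a message from a later checkpoint interval" are all hypothetical choices made for this example.

```python
from dataclasses import dataclass, field


@dataclass
class Message:
    payload: str
    ckpt_index: int  # sender's checkpoint interval, piggybacked on the frame (assumed encoding)


@dataclass
class Process:
    name: str
    period: float  # checkpoint period, assumed identical on every node
    ckpt_index: int = 0
    state: list = field(default_factory=list)
    checkpoints: dict = field(default_factory=dict)

    def __post_init__(self):
        # Checkpoint 0 is the initial (empty) state.
        self.checkpoints[0] = []

    def _save(self, index):
        # Record a copy of the current state under the given checkpoint index.
        self.ckpt_index = index
        self.checkpoints[index] = list(self.state)

    def tick(self, local_clock):
        # Timer-driven state save: no synchronization message is exchanged.
        due = int(local_clock // self.period)
        if due > self.ckpt_index:
            self._save(due)

    def send(self, payload):
        return Message(payload, self.ckpt_index)

    def receive(self, msg):
        # If the sender is already in a later interval, save state first so
        # this checkpoint never records a message sent after the sender's
        # checkpoint of the same index.
        if msg.ckpt_index > self.ckpt_index:
            self._save(msg.ckpt_index)
        self.state.append(msg.payload)


# Two nodes whose clocks differ slightly (hypothetical values).
p = Process("p", period=10.0)
q = Process("q", period=10.0)

p.tick(10.2)          # p's clock has passed the first checkpoint time
q.tick(9.9)           # q's clock has not
m = p.send("m1")      # frame carries ckpt_index == 1
q.receive(m)          # q saves checkpoint 1 before delivering m1
q.tick(10.1)          # q's own timer fires later; checkpoint 1 already exists
```

In this sketch, checkpoint 1 on `q` excludes `m1`, so the pair of index-1 checkpoints on `p` and `q` contains no message that was received but never sent. Rollback itself is not modeled here.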