首页 | 本学科首页   官方微博 | 高级检索  
     


Failure detection and consensus in the crash-recovery model
Authors:Marcos Kawazoe Aguilera  Wei Chen  Sam Toueg
Affiliation:(1) Department of Computer Science, Cornell University, Ithaca, NY 14853-7501, USA (e-mail: {aguilera,sam}@cs.cornell.edu) , US;(2) Oracle Corporation, One Oracle Drive, Nashua, NH 03062, USA (e-mail: weichen@us.oracle.com) , US
Abstract:Summary. We study the problems of failure detection and consensus in asynchronous systems in which processes may crash and recover, and links may lose messages. We first propose new failure detectors that are particularly suitable to the crash-recovery model. We next determine under what conditions stable storage is necessary to solve consensus in this model. Using the new failure detectors, we give two consensus algorithms that match these conditions: one requires stable storage and the other does not. Both algorithms tolerate link failures and are particularly efficient in the runs that are most likely in practice – those with no failures or failure detector mistakes. In such runs, consensus is achieved within time and with 4 n messages, where is the maximum message delay and n is the number of processes in the system. Received: May 1998 / Accepted: November 1999
Keywords::Fault tolerance –  Failure detection –  Consensus –  Process crash –  Process recovery –  Asynchronous systems –  Stable          storage
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号