首页 | 本学科首页   官方微博 | 高级检索  
     

一种分布计算系统自适应故障侦测方法
引用本文:蔡京平,贾云得.一种分布计算系统自适应故障侦测方法[J].小型微型计算机系统,2007,28(1):136-139.
作者姓名:蔡京平  贾云得
作者单位:北京理工大学,计算机科学工程系,北京,100081
基金项目:国家高技术研究发展计划(863计划)
摘    要:面向高可靠智能应用的分布计算系统,首先提出一组故障侦测服务的QoS度量标准,其次给出一种自适应故障侦测方法.该方法使用一个无需统计行为的高度动态的计算方法,动态地估算心跳消息超时时限,并协商改变心跳消息的发送周期,以适应分布计算系统计算节点和网络状态变化,提高故障侦测服务的QoS.模拟实验表明,该方法能够适应分布计算系统状况的变化,在侦测的实时性和正确性上提供较好的平衡.

关 键 词:故障侦测  高可靠智能应用  分布计算系统  心跳法  适应性
文章编号:1000-1220(2007)01-0136-04
修稿时间:2005-10-17

An Adaptive Failure Detection Method for Distributed Computing Systems
CAI Jing-ping,JIA Yun-de.An Adaptive Failure Detection Method for Distributed Computing Systems[J].Mini-micro Systems,2007,28(1):136-139.
Authors:CAI Jing-ping  JIA Yun-de
Affiliation:Department of Computer Science and Engineering, School of Information Science and Technology, Beijing Institute of Technology, Beijing 100081, China
Abstract:This paper proposes a set of OoS metrics for failure detection service. based on the distributed computing systems of high reliable intelligent applications. An adaptable heartbeat failure detection method is then present. This method dynamically estimates the heartbeat detection timeout using an dynamic algorithm of non-probabilistic behavior. It changes the sending interval of heartbeat according to the processor load and transmission delay of the system. Simulation results show that the failure detector can achieve a compromise between a good detection time and the need of avoiding false detections. It can improve the failure detection QoS of the distributed computing systems.
Keywords:failure detection  high reliable intelligent applications  distributed computing system  heartbeat  adaptation  
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号