Journal of Computer Applications ›› 2010, Vol. 30 ›› Issue (05): 1312-1315.

• Software Process Technology •

Adaptive failure detection model based on message delay prediction

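SHI Lei (石磊)1, HOU Yaosen (侯垚森)2

  1. School of Information Engineering, South Campus, Zhengzhou University
  2. Department of Computer Science, School of Information Engineering, Zhengzhou University
  • Received: 2009-11-16 Revised: 2010-01-28 Online: 2010-05-04 Published: 2010-05-01
  • Corresponding author: SHI Lei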

Abstract: Failure detection is a key technique for achieving high availability, and it is usually implemented with a timeout mechanism that decides whether a process has failed. Because network conditions change continually, however, failure detectors without an adaptive mechanism cannot consistently meet the Quality of Service (QoS) requirements of applications. Adaptive failure detection requires the detector to adjust its detection quality dynamically as application requirements and network load change. Building on a study and analysis of existing adaptive failure detection algorithms, this paper proposes a failure detection model based on message delay prediction and implements an adaptive failure detection algorithm, PA-AFD, that predicts the next message delay from historical message delays. Experimental results and analysis show that the algorithm mitigates the effect of network delay on failure detection while preserving detection accuracy and completeness.

Key words: high availability, failure detection, Quality of Service (QoS), distributed system, adaptive
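
The idea of deriving a timeout from predicted message delay can be illustrated with a minimal sketch. The abstract does not specify the PA-AFD predictor, so the code below is not the authors' algorithm: it assumes a sliding-window mean of recent heartbeat delays as the predictor and a fixed safety margin, and the class name, method names, and parameters (`window`, `margin`) are hypothetical.

```python
from collections import deque


class DelayPredictingDetector:
    """Heartbeat failure detector whose timeout follows the predicted delay.

    Illustrative only: the predictor is a sliding-window mean, which is an
    assumption and not the predictor used by PA-AFD.
    """

    def __init__(self, window=5, margin=0.05):
        # Most recent observed heartbeat delays, in seconds.
        self.delays = deque(maxlen=window)
        # Fixed safety margin added on top of the predicted delay.
        self.margin = margin

    def observe(self, sent_at, received_at):
        """Record the delay of a heartbeat that actually arrived."""
        self.delays.append(received_at - sent_at)

    def predicted_delay(self):
        """Predict the next delay as the window mean (0.0 with no history)."""
        if not self.delays:
            return 0.0
        return sum(self.delays) / len(self.delays)

    def timeout(self):
        """Current timeout: predicted delay plus the safety margin."""
        return self.predicted_delay() + self.margin

    def suspects(self, sent_at, now):
        """Suspect the process if a heartbeat sent at sent_at is still
        missing at time now and the elapsed time exceeds the timeout."""
        return now - sent_at > self.timeout()


detector = DelayPredictingDetector(window=3, margin=0.05)
for sent, received in [(0.0, 0.10), (1.0, 1.20), (2.0, 2.30)]:
    detector.observe(sent, received)

# Delays of about 0.1, 0.2 and 0.3 s give a timeout of roughly 0.25 s.
print(detector.suspects(3.0, 3.2))  # about 0.2 s elapsed: within the timeout
print(detector.suspects(3.0, 3.3))  # about 0.3 s elapsed: exceeds the timeout
```

As observed delays grow, the timeout grows with them, so a slow network produces fewer false suspicions. A larger `margin` trades detection speed for accuracy, which is the kind of QoS trade-off the abstract refers to.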