提高用任务重复的检查点方案的性能-电子学报.PDF

提高用任务重复的检查点方案的性能-电子学报.PDF

第 5 期 电  子   学   报 Vol . 28  No . 5  2000 年 5 月 ACTA ELECTRONICA SINICA May  2000   提高用任务重复的检查点方案的性能 李凯原 ,杨孝宗 ( 哈尔滨工业大学计算机科学与工程系 ,哈尔滨 150001)   摘  要 :  设置检查点是减少程序在故障条件下执行时间的一种常用技术. 将检查点与任务重复技术相结合 ,不 仅能够完成有效的故障恢复 ,而且还能进行完善的故障检测. 上述系统的开销主要来 自两方面 :其一是每个检查点的 比较和保存开销 ,其二是因故障而引起的卷回. 本文利用增量检查点对 Ziv 和 Bruck 提出的方法进行了改进 ,改进后的 方法不仅能够有效地减少比较 、保存检查点的开销 ,而且还能够避免潜伏故障引起的卷回. 分析表明改进后的方法与 Ziv 和 Bruck 的方法相比表现出更好的性能. 关键词 :  容错 ; 检查点 ; 卷回恢复 ; 任务重复 中图分类号 :  TP3028    文献标识码 :  A    文章编号 : (2000) Improving the Performance of Checkpointing Scheme with Ta sk Duplication L I Kaiyuan ,YAN G Xiaozong ( Dept . of Comp uter Science and Engineering of Harbin Institute of Technology , Harbin 150001, China) Ab stract :  Checkpointing is a common technique for reducing the execution time of programs under fault assumption . With the combination of checkpointing and task duplication ,not only effective fault recovery but also perfect fault detection can be achieved. The overhead of such systems comes from two aspects :comparing and saving operations at each checkpoint ,and the rollbacks caused by faults. This paper improves the method presented by Ziv and Bruck by employing incremental checkpointing. The improved method can reduce the overhead of comparing and saving operation ,and moreover the rollbacks caused by latent faults can be avoided. Analysis shows that our method exhibits better performance by comparison with that of Ziv and Bruck . Key word s :  fault tolerance ;checkpoint ;rollbackrecovery ;task duplication 1  引言

文档评论(0)

1亿VIP精品文档

相关文档