Checkpointing is a very well known mechanism to achieve fault tolerance. In distributed applications where processes can checkpoint independently of each other, a local checkpoint is useful for fault tolerance purposes only if it belongs to at least one consistent global checkpoint. In this case, execution can be restarted from it without needing to rollback the execution in the past. The paper exploits a theoreticalframeworkthatfacilitatesthe definition and analysis of distributed checkpoint algorithms to avoid rollback propagation. Several distributed algorithms are presented which avoid roll-back propagation by forcing additional checkpoints in processes. The effectiveness of the algorithms is evaluated in several testbed applications, showing their limited capability of bounding the number of additional checkpoints.

Analysis and Evaluation of Distributed Checkpoint Algorithms to Avoid Roll-Back Propagation / Zambonelli, Franco. - In: IEE PROCEEDINGS. SOFTWARE. - ISSN 1462-5970. - STAMPA. - 145:(1998), pp. 212-218.

Analysis and Evaluation of Distributed Checkpoint Algorithms to Avoid Roll-Back Propagation

ZAMBONELLI, Franco
1998

Abstract

Checkpointing is a very well known mechanism to achieve fault tolerance. In distributed applications where processes can checkpoint independently of each other, a local checkpoint is useful for fault tolerance purposes only if it belongs to at least one consistent global checkpoint. In this case, execution can be restarted from it without needing to rollback the execution in the past. The paper exploits a theoreticalframeworkthatfacilitatesthe definition and analysis of distributed checkpoint algorithms to avoid rollback propagation. Several distributed algorithms are presented which avoid roll-back propagation by forcing additional checkpoints in processes. The effectiveness of the algorithms is evaluated in several testbed applications, showing their limited capability of bounding the number of additional checkpoints.
1998
145
212
218
Analysis and Evaluation of Distributed Checkpoint Algorithms to Avoid Roll-Back Propagation / Zambonelli, Franco. - In: IEE PROCEEDINGS. SOFTWARE. - ISSN 1462-5970. - STAMPA. - 145:(1998), pp. 212-218.
Zambonelli, Franco
File in questo prodotto:
Non ci sono file associati a questo prodotto.
Pubblicazioni consigliate

Licenza Creative Commons
I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/644642
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact