Checkpointing and Rollback-Recovery for Distributed Systems
From MaRDI portal
Publication:3740209
DOI10.1109/TSE.1987.232562zbMath0603.68018MaRDI QIDQ3740209
Publication date: 1987
Published in: IEEE Transactions on Software Engineering (Search for Journal in Brave)
Related Items (26)
Consistent global checkpoints based on direct dependency tracking ⋮ A compositional framework for fault tolerance by specification transformation ⋮ Evaluations of domino-free communication-induced checkpointing protocols ⋮ Resolving error propagation in distributed systems ⋮ Optimal checkpointing interval of a communication system with rollback recovery ⋮ An efficient backup warning policy for a hard disk ⋮ FNB: fast non-blocking coordinated checkpointing protocol for distributed systems ⋮ An optimistic checkpointing and message logging approach for consistent global checkpoint collection in distributed systems ⋮ Garbage collection in uncoordinated checkpointing algorithms ⋮ Efficient algorithms for optimistic crash recovery ⋮ The inhibition spectrum and the achievement of causal consistency ⋮ Communication-based prevention of useless checkpoints in distributed computations ⋮ An efficient approach for constructing reliable distributed applications ⋮ An optimality proof for asynchronous recovery algorithms in distributed systems ⋮ Concurrent common knowledge: Defining agreement for asynchronous systems ⋮ Transformation of programs for fault-tolerance ⋮ Second-level algorithms, superrecursivity, and recovery problem in distributed systems ⋮ On the no-Z-cycle property in distributed executions ⋮ Adaptive checkpointing in message passing distributed systems ⋮ Optimised Recovery with a Coordinated Checkpoint/Rollback Protocol for Domain Decomposition Applications ⋮ GUARANTEED MUTUALLY CONSISTENT CHECKPOINTING IN DISTRIBUTED COMPUTATIONS ⋮ Checkpointing with mutable checkpoints. ⋮ Rollback-dependency trackability: A minimal characterization and its protocol ⋮ A distributed error recovery technique and its implementation and application on UNIX ⋮ Virus tests to maximize availability of software systems ⋮ Interval consistency of asynchronous distributed computations
This page was built for publication: Checkpointing and Rollback-Recovery for Distributed Systems