Unreliable failure detectors for reliable distributed systems

From MaRDI portal
Publication:4371670

DOI10.1145/226643.226647zbMath0885.68021OpenAlexW2133943294WikidataQ29302224 ScholiaQ29302224MaRDI QIDQ4371670

Sam Toueg, Tushar Deepak Chandra

Publication date: 21 January 1998

Published in: Journal of the ACM (Search for Journal in Brave)

Full work available at URL: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.579.388




Related Items

The expressive power of snap-stabilizationA necessary and sufficient condition for transforming limited accuracy failure detectorsFrom binary consensus to multivalued consensus in asynchronous message-passing systemsA visit to mutual exclusion in seven datesSynchronous condition-based consensusDynamic group communicationThe Theta-Model: achieving synchrony without clocksThe Heard-Of model: computing in distributed systems with benign faultsConsensus and collision detectors in radio networksOn the computability power and the robustness of set agreement-oriented failure detector classesOn implementing omega in systems with weak reliability and synchrony assumptionsImplementing unreliable failure detectors with unknown membershipA weakly-adaptive condition-based consensus algorithm in asynchronous distributed systemsA timing assumption and two \(t\)-resilient protocols for Implementing an eventual leader service in asynchronous shared memory systemsLight-weight leases for storage-centric coordinationA distributed leader election algorithm in crash-recovery and omissive systemsFault-tolerant multiparty session typesAnonymous asynchronous systems: the case of failure detectorsThe perfectly synchronized round-based model of distributed computingIn search of lost timeAlgorithms for a distributed IDS in MANETsCorona: a stabilizing deterministic message-passing skip listThe weakest failure detector to implement a register in asynchronous systems with hybrid communicationPower and limits of distributed computing shared memory modelsRandomized \(k\)-set agreement in crash-prone and Byzantine asynchronous systemsPartial synchrony based on set timelinessFailure detectors encapsulate fairnessRenaming and the weakest family of failure detectors\(\text{Para}^2\): parameterized path reduction, acceleration, and SMT for reachability in threshold-guarded distributed algorithmsCommunication-efficient and crash-quiescent omega with unknown membershipCommunication-efficient failure detection and consensus in omission environmentsUniform reliable broadcast in anonymous distributed systems with fair lossy channelsConsensus in the presence of mortal Byzantine faulty processes\textsc{Ramos}: concurrent writing and reconfiguration for collaborative systemsPronto: high availability for standard off-the-shelf databasesMessage and time efficient consensus protocols for synchronous distributed systemsOn termination detection in crash-prone distributed systems with failure detectorsUsing asynchrony and zero degradation to speed up indulgent consensus protocolsA knowledge-theoretic analysis of uniform distributed coordination and failure detectorsLow complexity Byzantine-resilient consensusActive disk Paxos with infinitely many processesThe inherent price of indulgenceTight bounds for \(k\)-set agreement with limited-scope failure detectorsOn the importance of having an identity or, is consensus really universal?The weakest failure detector to solve nonuniform consensusThe overhead of consensus failure recoveryTransient fault detectorsBooting clock synchronization in partially synchronous systems with hybrid process and link failuresFailure detectors as type boostersThe weakest failure detectors to boost obstruction-freedomLow-latency atomic broadcast in the presence of contentionWhen consensus meets self-stabilizationThe renaming problem in shared memory systems: an introductionCommunication-optimal eventually perfect failure detection in partially synchronous systemsVerification of consensus algorithms using satisfiability solving\textsc{Rambo}: a robust, reconfigurable atomic memory service for dynamic networksThreshold protocols in survivor set systemsThe disagreement power of an adversaryOn set consensus numbersRandomization can be a healer: consensus with dynamic omission failuresThe minimum information about failures for solving non-local tasks in message-passing systemsA theory of system behaviour in the presence of node and link failureSet-constrained delivery broadcast: a communication abstraction for Read/write implementable distributed objectsGracefully degrading consensus and \(k\)-set agreement in directed dynamic networksConsensus in anonymous asynchronous systems with crash-recovery and omission failuresOn the complexity of basic abstractions to implement consensusOn mobile agent verifiable problemsDistributed fault detection and isolation of continuous-time non-linear systemsConsensus using omega in asynchronous systems with unknown membership and degenerative Byzantine failuresConsensus in rooted dynamic networks with short-lived stabilityThe weakest failure detector for eventual consistencyThe impossibility of boosting distributed service resilienceAn impossibility about failure detectors in the iterated immediate snapshot modelOn the road to the weakest failure detector for \(k\)-set agreement in message-passing systemsAn object based algebra for specifying a fault tolerant software architectureThe asynchronous bounded-cycle modelA simple and communication-efficient omega algorithm in the crash-recovery modelA simple proof of the necessity of the failure detector \(\Sigma \) to implement an atomic register in asynchronous message-passing systemsAdaptive progress: a gracefully-degrading liveness propertyRefined quorum systemsOn the completeness of bounded model checking for threshold-based distributed algorithms: reachabilityFault-management in P2P-MPIImplementing the Omega failure detector in the crash-recovery failure modelSafe termination detection in an asynchronous distributed system when processes may crash and recoverReducing \(\Omega\) to \(\lozenge\mathcal W\)Increasing the resilience of distributed and replicated database systemsA flexible formal framework for masking/demasking faultsConsensus in Byzantine asynchronous systemsCommunication-efficient randomized consensusOn modelling mobilityRevisiting the PAXOS algorithmContention-related crash failures: definitions, agreement algorithms, and impossibility resultsAsynchronous bounded lifetime failure detectorsUsing the heartbeat failure detector for quiescent reliable communication and consensus in partitionable networksOn the round complexity of Byzantine agreement without initial set-upByzantine-resistant total ordering algorithms.Making Byzantine consensus liveOn the hardness of failure-sensitive agreement problems.Restricted failure detectors: Definition and reduction protocolsOptimistic atomic broadcast: A pragmatic viewpointExtracting Symbolic Transitions from TLA$$^{+}$$+ SpecificationsA case study on parametric verification of failure detectorsSurvey on Parameterized Verification with Threshold Automata and the Byzantine Model CheckerEffective multicast programming in large scale distributed systemsOpen consensusUnnamed ItemOn the weakest failure detector everWhat You Always Wanted to Know About Model Checking of Fault-Tolerant Distributed AlgorithmsCommunication Patterns and Input Patterns in Distributed ComputingA Separation of n-consensus and (n + 1)-consensus Based on Process SchedulingA weakest failure detector-based asynchronous consensus protocol for \(f<n\)Correctness proof of a database replication protocol under the perspective of the I/O automaton modelSynthesis of distributed algorithms with parameterized threshold guardsAn Eventually Perfect Failure Detector for Networks of Arbitrary Topology Connected with ADD Channels Using Time-To-Live ValuesImplementing ♢P with Bounded Messages on a Network of ADD ChannelsSecure consensus with distributed detection via two-hop communicationAgreeing within a few writesData-driven mixed-integer linear programming-based optimisation for efficient failure detection in large-scale distributed systemsFault tolerant network constructorsUniform atomic broadcast and consensus in fully anonymous synchronous systems with crash failuresThe Iterated Restricted Immediate Snapshot ModelSelf-stabilizing indulgent zero-degrading binary consensusHow to Solve Consensus in the Smallest Window of SynchronyThe Weakest Failure Detector for Message Passing Set-AgreementUsing Bounded Model Checking to Verify Consensus AlgorithmsX-Ability: a theory of replicationNon-blocking atomic commit in asynchronous distributed systems with failure detectorsHandling message semantics with Generic Broadcast protocolsHundreds of impossibility results for distributed computingRandomized protocols for asynchronous consensusAppraising two decades of distributed computing theory researchCondition-based consensus solvability: a hierarchy of conditions and efficient protocolsDistributed consensus, revisitedEventually perfect predicate detection in crash-affected finite average response time systemsFormal Model–Driven Design of Distributed AlgorithmsWhat Can be Computed in a Distributed System?Tutorial on Parameterized Model Checking of Fault-Tolerant Distributed AlgorithmsGenuine atomic multicast in asynchronous distributed systemsExperiences with object group systemsDeterministic Models of Communication FaultsUnnamed ItemUnnamed ItemAn infrastructure to support cooperation of knowledge-level agents on the semantic gridOn Optimal Probabilistic Asynchronous Byzantine AgreementWait-Free Dining Under Eventual Weak ExclusionThe DHCP Failover Protocol: A Formal PerspectiveGeneralized Symmetry Breaking Tasks and Nondeterminism in Concurrent ObjectsPacket efficient implementation of the Omega failure detectorTowards a Universal Approach for the Finite Departure Problem in Overlay NetworksWait-freedom with advicePerfect failure detection with very few bitsSnap-stabilizing tasks in anonymous networksStructuring unreliable radio networksFault-Tolerant Consensus with an Abstract MAC Layer.Characterizing Consensus in the Heard-Of ModelFair Exchange Is Incomparable to ConsensusUnnamed ItemMuch Ado About Nothing?Simultaneous Consensus vs Set Agreement: A Message-Passing-Sensitive Hierarchy of Agreement Problems