Failure Detectors and Related Topics


Main related papers

  1. M. J. Fischer, N. A. Lynch and M. S. Paterson.
    Impossibility of Distributed Consensus with One Faulty Process.
    Journal of the ACM, 32(2): 374-382, April 1985. [FLP1985]

  2. D. Dolev, C. Dwork and L. Stockmeyer.
    On the minimal synchronism needed for distributed consensus.
    Journal of the ACM, 34(1): 77-97, January 1987. [DDS1987*]

  3. C. Dwork, N. A. Lynch and L. Stockmeyer.
    Consensus in the presence of partial synchrony.
    Journal of the ACM, 35(2): 288-323, April 1988. [DLS1988*]

  4. R. Guerraoui and A. Schiper.
    Consensus: the big misunderstanding.
    In Proceedings of the 6th IEEE Computer Society Workshop on Future Trends in Distributed Computing Systems (FTDCS-6), pages 183-188, Tunis, Tunisia, October 1997. IEEE Computer Society Press. [GS1997]

  5. F. Cristian and C. Fetzer.
    The Timed Asynchronous Distributed System Model.
    IEEE Transactions on Parallel and Distributed Systems, 10(6): 642-657, June 1999. [CF1999]

Papers on the theory of failure detectors

  1. Tushar Deepak Chandra, Sam Toueg.
    Unreliable Failure Detectors for Reliable Distributed Systems.
    JACM 43(2): 225-267 (1996). [CT1996]

  2. Tushar Deepak Chandra, Vassos Hadzilacos, Sam Toueg.
    The Weakest Failure Detector for Solving Consensus.
    JACM 43(4): 685-722 (1996). [CHT1996]

  3. Marcos Kawazoe Aguilera, Sam Toueg, Borislav Deianov.
    Revisiting the Weakest Failure Detector for Uniform Reliable Broadcast.
    DISC 1999: 19-33. [ATD1999a*]

  4. Marcos Kawazoe Aguilera, Wei Chen, Sam Toueg.
    Failure Detection and Consensus in the Crash-Recovery Model.
    Distributed Computing 13(2): 99-125 (2000). [ACT2000a]

  5. Bernadette Charron-Bost, Rachid Guerraoui, and André Schiper.
    Synchronous system and perfect failure detector: solvability and efficiency issues.
    In Proceedings of the IEEE Int. Conf. on Dependable Systems and Networks (DSN), pages 523-532, New York, USA, June 2000. IEEE Computer Society. [C-BGS2000]

  6. Michel Raynal.
    Quiescent Uniform Reliable Broadcast as an Introductory Survey to Failure Detector Oracles.
    IRISA/INRIA, Technical Report 1356, Rennes, France, October 2000. [Raynal2000]

Papers on failure detector protocols and implementations

  1. Christof Fetzer and Flaviu Cristian.
    Fail-Aware Failure Detectors. [FC]

  2. António Casimiro and Paulo Veríssimo.
    Timing Failure Detection with a Timed Computing Base. [CV]

  3. R. Guerraoui and A. Schiper.
    Gamma-accurate failure detectors.
    In Proceedings of the 10th International Workshop on Distributed Algorithms (WDAG-10), LNCS 1151, Bologna, Italy, October 1996. Springer-Verlag. [GS1996]

  4. Marcos Kawazoe Aguilera, Wei Chen, Sam Toueg.
    Heartbeat: A Timeout-Free Failure Detector for Quiescent Reliable Communication.
    WDAG 1997: 126-140. [ACT1997]

  5. Pascal Felber, Xavier Défago, Rachid Guerraoui, and Philipp Oser.
    Failure Detectors as First Class Objects. [FDGO]

  6. A. Doudou, B. Garbinato, R. Guerraoui, and A. Schiper.
    Muteness failure detectors: Specification and implementation.
    In Proceedings 3rd European Dependable Computing Conference (EDCC-3), LNCS 1667, pages 71-87, Prague, Czech Republic, September 1999. [DGGS1999]

  7. Raimundo Macêdo.
    Failure Detection in Asynchronous Distributed Systems.
    II Workshop de Testes e Tolerância a Falhas (II WTF 2000), Curitiba, PR, July 2000. [Macedo2000]

  8. Christof Fetzer, Michel Raynal and Frédéric Tronel.
    A Failure Detection Protocol Based on a Lazy Approach.
    IRISA/INRIA, Technical Report 1367, Rennes, France, November 2000. [FRT2000]

  9. Wei Chen, Sam Toueg and Marcos Kawazoe Aguilera.
    On the Quality of Service of Failure Detectors.
    DSN'2000, June 2000. [CTA2000]

  10. Mikel Larrea, Antonio Fernández and Sergio Arévalo.
    Optimal Implementation of the Weakest Failure Detector to Solve Consensus. [LFA]

  11. Nicole Sergent, Xavier Défago and André Schiper.
    Failure Detectors: implementation issues and impacts in consensus performance.
    1999. [SDS1999]

  12. Raul Ceretta Nunes.
    Self-Tuned Failure Detectors.
    I Workshop de Teses e Dissertações (IWTD/SCTF'2001), Florianópolis, March 2001. [Nunes2001]

Papers on solving problems using failure detectors

  1. A. Schiper and A. Ricciardi.
    Virtually-synchronous communication based on a weak failure suspector.
    In Proceedings of the 23rd International Symposium on Fault-Tolerant Computing (FTCS-23), pages 534-543, Toulouse, France, June 1993. [SR1993]

  2. R. Guerraoui, M. Larrea, and A. Schiper.
    Non blocking atomic commitment with an unreliable failure detector.
    In Proceedings of the 14th Symposium on Reliable Distributed Systems (SRDS-14), pages 41-50, Bad Neuenahr, Germany, September 1995. [GLS1995]

  3. A. Schiper.
    Early consensus in an asynchronous system with a weak failure detector.
    Distributed Computing, 10(3):149-157, April 1997. [Schiper1997*]

  4. Marcos Kawazoe Aguilera, Sam Toueg.
    Failure Detection and Randomization: A Hybrid Approach to Solve Consensus.
    SIAM J. Comput. 28(3): 890-903 (1998). [AT1998]

  5. Marcos Kawazoe Aguilera, Wei Chen, Sam Toueg.
    Using the Heartbeat Failure Detector for Quiescent Reliable Communication and Consensus in Partitionable Networks.
    TCS 220(1): 3-30 (1999). [ACT1999]

  6. Fabíola Greve, Michel Hurfin, Raimundo Macêdo and Michel Raynal.
    Consensus Based On Strong Failure Detectors: A Time and Message-Efficient Protocol.
    In: IEEE International Workshop on Fault-Tolerant Parallel and Distributed Systems, Cancun, Springer Verlag, v. 1800, p.1258-1267, 2000. [GHMR2000]

  7. Achour Mostefaoui and Michel Raynal.
    Consensus Based on Failure Detectors with a Perpetual Accuracy Property.
    Proceedings of the 14th International Parallel and Distributed Processing Symposium (IPDPS'00), 2000. [MR2000a]

Papers on related topics

  1. A. Ricciardi, A. Schiper, and K. Birman.
    Understanding partitions and the "no partition" assumption.
    In Proceedings of the 4th IEEE Computer Society Workshop on Future Trends in Distributed Computing Systems (FTDCS-4), pages 354-360, Lisbon, Portugal, September 1993. [RSB1993]

  2. Paulo Veríssimo and Carlos Almeida.
    Quasi-synchronism: a step away from the traditional fault-tolerant real-time system models. [VA]

  3. P. Urbán, X. Défago, and A. Schiper.
    Contention-aware metrics for distributed algorithms: Comparison of atomic broadcast algorithms.
    In Proceedings of the 9th IEEE International Conference on Computer Communications and Networks (IC3N 2000), October 2000. [UDS2000]

  4. R. Guerraoui, M. Hurfin, A. Mostefaoui, R. Oliveira, M. Raynal, and A. Schiper.
    Consensus in asynchronous distributed systems: A concise guided tour.
    In S. Krakowiak and S. Shrivastava, editors, Advances in Distributed Systems, LNCS 1752, pages 33-47. Springer, 2000. [GHMORS2000*]

  5. Marcos Kawazoe Aguilera, Wei Chen, Sam Toueg.
    On Quiescent Reliable Communication.
    SIAM J. Comput. 29(6): 2040-2073 (2000). [ACT2000b]

  6. Bernadette Charron-Bost, Sam Toueg, Anindya Basu.
    Revisiting Safety and Liveness in the Context of Failures.
    CONCUR 2000: 552-565. [C-BTB2000*]

  7. Paulo Veríssimo and António Casimiro.
    The Timely Computing Base.
    Departamento de Informática, Faculdade de Ciências da Universidade de Lisboa, TR-99-2, May 1999. [VC1999]

  8. Paulo Veríssimo, António Casimiro and Christof Fetzer.
    The Timely Computing Base: Timely Actions in the Presence of Uncertain Timeliness.
    DSN'2000, June 2000. [VCF2001]

  9. Rachid Guerraoui and André Schiper.
    The generic consensus service.
    IEEE Transactions on Software Engineering, 27(1):29-41, January 2001. [GS2001]

  10. F. V. Brasileiro and L. M. R. Sampaio.
    A Practical Approach to Validate the Design of Fault-Tolerant Distributed Protocols for Asynchronous Systems.
    Anais do IX Simpósio de Computadores Tolerantes a Falhas, Florianópolis, Brazil, March 2001. [BS2001]