A fault tolerance approach for distributed systems using monitoring based replication

Bibliographic Details
Published in: 2010 IEEE International Conference on Intelligent Computer Communication and Processing, pp. 451-458
Main Authors: Costan, A.; Dobre, C.; Pop, F.; Leordeanu, C.; Cristea, V.
Format: Conference Proceeding
Language: English
Published: IEEE, 01.08.2010
ISBN: 9781424482283, 1424482283
DOI: 10.1109/ICCP.2010.5606398

More Information
Summary: High availability is a desired feature of a dependable distributed system. Replication is a well-known technique to achieve fault tolerance in distributed systems, thereby enhancing availability. We propose an approach relying on replication techniques and based on monitoring information to be applied in distributed systems for fault tolerance. Our approach uses both active and passive strategies to implement an optimistic replication protocol. Using a proxy to handle service calls and relying on service replication strategies, we effectively deal with the complexity and overhead issues. This paper presents an architecture for implementing the proxy based on monitoring data and the replication management. Experimentation and application testing using an implementation of the architecture is presented. The architecture is demonstrated to be a viable technique for increasing dependability in distributed systems.