A fault tolerance approach for distributed systems using monitoring based replication
| Published in | 2010 IEEE International Conference on Intelligent Computer Communication and Processing, pp. 451-458 |
|---|---|
| Format | Conference Proceeding | 
| Language | English | 
| Published | IEEE, 01.08.2010 |
| ISBN | 9781424482283 1424482283  | 
| DOI | 10.1109/ICCP.2010.5606398 | 
| Summary: | High availability is a desired feature of a dependable distributed system. Replication is a well-known technique to achieve fault tolerance in distributed systems, thereby enhancing availability. We propose a fault tolerance approach for distributed systems that relies on replication techniques and on monitoring information. Our approach uses both active and passive strategies to implement an optimistic replication protocol. Using a proxy to handle service calls and relying on service replication strategies, we effectively deal with the complexity and overhead issues. This paper presents an architecture for implementing the monitoring-based proxy and the replication management. Experimentation and application testing using an implementation of the architecture are presented. The architecture is demonstrated to be a viable technique for increasing dependability in distributed systems. |
|---|---|