Data Management in Erasure-Coded Distributed Storage Systems
Most data centers around the world (Google, Facebook, Amazon and so on) uses Replication as the major safeguard to provide redundancy in cases of failures, which are quite common in such environment. Diverse research has been done in the area of Replication within data center, spanning from data con...
        Saved in:
      
    
          | Published in | 2020 20th IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGRID) pp. 902 - 907 | 
|---|---|
| Main Authors | , | 
| Format | Conference Proceeding | 
| Language | English | 
| Published | 
            IEEE
    
        01.05.2020
     | 
| Subjects | |
| Online Access | Get full text | 
| DOI | 10.1109/CCGrid49817.2020.00018 | 
Cover
| Summary: | Most data centers around the world (Google, Facebook, Amazon and so on) uses Replication as the major safeguard to provide redundancy in cases of failures, which are quite common in such environment. Diverse research has been done in the area of Replication within data center, spanning from data consistency (quorum/consensus algorithms), degraded reads, data placement (subject to network topology, physical distribution over multiple data centers, etc.), NoSQL functionalities such as leveraging on data locality while executing Hadoop Map-Reduce task. The major drawback of replication is the amount of storage capacity requirement, which is 300% the actual data. Erasure Code is tipped to be the next best alternative to providing redundancy within data centers. And in doing so, a lot of concepts need revisiting. Performance and Bandwidth are major concerns with erasure codes, and there are other open areas that needs adapted solution. For example security and energy-saving. Erasure coding allows for greater flexibility - for instance, by allowing some data center nodes to be switched off, or by allowing for more options to spread load, etc. The high level objective of the PhD is thus to derive a holistic data management solution for erasure coding based distributed storage systems. | 
|---|---|
| DOI: | 10.1109/CCGrid49817.2020.00018 |