Encoding-Aware Data Placement for Efficient Degraded Reads in XOR-Coded Storage Systems: Algorithms and Evaluation

Bibliographic Details
Published in IEEE Transactions on Parallel and Distributed Systems, Vol. 29, no. 12, pp. 2757-2770
Main Authors Shen, Zhirong; Lee, Patrick P. C.; Shu, Jiwu; Guo, Wenzhong
Format Journal Article
Language English
Published New York, IEEE (The Institute of Electrical and Electronics Engineers, Inc.), 01.12.2018
ISSN 1045-9219 (print), 1558-2183 (electronic)
DOI 10.1109/TPDS.2018.2842210

Abstract Modern storage systems adopt erasure coding to maintain fault tolerance with low storage redundancy. However, how to improve the performance of degraded reads in erasure-coded storage has been a critical issue. We revisit this problem from two different perspectives that are neglected by existing studies: data placement and encoding rules. To this end, we propose encoding-aware data placement (EDP), which mitigates the number of I/Os in degraded reads during a single failure for general XOR-based erasure codes. EDP carefully selects appropriate parity units to be generated by sequential data based on the encoding rules and establishes their generation orders. We further refine the data placement for optimizing the degraded reads to any two sequential data units. Trace-driven evaluation results show that EDP significantly reduces I/Os in degraded reads and hence shortens the read time.
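
The abstract's central observation, that the number of I/Os in a degraded read depends on how the requested sequential data relate to the encoding rules, can be illustrated with a small sketch. The following is not the EDP algorithm from the article. It is a toy XOR code with a hypothetical layout (the unit names, the `RULES` table, and the helper functions are all assumptions made for illustration), and it assumes a single lost data unit whose parity units remain readable. It only shows that repairing a lost unit through a parity whose equation overlaps the data already being read needs fewer I/Os.

```python
from functools import reduce


def xor_bytes(blocks):
    """XOR a list of equal-length byte strings column by column."""
    return bytes(reduce(lambda a, b: a ^ b, col) for col in zip(*blocks))


# Hypothetical encoding rules: each parity unit is the XOR of the listed data units.
RULES = {
    "p_row0": ["d0", "d1"],
    "p_row1": ["d2", "d3"],
    "p_diag0": ["d0", "d3"],
    "p_diag1": ["d1", "d2"],
}


def degraded_read_plan(requested, lost, rules):
    """Pick the parity that repairs `lost` with the fewest units read in total.

    Returns (parity_name, set_of_units_to_read). Units that the read request
    already needs are counted once, so overlapping equations are cheaper.
    """
    surviving = set(requested) - {lost}
    best = None
    for parity, members in sorted(rules.items()):
        if lost not in members:
            continue  # this parity cannot repair the lost unit
        needed = surviving | (set(members) - {lost}) | {parity}
        if best is None or len(needed) < len(best[1]):
            best = (parity, needed)
    return best


def recover(lost, parity, rules, store):
    """Rebuild the lost unit by XOR-ing the parity with the other members."""
    helpers = [store[u] for u in rules[parity] if u != lost]
    return xor_bytes(helpers + [store[parity]])


# Toy contents: four data units plus the parities derived from RULES.
STORE = {"d0": b"\x01\x02", "d1": b"\x10\x20", "d2": b"\x0f\x0f", "d3": b"\xaa\x55"}
for name, members in RULES.items():
    STORE[name] = xor_bytes([STORE[m] for m in members])

# A read of the sequential units d0, d1 while d1 is unavailable.
parity, units = degraded_read_plan(["d0", "d1"], "d1", RULES)
rebuilt = recover("d1", parity, RULES, STORE)
```

In this toy layout the row parity is selected because its equation contains `d0`, which the request reads anyway; repairing through the diagonal parity would add a read of `d2` as well. Which parities real codes offer, and how data should be placed so that such overlaps occur, is the subject of the article itself.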
Author Details
1. Shen, Zhirong
   ORCID 0000-0003-2673-5868
   Email zhirong.shen2601@gmail.com
   Organization Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong
2. Lee, Patrick P. C.
   ORCID 0000-0002-4501-4364
   Email pclee@cse.cuhk.edu.hk
   Organization Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong
3. Shu, Jiwu
   Email shujw@tsinghua.edu.cn
   Organization Department of Computer Science and Technology, Tsinghua University, Beijing, China
4. Guo, Wenzhong
   ORCID 0000-0003-4118-8823
   Email guowenzhong@fzu.edu.cn
   Organization Fujian Provincial Key Laboratory of Network Computing and Intelligent Information Processing, Key Laboratory of Spatial Data Mining & Information Sharing, College of Mathematics and Computer Sciences, and Ministry of Education, Fuzhou University, Fuzhou, China
CODEN ITDSEO
Copyright Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2018
Discipline Engineering
Computer Science
Grant Information
Fujian Provincial Natural Science Foundation, grant 2017J05102
Research Grants Council of Hong Kong, grants GRF 14216316; CRF C7036-15G
Technology Innovation Platform Project of Fujian Province, grant 2014H2005
National Natural Science Foundation of China (funder ID 10.13039/501100001809), grants 61602120; 61327902; 61433008; U1435216; 61672159; U1705262
Fujian Engineering Research Center of Big Data Analysis and Processing
Fujian Collaborative Innovation Center for Big Data Application in Governments
Peer Reviewed Yes
Scholarly Yes
License https://ieeexplore.ieee.org/Xplorehelp/downloads/license-information/IEEE.html
Subject Terms
Coding
Decoding
Degraded reads
Distributed databases
Encoding
Encoding-aware data placement
Fault tolerance
Fault tolerant systems
Layout
Performance degradation
Performance enhancement
Placement
Redundancy
Storage systems
Systems analysis
XOR-coded storage systems
URI https://ieeexplore.ieee.org/document/8370044
https://www.proquest.com/docview/2132059300