Analysis of Reliability Dynamics of SSD RAID

Solid-state drives (SSDs) have been widely deployed in desktops and data centers. However, SSDs suffer from bit errors, and the bit error rate is time dependent since it increases as an SSD wears down. Traditional storage systems mainly use parity-based RAID to provide reliability guarantees by stri...

Full description

Saved in:
Bibliographic Details
Published inIEEE transactions on computers Vol. 65; no. 4; pp. 1131 - 1144
Main Authors Li, Yongkun, Lee, Patrick P.C., Lui, John C.S.
Format Journal Article
LanguageEnglish
Published New York IEEE 01.04.2016
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects
Online AccessGet full text
ISSN0018-9340
1557-9956
DOI10.1109/TC.2014.2349505

Cover

Abstract Solid-state drives (SSDs) have been widely deployed in desktops and data centers. However, SSDs suffer from bit errors, and the bit error rate is time dependent since it increases as an SSD wears down. Traditional storage systems mainly use parity-based RAID to provide reliability guarantees by striping redundancy across multiple devices, but the effectiveness of traditional RAID schemes in SSDs remains debatable. In particular, an open problem is how different parity distributions over multiple devices influence the reliability of an SSD RAID array. That is, should we evenly distribute patsaverties as suggested by conventional wisdom, or unevenly distribute parties as recently proposed for SSD RAID? To address this fundamental problem, we propose the first analytical model to quantify the reliability dynamics of an SSD RAID array as it ages. Specifically, we develop a "non-homogeneous" continuous time Markov chain model, and derive the transient reliability solution. We validate our model via trace-driven simulation and conduct numerical analysis to analyze the reliability dynamics of SSD RAID arrays subject to different parity distributions, error rates, and SSD array configurations. Our model enables system practitioners to decide the appropriate parity distribution based on their reliability requirements.
AbstractList Solid-state drives (SSDs) have been widely deployed in desktops and data centers. However, SSDs suffer from bit errors, and the bit error rate is time dependent since it increases as an SSD wears down. Traditional storage systems mainly use parity-based RAID to provide reliability guarantees by striping redundancy across multiple devices, but the effectiveness of traditional RAID schemes in SSDs remains debatable. In particular, an open problem is how different parity distributions over multiple devices influence the reliability of an SSD RAID array. That is, should we evenly distribute patsaverties as suggested by conventional wisdom, or unevenly distribute parties as recently proposed for SSD RAID? To address this fundamental problem, we propose the first analytical model to quantify the reliability dynamics of an SSD RAID array as it ages. Specifically, we develop a "non-homogeneous" continuous time Markov chain model, and derive the transient reliability solution. We validate our model via trace-driven simulation and conduct numerical analysis to analyze the reliability dynamics of SSD RAID arrays subject to different parity distributions, error rates, and SSD array configurations. Our model enables system practitioners to decide the appropriate parity distribution based on their reliability requirements.
Author Li, Yongkun
Lui, John C.S.
Lee, Patrick P.C.
Author_xml – sequence: 1
  givenname: Yongkun
  surname: Li
  fullname: Li, Yongkun
  email: yongkunlee@gmail.com
  organization: School of Computer Science and Technology, University of Science and Technology of China, Hefei, Anhui, China
– sequence: 2
  givenname: Patrick P.C.
  surname: Lee
  fullname: Lee, Patrick P.C.
  email: pclee@cse.cuhk.edu.hk
  organization: Department of Computer Science and Engineering, Chinese University of Hong Kong, Shatin, Hong Kong
– sequence: 3
  givenname: John C.S.
  surname: Lui
  fullname: Lui, John C.S.
  email: cslui@cse.cuhk.edu.hk
  organization: Department of Computer Science and Engineering, Chinese University of Hong Kong, Shatin, Hong Kong
BookMark eNp9kD1rwzAURUVJoUnauUMXQ9c6eZIsWRqD049AoJB4F7IigYJjp5Iz-N_XaUKHDp0ePO65XM4EjZq2sQg9YphhDHJeFjMCOJsRmkkG7AaNMWN5KiXjIzQGwCKVNIM7NIlxDwCcgByjl0Wj6z76mLQu2dja68rXvuuTZd_ogzc__-12mWwWq-U9unW6jvbheqeofHsti490_fm-Khbr1BAhu5TlhuCcYSeYo3RYYTXLaM6JqCpiqKCWSE3kDksud0zQyubghDWApdZO0il6vtQeQ_t1srFT-_YUhp1R4VxwmWGSkSHFLikT2hiDdcr4Tne-bbqgfa0wqLMXVRbq7EVdvQzc_A93DP6gQ_8P8XQhvLX2N82FAMo5_QZnsWug
CODEN ITCOB4
CitedBy_id crossref_primary_10_1109_TCAD_2023_3348033
crossref_primary_10_1109_TDMR_2021_3108941
crossref_primary_10_1109_TC_2019_2962691
crossref_primary_10_1109_TIM_2022_3227983
crossref_primary_10_4018_IJGHPC_2017100101
crossref_primary_10_1109_TCAD_2020_2974346
crossref_primary_10_1145_3299710_3211338
crossref_primary_10_1109_TDMR_2017_2749639
crossref_primary_10_1109_TCAD_2020_3020495
crossref_primary_10_1145_3627992
crossref_primary_10_3390_fi15100319
crossref_primary_10_1109_ACCESS_2019_2944764
crossref_primary_10_1109_TC_2020_2978823
Cites_doi 10.1080/03461238.1953.10419459
10.1145/1519065.1519081
10.1145/1982185.1982266
10.1145/2367589.2367603
10.1109/SRDS.2013.16
10.1145/1629435.1629459
10.1145/1089733.1089735
10.1145/2093139.2093143
10.1006/jpdc.1993.1013
10.1145/1534530.1534544
10.1109/ICCNC.2012.6167470
10.1007/978-1-4757-4828-4_3
10.1109/12.543706
10.1007/978-3-642-24403-2_20
10.1115/1.4010337
10.1145/1669112.1669118
10.1145/1807060.1807061
10.1145/1987816.1987835
10.1109/DSN.2013.6575359
10.1080/15326348908807130
10.1109/RELPHY.2008.4558857
10.1109/RELDIS.1997.632790
10.1145/1555349.1555371
10.1145/2349896.2349900
10.1109/CMPCON.1989.301913
10.1109/ISCIT.2009.5341269
10.1145/50202.50214
10.1109/TC.2010.197
10.1109/FTCS.1993.627346
10.1145/2465529.2465546
ContentType Journal Article
Copyright Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2016
Copyright_xml – notice: Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2016
DBID 97E
RIA
RIE
AAYXX
CITATION
7SC
7SP
8FD
JQ2
L7M
L~C
L~D
DOI 10.1109/TC.2014.2349505
DatabaseName IEEE Xplore (IEEE)
IEEE All-Society Periodicals Package (ASPP) 1998–Present
IEEE Xplore
CrossRef
Computer and Information Systems Abstracts
Electronics & Communications Abstracts
Technology Research Database
ProQuest Computer Science Collection
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
DatabaseTitle CrossRef
Technology Research Database
Computer and Information Systems Abstracts – Academic
Electronics & Communications Abstracts
ProQuest Computer Science Collection
Computer and Information Systems Abstracts
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts Professional
DatabaseTitleList Technology Research Database

Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
Computer Science
EISSN 1557-9956
EndPage 1144
ExternalDocumentID 4045665361
10_1109_TC_2014_2349505
6880366
Genre orig-research
GrantInformation_xml – fundername: National Natural Science Foundation of China; National Nature Science Foundation of China
  grantid: 61303048
  funderid: 10.13039/501100001809
– fundername: Fundamental Research Funds for the Central Universities
  grantid: WK0110000040
– fundername: CUHK MoE-Microsoft Key Laboratory of Human-centric Computing and Interface Technologies
GroupedDBID --Z
-DZ
-~X
.DC
0R~
29I
4.4
5GY
6IK
85S
97E
AAJGR
AARMG
AASAJ
AAWTH
ABAZT
ABQJQ
ABVLG
ACGFO
ACIWK
ACNCT
AENEX
AETEA
AGQYO
AGSQL
AHBIQ
AKQYR
ALMA_UNASSIGNED_HOLDINGS
ASUFR
ATWAV
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CS3
DU5
EBS
EJD
HZ~
IEDLZ
IFIPE
IPLJI
JAVBF
LAI
M43
MS~
O9-
OCL
P2P
PQQKQ
RIA
RIE
RNS
RXW
TAE
TN5
TWZ
UHB
UPT
XZL
YZZ
AAYXX
CITATION
7SC
7SP
8FD
JQ2
L7M
L~C
L~D
ID FETCH-LOGICAL-c289t-57c21751f85f33155ea5437628bb2c383e29a29d1969d583be70f8ec019aaf93
IEDL.DBID RIE
ISSN 0018-9340
IngestDate Sun Jun 29 13:47:55 EDT 2025
Thu Apr 24 22:56:41 EDT 2025
Tue Jul 01 00:27:35 EDT 2025
Wed Aug 27 08:35:49 EDT 2025
IsPeerReviewed true
IsScholarly true
Issue 4
Keywords CTMC
transient analysis
RAID
reliability
Solid-state drives
Language English
License https://ieeexplore.ieee.org/Xplorehelp/downloads/license-information/IEEE.html
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c289t-57c21751f85f33155ea5437628bb2c383e29a29d1969d583be70f8ec019aaf93
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
PQID 1786941242
PQPubID 85452
PageCount 14
ParticipantIDs proquest_journals_1786941242
crossref_primary_10_1109_TC_2014_2349505
ieee_primary_6880366
crossref_citationtrail_10_1109_TC_2014_2349505
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2016-April-1
2016-4-1
20160401
PublicationDateYYYYMMDD 2016-04-01
PublicationDate_xml – month: 04
  year: 2016
  text: 2016-April-1
  day: 01
PublicationDecade 2010
PublicationPlace New York
PublicationPlace_xml – name: New York
PublicationTitle IEEE transactions on computers
PublicationTitleAbbrev TC
PublicationYear 2016
Publisher IEEE
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Publisher_xml – name: IEEE
– name: The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
References hess (ref14) 0
ref34
ref12
ref37
ref15
ref31
moon (ref28) 2013
ref30
ref33
ref11
ref32
ref2
ref39
ref17
ref38
ref19
muntz (ref29) 1990
ref18
im (ref16) 2011; 60
weibull (ref36) 1951; 18
ref24
ref23
ref26
ref25
ref20
ref22
ref21
agrawal (ref1) 2008
ref27
enderle (ref10) 0
ref8
grupp (ref13) 2012
ref7
cai (ref5) 2012
bucy (ref3) 2008
ref9
ref4
ref6
sun (ref35) 2011
References_xml – start-page: 521
  year: 2012
  ident: ref5
  article-title: Error patterns in MLC NAND flash memory: Measurement, characterization, and analysis
  publication-title: Proc Conf Des Autom Test Eur
– ident: ref17
  doi: 10.1080/03461238.1953.10419459
– ident: ref30
  doi: 10.1145/1519065.1519081
– start-page: 2
  year: 2012
  ident: ref13
  article-title: The bleak future of NAND flash memory
  publication-title: Proc 10th USENIX Conf File Storage Technol
– ident: ref21
  doi: 10.1145/1982185.1982266
– ident: ref9
  doi: 10.1145/2367589.2367603
– start-page: 57
  year: 2008
  ident: ref1
  article-title: Design tradeoffs for SSD performance
  publication-title: Proc Annu Tech Conf
– year: 2011
  ident: ref35
  article-title: Quantifying reliability of solid-state storage from multiple aspects
  publication-title: Proc SNAPI
– ident: ref23
  doi: 10.1109/SRDS.2013.16
– ident: ref22
  doi: 10.1145/1629435.1629459
– ident: ref11
  doi: 10.1145/1089733.1089735
– ident: ref26
  doi: 10.1145/2093139.2093143
– ident: ref25
  doi: 10.1006/jpdc.1993.1013
– start-page: 162
  year: 1990
  ident: ref29
  article-title: Performance analysis of disk arrays under failure
  publication-title: Proc 16th Int Conf Very Large Data Bases
– ident: ref15
  doi: 10.1145/1534530.1534544
– ident: ref38
  doi: 10.1109/ICCNC.2012.6167470
– year: 2008
  ident: ref3
  article-title: The DiskSim simulation environment version 4.0 reference manual
– ident: ref8
  doi: 10.1007/978-1-4757-4828-4_3
– ident: ref7
  doi: 10.1109/12.543706
– ident: ref39
  doi: 10.1007/978-3-642-24403-2_20
– volume: 18
  start-page: 293
  year: 1951
  ident: ref36
  article-title: A statistical distribution function of wide applicability
  publication-title: J Appl Mech
  doi: 10.1115/1.4010337
– ident: ref12
  doi: 10.1145/1669112.1669118
– start-page: 1
  year: 2013
  ident: ref28
  article-title: Don't let RAID raid the lifetime of your SSD array
  publication-title: Proc 5th UNESIX Conf Hot Topics Storage File Syst
– ident: ref2
  doi: 10.1145/1807060.1807061
– ident: ref18
  doi: 10.1145/1987816.1987835
– ident: ref19
  doi: 10.1109/DSN.2013.6575359
– ident: ref33
  doi: 10.1080/15326348908807130
– ident: ref27
  doi: 10.1109/RELPHY.2008.4558857
– ident: ref37
  doi: 10.1109/RELDIS.1997.632790
– year: 0
  ident: ref10
– ident: ref6
  doi: 10.1145/1555349.1555371
– ident: ref20
  doi: 10.1145/2349896.2349900
– ident: ref34
  doi: 10.1109/CMPCON.1989.301913
– year: 0
  ident: ref14
– ident: ref31
  doi: 10.1109/ISCIT.2009.5341269
– ident: ref32
  doi: 10.1145/50202.50214
– volume: 60
  start-page: 80
  year: 2011
  ident: ref16
  article-title: Flash-aware RAID techniques for dependable and high-performance flash memory SSD
  publication-title: IEEE Trans Comput
  doi: 10.1109/TC.2010.197
– ident: ref4
  doi: 10.1109/FTCS.1993.627346
– ident: ref24
  doi: 10.1145/2465529.2465546
SSID ssj0006209
Score 2.2607214
Snippet Solid-state drives (SSDs) have been widely deployed in desktops and data centers. However, SSDs suffer from bit errors, and the bit error rate is time...
SourceID proquest
crossref
ieee
SourceType Aggregation Database
Enrichment Source
Index Database
Publisher
StartPage 1131
SubjectTerms Aging
Arrays
Ash
Disk drives
Error analysis
Numerical models
Reliability
Transient analysis
Title Analysis of Reliability Dynamics of SSD RAID
URI https://ieeexplore.ieee.org/document/6880366
https://www.proquest.com/docview/1786941242
Volume 65
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVIEE
  databaseName: IEEE Electronic Library (IEL)
  customDbUrl:
  eissn: 1557-9956
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0006209
  issn: 0018-9340
  databaseCode: RIE
  dateStart: 19680101
  isFulltext: true
  titleUrlDefault: https://ieeexplore.ieee.org/
  providerName: IEEE
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV09T8MwED21nWCg0IIoFJSBgaFJk9ixk7FqqQoSDDRI3SLHdRZQiyAd4Ndzdpyq4kNiiyI7snxn3734-R3AFYaAImdx5BKB2ISKRLkiUr6rcwma0IJJYdgWD2z2RO8W0aIBg-1dGKWUIZ8pTz-as_zlWm70r7IhQ2cjjDWhyXlS3dXa7rqspnMEuIAJ9a2MT-Anw3SsKVzUCwmiAV2nbicCmZIqP_ZhE1ymbbivh1VxSp69TZl78vObYuN_x30IBzbLdEaVWxxBQ6060K4rODh2QXdgf0eOsAuDWqHEWReO5ipXGt4fzqQqW2_ez-cT53F0OzmGdHqTjmeurabgSgRVpRtxifAjCgq0DiGYRigRUdxewjjPQ4lAVYWJCJOl1stZRjHJFfeLWEnMAYUoEnICrdV6pU7B4bSQCJR4LCWj3Oc5hv1QURaouIhJwHrg1ROcSas0rgtevGQGcfhJlo4zbZHMWqQH19sOr5XIxt9Nu3p-t83s1PagX1sws4vwPQt4rK_pYhJy9nuvc9jDb7OKiNOHVvm2UReYY5T5pXGuL3exyLM
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PT8IwFH5BPKgHUdCIou7gwQMb29p13ZGABBQ4yEy4LV3pLhowOg761_u6H4T4I_G2LG3W9L32vW_9-j2AGwwBScy4ZxKB2ISKQJnCU7apcwka0IRJkbEtpmz4RO_n3rwC7c1dGKVURj5Tln7MzvIXK7nWv8o6DJ2NMLYDux6iCj-_rbXZd1lJ6HBwCRNqF0I-jh10wp4mcVHLJYgHdKW6rRiUFVX5sRNn4WVQg0k5sJxV8myt09iSn980G_878iM4LPJMo5s7xjFU1LIOtbKGg1Es6TocbAkSNqBdapQYq8TQbOVcxfvD6OeF67P3s1nfeOyO-icQDu7C3tAs6imYEmFVanq-RADiOQnahxBMJJTwKG4wLo9jVyJUVW4g3GChFXMWHiex8u2EK4lZoBBJQE6hulwt1RkYPk0kQiWfS8mob_sxBn5XUeYonnDisCZY5QRHstAa1yUvXqIMc9hBFPYibZGosEgTbjcdXnOZjb-bNvT8bpoVU9uEVmnBqFiG75Hjc31RF9OQ8997XcPeMJyMo_Fo-nAB-_gdltNyWlBN39bqEjOONL7KHO0LNVvMBA
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Analysis+of+Reliability+Dynamics+of+SSD+RAID&rft.jtitle=IEEE+transactions+on+computers&rft.au=Li%2C+Yongkun&rft.au=Lee%2C+Patrick+PC&rft.au=John+CS+Lui&rft.date=2016-04-01&rft.pub=The+Institute+of+Electrical+and+Electronics+Engineers%2C+Inc.+%28IEEE%29&rft.issn=0018-9340&rft.eissn=1557-9956&rft.volume=65&rft.issue=4&rft.spage=1131&rft_id=info:doi/10.1109%2FTC.2014.2349505&rft.externalDBID=NO_FULL_TEXT&rft.externalDocID=4045665361
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0018-9340&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0018-9340&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0018-9340&client=summon