Analysis of Reliability Dynamics of SSD RAID
Solid-state drives (SSDs) have been widely deployed in desktops and data centers. However, SSDs suffer from bit errors, and the bit error rate is time dependent since it increases as an SSD wears down. Traditional storage systems mainly use parity-based RAID to provide reliability guarantees by stri...
Saved in:
Published in | IEEE transactions on computers Vol. 65; no. 4; pp. 1131 - 1144 |
---|---|
Main Authors | , , |
Format | Journal Article |
Language | English |
Published |
New York
IEEE
01.04.2016
The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
Subjects | |
Online Access | Get full text |
ISSN | 0018-9340 1557-9956 |
DOI | 10.1109/TC.2014.2349505 |
Cover
Abstract | Solid-state drives (SSDs) have been widely deployed in desktops and data centers. However, SSDs suffer from bit errors, and the bit error rate is time dependent since it increases as an SSD wears down. Traditional storage systems mainly use parity-based RAID to provide reliability guarantees by striping redundancy across multiple devices, but the effectiveness of traditional RAID schemes in SSDs remains debatable. In particular, an open problem is how different parity distributions over multiple devices influence the reliability of an SSD RAID array. That is, should we evenly distribute patsaverties as suggested by conventional wisdom, or unevenly distribute parties as recently proposed for SSD RAID? To address this fundamental problem, we propose the first analytical model to quantify the reliability dynamics of an SSD RAID array as it ages. Specifically, we develop a "non-homogeneous" continuous time Markov chain model, and derive the transient reliability solution. We validate our model via trace-driven simulation and conduct numerical analysis to analyze the reliability dynamics of SSD RAID arrays subject to different parity distributions, error rates, and SSD array configurations. Our model enables system practitioners to decide the appropriate parity distribution based on their reliability requirements. |
---|---|
AbstractList | Solid-state drives (SSDs) have been widely deployed in desktops and data centers. However, SSDs suffer from bit errors, and the bit error rate is time dependent since it increases as an SSD wears down. Traditional storage systems mainly use parity-based RAID to provide reliability guarantees by striping redundancy across multiple devices, but the effectiveness of traditional RAID schemes in SSDs remains debatable. In particular, an open problem is how different parity distributions over multiple devices influence the reliability of an SSD RAID array. That is, should we evenly distribute patsaverties as suggested by conventional wisdom, or unevenly distribute parties as recently proposed for SSD RAID? To address this fundamental problem, we propose the first analytical model to quantify the reliability dynamics of an SSD RAID array as it ages. Specifically, we develop a "non-homogeneous" continuous time Markov chain model, and derive the transient reliability solution. We validate our model via trace-driven simulation and conduct numerical analysis to analyze the reliability dynamics of SSD RAID arrays subject to different parity distributions, error rates, and SSD array configurations. Our model enables system practitioners to decide the appropriate parity distribution based on their reliability requirements. |
Author | Li, Yongkun Lui, John C.S. Lee, Patrick P.C. |
Author_xml | – sequence: 1 givenname: Yongkun surname: Li fullname: Li, Yongkun email: yongkunlee@gmail.com organization: School of Computer Science and Technology, University of Science and Technology of China, Hefei, Anhui, China – sequence: 2 givenname: Patrick P.C. surname: Lee fullname: Lee, Patrick P.C. email: pclee@cse.cuhk.edu.hk organization: Department of Computer Science and Engineering, Chinese University of Hong Kong, Shatin, Hong Kong – sequence: 3 givenname: John C.S. surname: Lui fullname: Lui, John C.S. email: cslui@cse.cuhk.edu.hk organization: Department of Computer Science and Engineering, Chinese University of Hong Kong, Shatin, Hong Kong |
BookMark | eNp9kD1rwzAURUVJoUnauUMXQ9c6eZIsWRqD049AoJB4F7IigYJjp5Iz-N_XaUKHDp0ePO65XM4EjZq2sQg9YphhDHJeFjMCOJsRmkkG7AaNMWN5KiXjIzQGwCKVNIM7NIlxDwCcgByjl0Wj6z76mLQu2dja68rXvuuTZd_ogzc__-12mWwWq-U9unW6jvbheqeofHsti490_fm-Khbr1BAhu5TlhuCcYSeYo3RYYTXLaM6JqCpiqKCWSE3kDksud0zQyubghDWApdZO0il6vtQeQ_t1srFT-_YUhp1R4VxwmWGSkSHFLikT2hiDdcr4Tne-bbqgfa0wqLMXVRbq7EVdvQzc_A93DP6gQ_8P8XQhvLX2N82FAMo5_QZnsWug |
CODEN | ITCOB4 |
CitedBy_id | crossref_primary_10_1109_TCAD_2023_3348033 crossref_primary_10_1109_TDMR_2021_3108941 crossref_primary_10_1109_TC_2019_2962691 crossref_primary_10_1109_TIM_2022_3227983 crossref_primary_10_4018_IJGHPC_2017100101 crossref_primary_10_1109_TCAD_2020_2974346 crossref_primary_10_1145_3299710_3211338 crossref_primary_10_1109_TDMR_2017_2749639 crossref_primary_10_1109_TCAD_2020_3020495 crossref_primary_10_1145_3627992 crossref_primary_10_3390_fi15100319 crossref_primary_10_1109_ACCESS_2019_2944764 crossref_primary_10_1109_TC_2020_2978823 |
Cites_doi | 10.1080/03461238.1953.10419459 10.1145/1519065.1519081 10.1145/1982185.1982266 10.1145/2367589.2367603 10.1109/SRDS.2013.16 10.1145/1629435.1629459 10.1145/1089733.1089735 10.1145/2093139.2093143 10.1006/jpdc.1993.1013 10.1145/1534530.1534544 10.1109/ICCNC.2012.6167470 10.1007/978-1-4757-4828-4_3 10.1109/12.543706 10.1007/978-3-642-24403-2_20 10.1115/1.4010337 10.1145/1669112.1669118 10.1145/1807060.1807061 10.1145/1987816.1987835 10.1109/DSN.2013.6575359 10.1080/15326348908807130 10.1109/RELPHY.2008.4558857 10.1109/RELDIS.1997.632790 10.1145/1555349.1555371 10.1145/2349896.2349900 10.1109/CMPCON.1989.301913 10.1109/ISCIT.2009.5341269 10.1145/50202.50214 10.1109/TC.2010.197 10.1109/FTCS.1993.627346 10.1145/2465529.2465546 |
ContentType | Journal Article |
Copyright | Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2016 |
Copyright_xml | – notice: Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2016 |
DBID | 97E RIA RIE AAYXX CITATION 7SC 7SP 8FD JQ2 L7M L~C L~D |
DOI | 10.1109/TC.2014.2349505 |
DatabaseName | IEEE Xplore (IEEE) IEEE All-Society Periodicals Package (ASPP) 1998–Present IEEE Xplore CrossRef Computer and Information Systems Abstracts Electronics & Communications Abstracts Technology Research Database ProQuest Computer Science Collection Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Academic Computer and Information Systems Abstracts Professional |
DatabaseTitle | CrossRef Technology Research Database Computer and Information Systems Abstracts – Academic Electronics & Communications Abstracts ProQuest Computer Science Collection Computer and Information Systems Abstracts Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Professional |
DatabaseTitleList | Technology Research Database |
Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Engineering Computer Science |
EISSN | 1557-9956 |
EndPage | 1144 |
ExternalDocumentID | 4045665361 10_1109_TC_2014_2349505 6880366 |
Genre | orig-research |
GrantInformation_xml | – fundername: National Natural Science Foundation of China; National Nature Science Foundation of China grantid: 61303048 funderid: 10.13039/501100001809 – fundername: Fundamental Research Funds for the Central Universities grantid: WK0110000040 – fundername: CUHK MoE-Microsoft Key Laboratory of Human-centric Computing and Interface Technologies |
GroupedDBID | --Z -DZ -~X .DC 0R~ 29I 4.4 5GY 6IK 85S 97E AAJGR AARMG AASAJ AAWTH ABAZT ABQJQ ABVLG ACGFO ACIWK ACNCT AENEX AETEA AGQYO AGSQL AHBIQ AKQYR ALMA_UNASSIGNED_HOLDINGS ASUFR ATWAV BEFXN BFFAM BGNUA BKEBE BPEOZ CS3 DU5 EBS EJD HZ~ IEDLZ IFIPE IPLJI JAVBF LAI M43 MS~ O9- OCL P2P PQQKQ RIA RIE RNS RXW TAE TN5 TWZ UHB UPT XZL YZZ AAYXX CITATION 7SC 7SP 8FD JQ2 L7M L~C L~D |
ID | FETCH-LOGICAL-c289t-57c21751f85f33155ea5437628bb2c383e29a29d1969d583be70f8ec019aaf93 |
IEDL.DBID | RIE |
ISSN | 0018-9340 |
IngestDate | Sun Jun 29 13:47:55 EDT 2025 Thu Apr 24 22:56:41 EDT 2025 Tue Jul 01 00:27:35 EDT 2025 Wed Aug 27 08:35:49 EDT 2025 |
IsPeerReviewed | true |
IsScholarly | true |
Issue | 4 |
Keywords | CTMC transient analysis RAID reliability Solid-state drives |
Language | English |
License | https://ieeexplore.ieee.org/Xplorehelp/downloads/license-information/IEEE.html |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-c289t-57c21751f85f33155ea5437628bb2c383e29a29d1969d583be70f8ec019aaf93 |
Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 |
PQID | 1786941242 |
PQPubID | 85452 |
PageCount | 14 |
ParticipantIDs | proquest_journals_1786941242 crossref_primary_10_1109_TC_2014_2349505 ieee_primary_6880366 crossref_citationtrail_10_1109_TC_2014_2349505 |
ProviderPackageCode | CITATION AAYXX |
PublicationCentury | 2000 |
PublicationDate | 2016-April-1 2016-4-1 20160401 |
PublicationDateYYYYMMDD | 2016-04-01 |
PublicationDate_xml | – month: 04 year: 2016 text: 2016-April-1 day: 01 |
PublicationDecade | 2010 |
PublicationPlace | New York |
PublicationPlace_xml | – name: New York |
PublicationTitle | IEEE transactions on computers |
PublicationTitleAbbrev | TC |
PublicationYear | 2016 |
Publisher | IEEE The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
Publisher_xml | – name: IEEE – name: The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
References | hess (ref14) 0 ref34 ref12 ref37 ref15 ref31 moon (ref28) 2013 ref30 ref33 ref11 ref32 ref2 ref39 ref17 ref38 ref19 muntz (ref29) 1990 ref18 im (ref16) 2011; 60 weibull (ref36) 1951; 18 ref24 ref23 ref26 ref25 ref20 ref22 ref21 agrawal (ref1) 2008 ref27 enderle (ref10) 0 ref8 grupp (ref13) 2012 ref7 cai (ref5) 2012 bucy (ref3) 2008 ref9 ref4 ref6 sun (ref35) 2011 |
References_xml | – start-page: 521 year: 2012 ident: ref5 article-title: Error patterns in MLC NAND flash memory: Measurement, characterization, and analysis publication-title: Proc Conf Des Autom Test Eur – ident: ref17 doi: 10.1080/03461238.1953.10419459 – ident: ref30 doi: 10.1145/1519065.1519081 – start-page: 2 year: 2012 ident: ref13 article-title: The bleak future of NAND flash memory publication-title: Proc 10th USENIX Conf File Storage Technol – ident: ref21 doi: 10.1145/1982185.1982266 – ident: ref9 doi: 10.1145/2367589.2367603 – start-page: 57 year: 2008 ident: ref1 article-title: Design tradeoffs for SSD performance publication-title: Proc Annu Tech Conf – year: 2011 ident: ref35 article-title: Quantifying reliability of solid-state storage from multiple aspects publication-title: Proc SNAPI – ident: ref23 doi: 10.1109/SRDS.2013.16 – ident: ref22 doi: 10.1145/1629435.1629459 – ident: ref11 doi: 10.1145/1089733.1089735 – ident: ref26 doi: 10.1145/2093139.2093143 – ident: ref25 doi: 10.1006/jpdc.1993.1013 – start-page: 162 year: 1990 ident: ref29 article-title: Performance analysis of disk arrays under failure publication-title: Proc 16th Int Conf Very Large Data Bases – ident: ref15 doi: 10.1145/1534530.1534544 – ident: ref38 doi: 10.1109/ICCNC.2012.6167470 – year: 2008 ident: ref3 article-title: The DiskSim simulation environment version 4.0 reference manual – ident: ref8 doi: 10.1007/978-1-4757-4828-4_3 – ident: ref7 doi: 10.1109/12.543706 – ident: ref39 doi: 10.1007/978-3-642-24403-2_20 – volume: 18 start-page: 293 year: 1951 ident: ref36 article-title: A statistical distribution function of wide applicability publication-title: J Appl Mech doi: 10.1115/1.4010337 – ident: ref12 doi: 10.1145/1669112.1669118 – start-page: 1 year: 2013 ident: ref28 article-title: Don't let RAID raid the lifetime of your SSD array publication-title: Proc 5th UNESIX Conf Hot Topics Storage File Syst – ident: ref2 doi: 10.1145/1807060.1807061 – ident: ref18 doi: 10.1145/1987816.1987835 – ident: ref19 doi: 10.1109/DSN.2013.6575359 – ident: ref33 doi: 10.1080/15326348908807130 – ident: ref27 doi: 10.1109/RELPHY.2008.4558857 – ident: ref37 doi: 10.1109/RELDIS.1997.632790 – year: 0 ident: ref10 – ident: ref6 doi: 10.1145/1555349.1555371 – ident: ref20 doi: 10.1145/2349896.2349900 – ident: ref34 doi: 10.1109/CMPCON.1989.301913 – year: 0 ident: ref14 – ident: ref31 doi: 10.1109/ISCIT.2009.5341269 – ident: ref32 doi: 10.1145/50202.50214 – volume: 60 start-page: 80 year: 2011 ident: ref16 article-title: Flash-aware RAID techniques for dependable and high-performance flash memory SSD publication-title: IEEE Trans Comput doi: 10.1109/TC.2010.197 – ident: ref4 doi: 10.1109/FTCS.1993.627346 – ident: ref24 doi: 10.1145/2465529.2465546 |
SSID | ssj0006209 |
Score | 2.2607214 |
Snippet | Solid-state drives (SSDs) have been widely deployed in desktops and data centers. However, SSDs suffer from bit errors, and the bit error rate is time... |
SourceID | proquest crossref ieee |
SourceType | Aggregation Database Enrichment Source Index Database Publisher |
StartPage | 1131 |
SubjectTerms | Aging Arrays Ash Disk drives Error analysis Numerical models Reliability Transient analysis |
Title | Analysis of Reliability Dynamics of SSD RAID |
URI | https://ieeexplore.ieee.org/document/6880366 https://www.proquest.com/docview/1786941242 |
Volume | 65 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
journalDatabaseRights | – providerCode: PRVIEE databaseName: IEEE Electronic Library (IEL) customDbUrl: eissn: 1557-9956 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0006209 issn: 0018-9340 databaseCode: RIE dateStart: 19680101 isFulltext: true titleUrlDefault: https://ieeexplore.ieee.org/ providerName: IEEE |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV09T8MwED21nWCg0IIoFJSBgaFJk9ixk7FqqQoSDDRI3SLHdRZQiyAd4Ndzdpyq4kNiiyI7snxn3734-R3AFYaAImdx5BKB2ISKRLkiUr6rcwma0IJJYdgWD2z2RO8W0aIBg-1dGKWUIZ8pTz-as_zlWm70r7IhQ2cjjDWhyXlS3dXa7rqspnMEuIAJ9a2MT-Anw3SsKVzUCwmiAV2nbicCmZIqP_ZhE1ymbbivh1VxSp69TZl78vObYuN_x30IBzbLdEaVWxxBQ6060K4rODh2QXdgf0eOsAuDWqHEWReO5ipXGt4fzqQqW2_ez-cT53F0OzmGdHqTjmeurabgSgRVpRtxifAjCgq0DiGYRigRUdxewjjPQ4lAVYWJCJOl1stZRjHJFfeLWEnMAYUoEnICrdV6pU7B4bSQCJR4LCWj3Oc5hv1QURaouIhJwHrg1ROcSas0rgtevGQGcfhJlo4zbZHMWqQH19sOr5XIxt9Nu3p-t83s1PagX1sws4vwPQt4rK_pYhJy9nuvc9jDb7OKiNOHVvm2UReYY5T5pXGuL3exyLM |
linkProvider | IEEE |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PT8IwFH5BPKgHUdCIou7gwQMb29p13ZGABBQ4yEy4LV3pLhowOg761_u6H4T4I_G2LG3W9L32vW_9-j2AGwwBScy4ZxKB2ISKQJnCU7apcwka0IRJkbEtpmz4RO_n3rwC7c1dGKVURj5Tln7MzvIXK7nWv8o6DJ2NMLYDux6iCj-_rbXZd1lJ6HBwCRNqF0I-jh10wp4mcVHLJYgHdKW6rRiUFVX5sRNn4WVQg0k5sJxV8myt09iSn980G_878iM4LPJMo5s7xjFU1LIOtbKGg1Es6TocbAkSNqBdapQYq8TQbOVcxfvD6OeF67P3s1nfeOyO-icQDu7C3tAs6imYEmFVanq-RADiOQnahxBMJJTwKG4wLo9jVyJUVW4g3GChFXMWHiex8u2EK4lZoBBJQE6hulwt1RkYPk0kQiWfS8mob_sxBn5XUeYonnDisCZY5QRHstAa1yUvXqIMc9hBFPYibZGosEgTbjcdXnOZjb-bNvT8bpoVU9uEVmnBqFiG75Hjc31RF9OQ8997XcPeMJyMo_Fo-nAB-_gdltNyWlBN39bqEjOONL7KHO0LNVvMBA |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Analysis+of+Reliability+Dynamics+of+SSD+RAID&rft.jtitle=IEEE+transactions+on+computers&rft.au=Li%2C+Yongkun&rft.au=Lee%2C+Patrick+PC&rft.au=John+CS+Lui&rft.date=2016-04-01&rft.pub=The+Institute+of+Electrical+and+Electronics+Engineers%2C+Inc.+%28IEEE%29&rft.issn=0018-9340&rft.eissn=1557-9956&rft.volume=65&rft.issue=4&rft.spage=1131&rft_id=info:doi/10.1109%2FTC.2014.2349505&rft.externalDBID=NO_FULL_TEXT&rft.externalDocID=4045665361 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0018-9340&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0018-9340&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0018-9340&client=summon |