More Time-Space Tradeoffs for Finding a Shortest Unique Substring

We extend recent results regarding finding shortest unique substrings (SUSs) to obtain new time-space tradeoffs for this problem and the generalization of finding k-mismatch SUSs. Our new results include the first algorithm for finding a k-mismatch SUS in sublinear space, which we obtain by extendin...

Full description

Saved in:
Bibliographic Details
Published inAlgorithms Vol. 13; no. 9; p. 234
Main Authors Bannai, Hideo, Gagie, Travis, Hoppenworth, Gary, Puglisi, Simon J., Russo, Luís M. S.
Format Journal Article
LanguageEnglish
Published MDPI AG 01.09.2020
Subjects
Online AccessGet full text
ISSN1999-4893
1999-4893
DOI10.3390/a13090234

Cover

Abstract We extend recent results regarding finding shortest unique substrings (SUSs) to obtain new time-space tradeoffs for this problem and the generalization of finding k-mismatch SUSs. Our new results include the first algorithm for finding a k-mismatch SUS in sublinear space, which we obtain by extending an algorithm by Senanayaka (2019) and combining it with a result on sketching by Gawrychowski and Starikovskaya (2019). We first describe how, given a text T of length n and m words of workspace, with high probability we can find an SUS of length L in O(n(L/m)logL) time using random access to T, or in O(n(L/m)log2(L)loglogσ) time using O((L/m)log2L) sequential passes over T. We then describe how, for constant k, with high probability, we can find a k-mismatch SUS in O(n1+ϵL/m) time using O(nϵL/m) sequential passes over T, again using only m words of workspace. Finally, we also describe a deterministic algorithm that takes O(nτlogσlogn) time to find an SUS using O(n/τ) words of workspace, where τ is a parameter.
AbstractList We extend recent results regarding finding shortest unique substrings (SUSs) to obtain new time-space tradeoffs for this problem and the generalization of finding k-mismatch SUSs. Our new results include the first algorithm for finding a k-mismatch SUS in sublinear space, which we obtain by extending an algorithm by Senanayaka (2019) and combining it with a result on sketching by Gawrychowski and Starikovskaya (2019). We first describe how, given a text T of length n and m words of workspace, with high probability we can find an SUS of length L in O(n(L/m)logL) time using random access to T, or in O(n(L/m)log2(L)loglogσ) time using O((L/m)log2L) sequential passes over T. We then describe how, for constant k, with high probability, we can find a k-mismatch SUS in O(n1+ϵL/m) time using O(nϵL/m) sequential passes over T, again using only m words of workspace. Finally, we also describe a deterministic algorithm that takes O(nτlogσlogn) time to find an SUS using O(n/τ) words of workspace, where τ is a parameter.
We extend recent results regarding finding shortest unique substrings (SUSs) to obtain new time-space tradeoffs for this problem and the generalization of finding k-mismatch SUSs. Our new results include the first algorithm for finding a k-mismatch SUS in sublinear space, which we obtain by extending an algorithm by Senanayaka (2019) and combining it with a result on sketching by Gawrychowski and Starikovskaya (2019). We first describe how, given a text T of length n and m words of workspace, with high probability we can find an SUS of length L in O(n(L/m) log L) time using random access to T, or in O(n(L/m) log 2(L) log log σ) time using O((L/m) log 2L) sequential passes over T. We then describe how, for constant k, with high probability, we can find a k-mismatch SUS in O(n1+ϵL/m) time using O(nϵL/m) sequential passes over T, again using only m words of workspace. Finally, we also describe a deterministic algorithm that takes O(nτ log σ log n) time to find an SUS using O(n/τ) words of workspace, where τ is a parameter.
Author Russo, Luís M. S.
Gagie, Travis
Bannai, Hideo
Puglisi, Simon J.
Hoppenworth, Gary
Author_xml – sequence: 1
  givenname: Hideo
  orcidid: 0000-0002-6856-5185
  surname: Bannai
  fullname: Bannai, Hideo
– sequence: 2
  givenname: Travis
  orcidid: 0000-0003-3689-327X
  surname: Gagie
  fullname: Gagie, Travis
– sequence: 3
  givenname: Gary
  surname: Hoppenworth
  fullname: Hoppenworth, Gary
– sequence: 4
  givenname: Simon J.
  orcidid: 0000-0001-7668-7636
  surname: Puglisi
  fullname: Puglisi, Simon J.
– sequence: 5
  givenname: Luís M. S.
  orcidid: 0000-0002-1966-1808
  surname: Russo
  fullname: Russo, Luís M. S.
BookMark eNp1kE9PAjEUxBuDiYAe_AZ71WSlu29pt0dCREk0HoDzpn9esWTZYrfE8O1dxRBj9PQmb2Z-hxmQXuMbJOQ6o3cAgo5kBlTQHIoz0s-EEGlRCuj90Bdk0LYbStlYsKxPJs8-YLJ0W0wXO6k7GaRBb22bWB-SmWuMa9aJTBavPkRsY7Jq3Nsek8VetTF03iU5t7Ju8er7Dslqdr-cPqZPLw_z6eQp1SBYTI2AMctKW1pALLnRlgoNBTCNGlCNBWUCDVOouYbScgmoLVNWsUwCGA5DMj9yjZebahfcVoZD5aWrvh4-rCsZotM1VkhZyUyZITd5wXNQOS-UFqzIOQqtbMe6PbL2zU4e3mVdn4AZrT6HrE5DduHRMayDb9uAttIuyuh8E4N09Z-Nm1-N_-kfqtKDwQ
CitedBy_id crossref_primary_10_3390_a16010021
Cites_doi 10.1007/BF01206331
10.3390/a13090224
10.1145/321941.321946
10.1007/978-3-319-11918-2_16
10.1007/978-3-319-04298-5_44
10.1007/978-3-319-16480-9_36
10.1186/1471-2105-6-123
10.1007/978-3-662-48971-0_63
10.1109/SWAT.1973.13
10.1109/TCBB.2020.2968531
10.1016/0304-3975(85)90157-4
10.1007/978-3-319-11918-2_18
10.1016/j.tcs.2017.08.002
10.1109/TCBB.2019.2935061
10.1145/116825.116845
10.1007/978-3-319-07566-2_18
ContentType Journal Article
DBID AAYXX
CITATION
ADTOC
UNPAY
DOA
DOI 10.3390/a13090234
DatabaseName CrossRef
Unpaywall for CDI: Periodical Content
Unpaywall
DOAJ Directory of Open Access Journals
DatabaseTitle CrossRef
DatabaseTitleList CrossRef

Database_xml – sequence: 1
  dbid: DOA
  name: DOAJ Directory of Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
– sequence: 2
  dbid: UNPAY
  name: Unpaywall
  url: https://proxy.k.utb.cz/login?url=https://unpaywall.org/
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISSN 1999-4893
ExternalDocumentID oai_doaj_org_article_e0686d81e7d24723b274bc96427e9cbf
10.3390/a13090234
10_3390_a13090234
GroupedDBID 23M
2WC
5VS
8FE
8FG
AADQD
AAFWJ
AAYXX
ABDBF
ABJCF
ABUWG
ACUHS
ADBBV
AFKRA
AFPKN
AFZYC
ALMA_UNASSIGNED_HOLDINGS
AMVHM
ARAPS
AZQEC
BCNDV
BENPR
BGLVJ
BPHCQ
CCPQU
CITATION
DWQXO
E3Z
ESX
GNUQQ
GROUPED_DOAJ
HCIFZ
IAO
ICD
J9A
K6V
K7-
KQ8
L6V
M7S
MODMG
M~E
OK1
OVT
P2P
PHGZM
PHGZT
PIMPY
PQGLB
PQQKQ
PROAC
PTHSS
TR2
TUS
ADTOC
C1A
IPNFZ
ITC
RIG
UNPAY
ID FETCH-LOGICAL-c396t-d935618f8f3ee87dcf09c3436cec3eb59069ed6bec7c38f7a3ecf6bfb61a33d73
IEDL.DBID UNPAY
ISSN 1999-4893
IngestDate Tue Oct 14 18:59:19 EDT 2025
Sun Oct 26 04:09:20 EDT 2025
Thu Oct 16 04:44:01 EDT 2025
Thu Apr 24 23:08:15 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 9
Language English
License cc-by
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c396t-d935618f8f3ee87dcf09c3436cec3eb59069ed6bec7c38f7a3ecf6bfb61a33d73
ORCID 0000-0002-6856-5185
0000-0001-7668-7636
0000-0003-3689-327X
0000-0002-1966-1808
OpenAccessLink https://proxy.k.utb.cz/login?url=https://www.mdpi.com/1999-4893/13/9/234/pdf?version=1600413230
ParticipantIDs doaj_primary_oai_doaj_org_article_e0686d81e7d24723b274bc96427e9cbf
unpaywall_primary_10_3390_a13090234
crossref_citationtrail_10_3390_a13090234
crossref_primary_10_3390_a13090234
PublicationCentury 2000
PublicationDate 2020-09-01
PublicationDateYYYYMMDD 2020-09-01
PublicationDate_xml – month: 09
  year: 2020
  text: 2020-09-01
  day: 01
PublicationDecade 2020
PublicationTitle Algorithms
PublicationYear 2020
Publisher MDPI AG
Publisher_xml – name: MDPI AG
References Ganguly (ref_9) 2017; 700
ref_14
ref_13
ref_12
ref_11
ref_10
ref_1
ref_3
Blumer (ref_16) 1985; 40
ref_2
Crochemore (ref_17) 1991; 38
ref_18
Ukkonen (ref_20) 1995; 14
McCreight (ref_19) 1976; 23
ref_15
ref_8
ref_5
ref_4
ref_7
ref_6
References_xml – volume: 14
  start-page: 249
  year: 1995
  ident: ref_20
  article-title: On-Line Construction of Suffix Trees
  publication-title: Algorithmica
  doi: 10.1007/BF01206331
– ident: ref_1
  doi: 10.3390/a13090224
– volume: 23
  start-page: 262
  year: 1976
  ident: ref_19
  article-title: A Space-Economical Suffix Tree Construction Algorithm
  publication-title: J. ACM
  doi: 10.1145/321941.321946
– ident: ref_3
– ident: ref_5
  doi: 10.1007/978-3-319-11918-2_16
– ident: ref_7
  doi: 10.1007/978-3-319-04298-5_44
– ident: ref_10
– ident: ref_4
  doi: 10.1007/978-3-319-16480-9_36
– ident: ref_2
  doi: 10.1186/1471-2105-6-123
– ident: ref_11
  doi: 10.1007/978-3-662-48971-0_63
– ident: ref_18
  doi: 10.1109/SWAT.1973.13
– ident: ref_12
  doi: 10.1109/TCBB.2020.2968531
– ident: ref_15
– ident: ref_14
– volume: 40
  start-page: 31
  year: 1985
  ident: ref_16
  article-title: The Smallest Automaton Recognizing the Subwords of a Text
  publication-title: Theor. Comput. Sci.
  doi: 10.1016/0304-3975(85)90157-4
– ident: ref_8
  doi: 10.1007/978-3-319-11918-2_18
– volume: 700
  start-page: 75
  year: 2017
  ident: ref_9
  article-title: Space–time trade-offs for finding shortest unique substrings and maximal unique matches
  publication-title: Theor. Comput. Sci.
  doi: 10.1016/j.tcs.2017.08.002
– ident: ref_13
  doi: 10.1109/TCBB.2019.2935061
– volume: 38
  start-page: 651
  year: 1991
  ident: ref_17
  article-title: Two-Way String Matching
  publication-title: J. ACM
  doi: 10.1145/116825.116845
– ident: ref_6
  doi: 10.1007/978-3-319-07566-2_18
SSID ssj0065961
Score 2.1937392
Snippet We extend recent results regarding finding shortest unique substrings (SUSs) to obtain new time-space tradeoffs for this problem and the generalization of...
SourceID doaj
unpaywall
crossref
SourceType Open Website
Open Access Repository
Enrichment Source
Index Database
StartPage 234
SubjectTerms k-mismatch SUS
Karp–Rabin
shortest unique substring
sketching
suffix trees
time-space tradeoff
SummonAdditionalLinks – databaseName: DOAJ Directory of Open Access Journals
  dbid: DOA
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV3PS8MwFA6yi3rwtzh_EdSDl7K2yZLmOMUxhHmZg91Kkr6gMLqxdYj_vS9tVyYoXryVEnjle8l77yt53yPkTuMxM0rLAEOdC7gWPDAcOU-SaBM7YUJdTm8YvojBmD9PupONUV_-TlglD1wB1wHfw5AlEcgs5jJmBmmUsQrLZgnKGuejb5ioNZmqYrDoKhFVOkIMSX1HY6RWmJ34t-xTivTvku1VPtefH3o63cgs_QOyV5eEtFd9yiHZgvyI7K_HLdD69B2T3nC2AOp7NoIRMl18XOgMZs4tKVaetP9eNqhQTUdv_gbtsqDjUp2V-uBQ-P93J2Tcf3p9HAT1BITAMiWKIFMM65vEJY4BJDKzLlSWcSYsWAamq0KhIBPoB2lZ4qRmYBFgZ0SkGcskOyWtfJbDGaHCeh0WyVVsI-6wqsg0IHxoxgDwyLXJ_RqZ1Nby4H5KxTRFmuBBTBsQ2-SmWTqvNDF-WvTg4W0WeBnr8gU6N62dm_7l3Da5bZzzu6nz_zB1QXZiT6XL62OXpFUsVnCF9UZhrsut9QUqoNIJ
  priority: 102
  providerName: Directory of Open Access Journals
Title More Time-Space Tradeoffs for Finding a Shortest Unique Substring
URI https://www.mdpi.com/1999-4893/13/9/234/pdf?version=1600413230
https://doaj.org/article/e0686d81e7d24723b274bc96427e9cbf
UnpaywallVersion publishedVersion
Volume 13
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVAFT
  databaseName: Open Access Digital Library
  customDbUrl:
  eissn: 1999-4893
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0065961
  issn: 1999-4893
  databaseCode: KQ8
  dateStart: 20080101
  isFulltext: true
  titleUrlDefault: http://grweb.coalliance.org/oadl/oadl.html
  providerName: Colorado Alliance of Research Libraries
– providerCode: PRVAON
  databaseName: DOAJ Directory of Open Access Journals
  customDbUrl:
  eissn: 1999-4893
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0065961
  issn: 1999-4893
  databaseCode: DOA
  dateStart: 20080101
  isFulltext: true
  titleUrlDefault: https://www.doaj.org/
  providerName: Directory of Open Access Journals
– providerCode: PRVEBS
  databaseName: EBSCOhost Academic Search Ultimate
  customDbUrl: https://search.ebscohost.com/login.aspx?authtype=ip,shib&custid=s3936755&profile=ehost&defaultdb=asn
  eissn: 1999-4893
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0065961
  issn: 1999-4893
  databaseCode: ABDBF
  dateStart: 20091201
  isFulltext: true
  titleUrlDefault: https://search.ebscohost.com/direct.asp?db=asn
  providerName: EBSCOhost
– providerCode: PRVEBS
  databaseName: EBSCOhost Mathematics Source - HOST
  customDbUrl:
  eissn: 1999-4893
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0065961
  issn: 1999-4893
  databaseCode: AMVHM
  dateStart: 20091201
  isFulltext: true
  titleUrlDefault: https://www.ebsco.com/products/research-databases/mathematics-source
  providerName: EBSCOhost
– providerCode: PRVHPJ
  databaseName: ROAD: Directory of Open Access Scholarly Resources
  customDbUrl:
  eissn: 1999-4893
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0065961
  issn: 1999-4893
  databaseCode: M~E
  dateStart: 20080101
  isFulltext: true
  titleUrlDefault: https://road.issn.org
  providerName: ISSN International Centre
– providerCode: PRVPQU
  databaseName: ProQuest Central
  customDbUrl: http://www.proquest.com/pqcentral?accountid=15518
  eissn: 1999-4893
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0065961
  issn: 1999-4893
  databaseCode: BENPR
  dateStart: 20080301
  isFulltext: true
  titleUrlDefault: https://www.proquest.com/central
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: ProQuest Technology Collection
  customDbUrl:
  eissn: 1999-4893
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0065961
  issn: 1999-4893
  databaseCode: 8FG
  dateStart: 20080301
  isFulltext: true
  titleUrlDefault: https://search.proquest.com/technologycollection1
  providerName: ProQuest
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3fixMxEB6096A-3PkT6-kR1Adf9ra7SZPNk1SxHMIdwlk4n5b8mOhhaUu7Vby__iZpWlQUxLdlmWXDTjL5vuzMNwAvDS0zq40qKNSFQhgpCiuI8zSNsXWQdmBS94bTM3kyEe8vhhf5wG2V0yqJil-mIJ1K5KM6SlnxUpc1F-XCh9ff8klSJaNaFCcQfRP25JCweA_2JmcfRp_Sr-T87EZOiBO3Lw0FbE2blPhlE0pa_Xfg1nq2MD--m-n0pw1mfADtdmibvJKvx-vOHrur31Qb_3_sd2E_Y0822kyWe3ADZ_fhYNvXgeVl_gBGp_MlslgcUpwTpabLpfE4D2HFCOKy8WWqhGGGnX-Jqbqrjk2SDCyLUaiLB4UPYTJ-9_HtSZFbLRSOa9kVXnMCUk1oAkdslHdhoB0XXDp0HO1QD6RGL8nhyvEmKMPRkSeDlZXh3Cv-CHqz-QwfA5MuCr4ooWtXiUDwxRt0NtBrLKKoQh9ebb9967IOeWyHMW2Jj0Q3tTs39eH5znSxEd_4k9Gb6MCdQdTLTjfmy89tXn4txkoY31SofC1UzS2Rces0kS-FmkbXhxc79__9VU_-yeoQbteRlKdEtKfQ65ZrfEbIpbNHifEf5Ul6DY4F5_g
linkProvider Unpaywall
linkToUnpaywall http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3fixMxEB6096A-3PkTe3oS1Adf9ra7SZPNk9TDcgh3CGfhfFryY3Ielra0W0X_eidpWlQUxLdlmWXDTjL5vuzMNwAvDS0zq40qKNSFQhgpCiuI8zSNsXWQdmBS94azc3k6Ee8uh5f5wG2V0yqJil-nIJ1K5KM6SlnxUpc1F-XCh9df8klSJaNaFCcQfRP25JCweA_2JufvRx_Tr-T87EZOiBO3Lw0FbE2blPhlE0pa_Xfg1nq2MN--mun0pw1mfADtdmibvJLPx-vOHrvvv6k2_v_Y78J-xp5stJks9-AGzu7DwbavA8vL_AGMzuZLZLE4pLggSk2XS-NxHsKKEcRl4-tUCcMMu_gUU3VXHZskGVgWo1AXDwofwmT89sPJaZFbLRSOa9kVXnMCUk1oAkdslHdhoB0XXDp0HO1QD6RGL8nhyvEmKMPRkSeDlZXh3Cv-CHqz-QwfA5MuCr4ooWtXiUDwxRt0NtBrLKKoQh9ebb9967IOeWyHMW2Jj0Q3tTs39eH5znSxEd_4k9Gb6MCdQdTLTjfmy6s2L78WYyWMbypUvhaq5pbIuHWayJdCTaPrw4ud-__-qsN_snoCt-tIylMi2lPodcs1HhFy6eyzPD1_ACO15wM
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=More+Time-Space+Tradeoffs+for+Finding+a+Shortest+Unique+Substring&rft.jtitle=Algorithms&rft.au=Hideo+Bannai&rft.au=Travis+Gagie&rft.au=Gary+Hoppenworth&rft.au=Simon+J.+Puglisi&rft.date=2020-09-01&rft.pub=MDPI+AG&rft.eissn=1999-4893&rft.volume=13&rft.issue=9&rft.spage=234&rft_id=info:doi/10.3390%2Fa13090234&rft.externalDBID=DOA&rft.externalDocID=oai_doaj_org_article_e0686d81e7d24723b274bc96427e9cbf
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1999-4893&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1999-4893&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1999-4893&client=summon