More Time-Space Tradeoffs for Finding a Shortest Unique Substring
We extend recent results regarding finding shortest unique substrings (SUSs) to obtain new time-space tradeoffs for this problem and the generalization of finding k-mismatch SUSs. Our new results include the first algorithm for finding a k-mismatch SUS in sublinear space, which we obtain by extendin...
        Saved in:
      
    
          | Published in | Algorithms Vol. 13; no. 9; p. 234 | 
|---|---|
| Main Authors | , , , , | 
| Format | Journal Article | 
| Language | English | 
| Published | 
            MDPI AG
    
        01.09.2020
     | 
| Subjects | |
| Online Access | Get full text | 
| ISSN | 1999-4893 1999-4893  | 
| DOI | 10.3390/a13090234 | 
Cover
| Abstract | We extend recent results regarding finding shortest unique substrings (SUSs) to obtain new time-space tradeoffs for this problem and the generalization of finding k-mismatch SUSs. Our new results include the first algorithm for finding a k-mismatch SUS in sublinear space, which we obtain by extending an algorithm by Senanayaka (2019) and combining it with a result on sketching by Gawrychowski and Starikovskaya (2019). We first describe how, given a text T of length n and m words of workspace, with high probability we can find an SUS of length L in O(n(L/m)logL) time using random access to T, or in O(n(L/m)log2(L)loglogσ) time using O((L/m)log2L) sequential passes over T. We then describe how, for constant k, with high probability, we can find a k-mismatch SUS in O(n1+ϵL/m) time using O(nϵL/m) sequential passes over T, again using only m words of workspace. Finally, we also describe a deterministic algorithm that takes O(nτlogσlogn) time to find an SUS using O(n/τ) words of workspace, where τ is a parameter. | 
    
|---|---|
| AbstractList | We extend recent results regarding finding shortest unique substrings (SUSs) to obtain new time-space tradeoffs for this problem and the generalization of finding k-mismatch SUSs. Our new results include the first algorithm for finding a k-mismatch SUS in sublinear space, which we obtain by extending an algorithm by Senanayaka (2019) and combining it with a result on sketching by Gawrychowski and Starikovskaya (2019). We first describe how, given a text T of length n and m words of workspace, with high probability we can find an SUS of length L in O(n(L/m)logL) time using random access to T, or in O(n(L/m)log2(L)loglogσ) time using O((L/m)log2L) sequential passes over T. We then describe how, for constant k, with high probability, we can find a k-mismatch SUS in O(n1+ϵL/m) time using O(nϵL/m) sequential passes over T, again using only m words of workspace. Finally, we also describe a deterministic algorithm that takes O(nτlogσlogn) time to find an SUS using O(n/τ) words of workspace, where τ is a parameter. We extend recent results regarding finding shortest unique substrings (SUSs) to obtain new time-space tradeoffs for this problem and the generalization of finding k-mismatch SUSs. Our new results include the first algorithm for finding a k-mismatch SUS in sublinear space, which we obtain by extending an algorithm by Senanayaka (2019) and combining it with a result on sketching by Gawrychowski and Starikovskaya (2019). We first describe how, given a text T of length n and m words of workspace, with high probability we can find an SUS of length L in O(n(L/m) log L) time using random access to T, or in O(n(L/m) log 2(L) log log σ) time using O((L/m) log 2L) sequential passes over T. We then describe how, for constant k, with high probability, we can find a k-mismatch SUS in O(n1+ϵL/m) time using O(nϵL/m) sequential passes over T, again using only m words of workspace. Finally, we also describe a deterministic algorithm that takes O(nτ log σ log n) time to find an SUS using O(n/τ) words of workspace, where τ is a parameter.  | 
    
| Author | Russo, Luís M. S. Gagie, Travis Bannai, Hideo Puglisi, Simon J. Hoppenworth, Gary  | 
    
| Author_xml | – sequence: 1 givenname: Hideo orcidid: 0000-0002-6856-5185 surname: Bannai fullname: Bannai, Hideo – sequence: 2 givenname: Travis orcidid: 0000-0003-3689-327X surname: Gagie fullname: Gagie, Travis – sequence: 3 givenname: Gary surname: Hoppenworth fullname: Hoppenworth, Gary – sequence: 4 givenname: Simon J. orcidid: 0000-0001-7668-7636 surname: Puglisi fullname: Puglisi, Simon J. – sequence: 5 givenname: Luís M. S. orcidid: 0000-0002-1966-1808 surname: Russo fullname: Russo, Luís M. S.  | 
    
| BookMark | eNp1kE9PAjEUxBuDiYAe_AZ71WSlu29pt0dCREk0HoDzpn9esWTZYrfE8O1dxRBj9PQmb2Z-hxmQXuMbJOQ6o3cAgo5kBlTQHIoz0s-EEGlRCuj90Bdk0LYbStlYsKxPJs8-YLJ0W0wXO6k7GaRBb22bWB-SmWuMa9aJTBavPkRsY7Jq3Nsek8VetTF03iU5t7Ju8er7Dslqdr-cPqZPLw_z6eQp1SBYTI2AMctKW1pALLnRlgoNBTCNGlCNBWUCDVOouYbScgmoLVNWsUwCGA5DMj9yjZebahfcVoZD5aWrvh4-rCsZotM1VkhZyUyZITd5wXNQOS-UFqzIOQqtbMe6PbL2zU4e3mVdn4AZrT6HrE5DduHRMayDb9uAttIuyuh8E4N09Z-Nm1-N_-kfqtKDwQ | 
    
| CitedBy_id | crossref_primary_10_3390_a16010021 | 
    
| Cites_doi | 10.1007/BF01206331 10.3390/a13090224 10.1145/321941.321946 10.1007/978-3-319-11918-2_16 10.1007/978-3-319-04298-5_44 10.1007/978-3-319-16480-9_36 10.1186/1471-2105-6-123 10.1007/978-3-662-48971-0_63 10.1109/SWAT.1973.13 10.1109/TCBB.2020.2968531 10.1016/0304-3975(85)90157-4 10.1007/978-3-319-11918-2_18 10.1016/j.tcs.2017.08.002 10.1109/TCBB.2019.2935061 10.1145/116825.116845 10.1007/978-3-319-07566-2_18  | 
    
| ContentType | Journal Article | 
    
| DBID | AAYXX CITATION ADTOC UNPAY DOA  | 
    
| DOI | 10.3390/a13090234 | 
    
| DatabaseName | CrossRef Unpaywall for CDI: Periodical Content Unpaywall DOAJ Directory of Open Access Journals  | 
    
| DatabaseTitle | CrossRef | 
    
| DatabaseTitleList | CrossRef | 
    
| Database_xml | – sequence: 1 dbid: DOA name: DOAJ Directory of Open Access Journals url: https://www.doaj.org/ sourceTypes: Open Website – sequence: 2 dbid: UNPAY name: Unpaywall url: https://proxy.k.utb.cz/login?url=https://unpaywall.org/ sourceTypes: Open Access Repository  | 
    
| DeliveryMethod | fulltext_linktorsrc | 
    
| Discipline | Computer Science | 
    
| EISSN | 1999-4893 | 
    
| ExternalDocumentID | oai_doaj_org_article_e0686d81e7d24723b274bc96427e9cbf 10.3390/a13090234 10_3390_a13090234  | 
    
| GroupedDBID | 23M 2WC 5VS 8FE 8FG AADQD AAFWJ AAYXX ABDBF ABJCF ABUWG ACUHS ADBBV AFKRA AFPKN AFZYC ALMA_UNASSIGNED_HOLDINGS AMVHM ARAPS AZQEC BCNDV BENPR BGLVJ BPHCQ CCPQU CITATION DWQXO E3Z ESX GNUQQ GROUPED_DOAJ HCIFZ IAO ICD J9A K6V K7- KQ8 L6V M7S MODMG M~E OK1 OVT P2P PHGZM PHGZT PIMPY PQGLB PQQKQ PROAC PTHSS TR2 TUS ADTOC C1A IPNFZ ITC RIG UNPAY  | 
    
| ID | FETCH-LOGICAL-c396t-d935618f8f3ee87dcf09c3436cec3eb59069ed6bec7c38f7a3ecf6bfb61a33d73 | 
    
| IEDL.DBID | UNPAY | 
    
| ISSN | 1999-4893 | 
    
| IngestDate | Tue Oct 14 18:59:19 EDT 2025 Sun Oct 26 04:09:20 EDT 2025 Thu Oct 16 04:44:01 EDT 2025 Thu Apr 24 23:08:15 EDT 2025  | 
    
| IsDoiOpenAccess | true | 
    
| IsOpenAccess | true | 
    
| IsPeerReviewed | true | 
    
| IsScholarly | true | 
    
| Issue | 9 | 
    
| Language | English | 
    
| License | cc-by | 
    
| LinkModel | DirectLink | 
    
| MergedId | FETCHMERGED-LOGICAL-c396t-d935618f8f3ee87dcf09c3436cec3eb59069ed6bec7c38f7a3ecf6bfb61a33d73 | 
    
| ORCID | 0000-0002-6856-5185 0000-0001-7668-7636 0000-0003-3689-327X 0000-0002-1966-1808  | 
    
| OpenAccessLink | https://proxy.k.utb.cz/login?url=https://www.mdpi.com/1999-4893/13/9/234/pdf?version=1600413230 | 
    
| ParticipantIDs | doaj_primary_oai_doaj_org_article_e0686d81e7d24723b274bc96427e9cbf unpaywall_primary_10_3390_a13090234 crossref_citationtrail_10_3390_a13090234 crossref_primary_10_3390_a13090234  | 
    
| PublicationCentury | 2000 | 
    
| PublicationDate | 2020-09-01 | 
    
| PublicationDateYYYYMMDD | 2020-09-01 | 
    
| PublicationDate_xml | – month: 09 year: 2020 text: 2020-09-01 day: 01  | 
    
| PublicationDecade | 2020 | 
    
| PublicationTitle | Algorithms | 
    
| PublicationYear | 2020 | 
    
| Publisher | MDPI AG | 
    
| Publisher_xml | – name: MDPI AG | 
    
| References | Ganguly (ref_9) 2017; 700 ref_14 ref_13 ref_12 ref_11 ref_10 ref_1 ref_3 Blumer (ref_16) 1985; 40 ref_2 Crochemore (ref_17) 1991; 38 ref_18 Ukkonen (ref_20) 1995; 14 McCreight (ref_19) 1976; 23 ref_15 ref_8 ref_5 ref_4 ref_7 ref_6  | 
    
| References_xml | – volume: 14 start-page: 249 year: 1995 ident: ref_20 article-title: On-Line Construction of Suffix Trees publication-title: Algorithmica doi: 10.1007/BF01206331 – ident: ref_1 doi: 10.3390/a13090224 – volume: 23 start-page: 262 year: 1976 ident: ref_19 article-title: A Space-Economical Suffix Tree Construction Algorithm publication-title: J. ACM doi: 10.1145/321941.321946 – ident: ref_3 – ident: ref_5 doi: 10.1007/978-3-319-11918-2_16 – ident: ref_7 doi: 10.1007/978-3-319-04298-5_44 – ident: ref_10 – ident: ref_4 doi: 10.1007/978-3-319-16480-9_36 – ident: ref_2 doi: 10.1186/1471-2105-6-123 – ident: ref_11 doi: 10.1007/978-3-662-48971-0_63 – ident: ref_18 doi: 10.1109/SWAT.1973.13 – ident: ref_12 doi: 10.1109/TCBB.2020.2968531 – ident: ref_15 – ident: ref_14 – volume: 40 start-page: 31 year: 1985 ident: ref_16 article-title: The Smallest Automaton Recognizing the Subwords of a Text publication-title: Theor. Comput. Sci. doi: 10.1016/0304-3975(85)90157-4 – ident: ref_8 doi: 10.1007/978-3-319-11918-2_18 – volume: 700 start-page: 75 year: 2017 ident: ref_9 article-title: Space–time trade-offs for finding shortest unique substrings and maximal unique matches publication-title: Theor. Comput. Sci. doi: 10.1016/j.tcs.2017.08.002 – ident: ref_13 doi: 10.1109/TCBB.2019.2935061 – volume: 38 start-page: 651 year: 1991 ident: ref_17 article-title: Two-Way String Matching publication-title: J. ACM doi: 10.1145/116825.116845 – ident: ref_6 doi: 10.1007/978-3-319-07566-2_18  | 
    
| SSID | ssj0065961 | 
    
| Score | 2.1937392 | 
    
| Snippet | We extend recent results regarding finding shortest unique substrings (SUSs) to obtain new time-space tradeoffs for this problem and the generalization of... | 
    
| SourceID | doaj unpaywall crossref  | 
    
| SourceType | Open Website Open Access Repository Enrichment Source Index Database  | 
    
| StartPage | 234 | 
    
| SubjectTerms | k-mismatch SUS Karp–Rabin shortest unique substring sketching suffix trees time-space tradeoff  | 
    
| SummonAdditionalLinks | – databaseName: DOAJ Directory of Open Access Journals dbid: DOA link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV3PS8MwFA6yi3rwtzh_EdSDl7K2yZLmOMUxhHmZg91Kkr6gMLqxdYj_vS9tVyYoXryVEnjle8l77yt53yPkTuMxM0rLAEOdC7gWPDAcOU-SaBM7YUJdTm8YvojBmD9PupONUV_-TlglD1wB1wHfw5AlEcgs5jJmBmmUsQrLZgnKGuejb5ioNZmqYrDoKhFVOkIMSX1HY6RWmJ34t-xTivTvku1VPtefH3o63cgs_QOyV5eEtFd9yiHZgvyI7K_HLdD69B2T3nC2AOp7NoIRMl18XOgMZs4tKVaetP9eNqhQTUdv_gbtsqDjUp2V-uBQ-P93J2Tcf3p9HAT1BITAMiWKIFMM65vEJY4BJDKzLlSWcSYsWAamq0KhIBPoB2lZ4qRmYBFgZ0SkGcskOyWtfJbDGaHCeh0WyVVsI-6wqsg0IHxoxgDwyLXJ_RqZ1Nby4H5KxTRFmuBBTBsQ2-SmWTqvNDF-WvTg4W0WeBnr8gU6N62dm_7l3Da5bZzzu6nz_zB1QXZiT6XL62OXpFUsVnCF9UZhrsut9QUqoNIJ priority: 102 providerName: Directory of Open Access Journals  | 
    
| Title | More Time-Space Tradeoffs for Finding a Shortest Unique Substring | 
    
| URI | https://www.mdpi.com/1999-4893/13/9/234/pdf?version=1600413230 https://doaj.org/article/e0686d81e7d24723b274bc96427e9cbf  | 
    
| UnpaywallVersion | publishedVersion | 
    
| Volume | 13 | 
    
| hasFullText | 1 | 
    
| inHoldings | 1 | 
    
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVAFT databaseName: Open Access Digital Library customDbUrl: eissn: 1999-4893 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0065961 issn: 1999-4893 databaseCode: KQ8 dateStart: 20080101 isFulltext: true titleUrlDefault: http://grweb.coalliance.org/oadl/oadl.html providerName: Colorado Alliance of Research Libraries – providerCode: PRVAON databaseName: DOAJ Directory of Open Access Journals customDbUrl: eissn: 1999-4893 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0065961 issn: 1999-4893 databaseCode: DOA dateStart: 20080101 isFulltext: true titleUrlDefault: https://www.doaj.org/ providerName: Directory of Open Access Journals – providerCode: PRVEBS databaseName: EBSCOhost Academic Search Ultimate customDbUrl: https://search.ebscohost.com/login.aspx?authtype=ip,shib&custid=s3936755&profile=ehost&defaultdb=asn eissn: 1999-4893 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0065961 issn: 1999-4893 databaseCode: ABDBF dateStart: 20091201 isFulltext: true titleUrlDefault: https://search.ebscohost.com/direct.asp?db=asn providerName: EBSCOhost – providerCode: PRVEBS databaseName: EBSCOhost Mathematics Source - HOST customDbUrl: eissn: 1999-4893 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0065961 issn: 1999-4893 databaseCode: AMVHM dateStart: 20091201 isFulltext: true titleUrlDefault: https://www.ebsco.com/products/research-databases/mathematics-source providerName: EBSCOhost – providerCode: PRVHPJ databaseName: ROAD: Directory of Open Access Scholarly Resources customDbUrl: eissn: 1999-4893 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0065961 issn: 1999-4893 databaseCode: M~E dateStart: 20080101 isFulltext: true titleUrlDefault: https://road.issn.org providerName: ISSN International Centre – providerCode: PRVPQU databaseName: ProQuest Central customDbUrl: http://www.proquest.com/pqcentral?accountid=15518 eissn: 1999-4893 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0065961 issn: 1999-4893 databaseCode: BENPR dateStart: 20080301 isFulltext: true titleUrlDefault: https://www.proquest.com/central providerName: ProQuest – providerCode: PRVPQU databaseName: ProQuest Technology Collection customDbUrl: eissn: 1999-4893 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0065961 issn: 1999-4893 databaseCode: 8FG dateStart: 20080301 isFulltext: true titleUrlDefault: https://search.proquest.com/technologycollection1 providerName: ProQuest  | 
    
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3fixMxEB6096A-3PkT6-kR1Adf9ra7SZPNk1SxHMIdwlk4n5b8mOhhaUu7Vby__iZpWlQUxLdlmWXDTjL5vuzMNwAvDS0zq40qKNSFQhgpCiuI8zSNsXWQdmBS94bTM3kyEe8vhhf5wG2V0yqJil-mIJ1K5KM6SlnxUpc1F-XCh9ff8klSJaNaFCcQfRP25JCweA_2JmcfRp_Sr-T87EZOiBO3Lw0FbE2blPhlE0pa_Xfg1nq2MD--m-n0pw1mfADtdmibvJKvx-vOHrur31Qb_3_sd2E_Y0822kyWe3ADZ_fhYNvXgeVl_gBGp_MlslgcUpwTpabLpfE4D2HFCOKy8WWqhGGGnX-Jqbqrjk2SDCyLUaiLB4UPYTJ-9_HtSZFbLRSOa9kVXnMCUk1oAkdslHdhoB0XXDp0HO1QD6RGL8nhyvEmKMPRkSeDlZXh3Cv-CHqz-QwfA5MuCr4ooWtXiUDwxRt0NtBrLKKoQh9ebb9967IOeWyHMW2Jj0Q3tTs39eH5znSxEd_4k9Gb6MCdQdTLTjfmy89tXn4txkoY31SofC1UzS2Rces0kS-FmkbXhxc79__9VU_-yeoQbteRlKdEtKfQ65ZrfEbIpbNHifEf5Ul6DY4F5_g | 
    
| linkProvider | Unpaywall | 
    
| linkToUnpaywall | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3fixMxEB6096A-3PkTe3oS1Adf9ra7SZPNk9TDcgh3CGfhfFryY3Ielra0W0X_eidpWlQUxLdlmWXDTjL5vuzMNwAvDS0zq40qKNSFQhgpCiuI8zSNsXWQdmBS94azc3k6Ee8uh5f5wG2V0yqJil-nIJ1K5KM6SlnxUpc1F-XCh9df8klSJaNaFCcQfRP25JCweA_2JufvRx_Tr-T87EZOiBO3Lw0FbE2blPhlE0pa_Xfg1nq2MN--mun0pw1mfADtdmibvJLPx-vOHrvvv6k2_v_Y78J-xp5stJks9-AGzu7DwbavA8vL_AGMzuZLZLE4pLggSk2XS-NxHsKKEcRl4-tUCcMMu_gUU3VXHZskGVgWo1AXDwofwmT89sPJaZFbLRSOa9kVXnMCUk1oAkdslHdhoB0XXDp0HO1QD6RGL8nhyvEmKMPRkSeDlZXh3Cv-CHqz-QwfA5MuCr4ooWtXiUDwxRt0NtBrLKKoQh9ebb9967IOeWyHMW2Jj0Q3tTs39eH5znSxEd_4k9Gb6MCdQdTLTjfmy6s2L78WYyWMbypUvhaq5pbIuHWayJdCTaPrw4ud-__-qsN_snoCt-tIylMi2lPodcs1HhFy6eyzPD1_ACO15wM | 
    
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=More+Time-Space+Tradeoffs+for+Finding+a+Shortest+Unique+Substring&rft.jtitle=Algorithms&rft.au=Hideo+Bannai&rft.au=Travis+Gagie&rft.au=Gary+Hoppenworth&rft.au=Simon+J.+Puglisi&rft.date=2020-09-01&rft.pub=MDPI+AG&rft.eissn=1999-4893&rft.volume=13&rft.issue=9&rft.spage=234&rft_id=info:doi/10.3390%2Fa13090234&rft.externalDBID=DOA&rft.externalDocID=oai_doaj_org_article_e0686d81e7d24723b274bc96427e9cbf | 
    
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1999-4893&client=summon | 
    
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1999-4893&client=summon | 
    
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1999-4893&client=summon |