External memory BWT and LCP computation for sequence collections with applications

Background Sequencing technologies produce larger and larger collections of biosequences that have to be stored in compressed indices supporting fast search operations. Many compressed indices are based on the Burrows–Wheeler Transform (BWT) and the longest common prefix (LCP) array. Because of the...

Full description

Saved in:
Bibliographic Details
Published inAlgorithms for molecular biology Vol. 14; no. 1; pp. 6 - 15
Main Authors Egidi, Lavinia, Louza, Felipe A., Manzini, Giovanni, Telles, Guilherme P.
Format Journal Article
LanguageEnglish
Published London BioMed Central 08.03.2019
BioMed Central Ltd
Springer Nature B.V
BMC
Subjects
Online AccessGet full text
ISSN1748-7188
1748-7188
DOI10.1186/s13015-019-0140-0

Cover

Abstract Background Sequencing technologies produce larger and larger collections of biosequences that have to be stored in compressed indices supporting fast search operations. Many compressed indices are based on the Burrows–Wheeler Transform (BWT) and the longest common prefix (LCP) array. Because of the sheer size of the input it is important to build these data structures in external memory and time using in the best possible way the available RAM. Results We propose a space-efficient algorithm to compute the BWT and LCP array for a collection of sequences in the external or semi-external memory setting. Our algorithm splits the input collection into subcollections sufficiently small that it can compute their BWT in RAM using an optimal linear time algorithm. Next, it merges the partial BWTs in external or semi-external memory and in the process it also computes the LCP values. Our algorithm can be modified to output two additional arrays that, combined with the BWT and LCP array, provide simple, scan-based, external memory algorithms for three well known problems in bioinformatics: the computation of maximal repeats, the all pairs suffix–prefix overlaps, and the construction of succinct de Bruijn graphs. Conclusions We prove that our algorithm performs O ( n maxlcp ) sequential I/Os, where n is the total length of the collection and maxlcp is the maximum LCP value. The experimental results show that our algorithm is only slightly slower than the state of the art for short sequences but it is up to 40 times faster for longer sequences or when the available RAM is at least equal to the size of the input.
AbstractList Background Sequencing technologies produce larger and larger collections of biosequences that have to be stored in compressed indices supporting fast search operations. Many compressed indices are based on the Burrows–Wheeler Transform (BWT) and the longest common prefix (LCP) array. Because of the sheer size of the input it is important to build these data structures in external memory and time using in the best possible way the available RAM. Results We propose a space-efficient algorithm to compute the BWT and LCP array for a collection of sequences in the external or semi-external memory setting. Our algorithm splits the input collection into subcollections sufficiently small that it can compute their BWT in RAM using an optimal linear time algorithm. Next, it merges the partial BWTs in external or semi-external memory and in the process it also computes the LCP values. Our algorithm can be modified to output two additional arrays that, combined with the BWT and LCP array, provide simple, scan-based, external memory algorithms for three well known problems in bioinformatics: the computation of maximal repeats, the all pairs suffix–prefix overlaps, and the construction of succinct de Bruijn graphs. Conclusions We prove that our algorithm performs O ( n maxlcp ) sequential I/Os, where n is the total length of the collection and maxlcp is the maximum LCP value. The experimental results show that our algorithm is only slightly slower than the state of the art for short sequences but it is up to 40 times faster for longer sequences or when the available RAM is at least equal to the size of the input.
Sequencing technologies produce larger and larger collections of biosequences that have to be stored in compressed indices supporting fast search operations. Many compressed indices are based on the Burrows-Wheeler Transform (BWT) and the longest common prefix (LCP) array. Because of the sheer size of the input it is important to build these data structures in external memory and time using in the best possible way the available RAM. We propose a space-efficient algorithm to compute the BWT and LCP array for a collection of sequences in the external or semi-external memory setting. Our algorithm splits the input collection into subcollections sufficiently small that it can compute their BWT in RAM using an optimal linear time algorithm. Next, it merges the partial BWTs in external or semi-external memory and in the process it also computes the LCP values. Our algorithm can be modified to output two additional arrays that, combined with the BWT and LCP array, provide simple, scan-based, external memory algorithms for three well known problems in bioinformatics: the computation of maximal repeats, the all pairs suffix-prefix overlaps, and the construction of succinct de Bruijn graphs. We prove that our algorithm performs O(nmaxlcp) $${\mathcal {O}}(n\, \mathsf {maxlcp})$$ sequential I/Os, where n is the total length of the collection and maxlcp $$\mathsf {maxlcp}$$ is the maximum LCP value. The experimental results show that our algorithm is only slightly slower than the state of the art for short sequences but it is up to 40 times faster for longer sequences or when the available RAM is at least equal to the size of the input.
Abstract Background Sequencing technologies produce larger and larger collections of biosequences that have to be stored in compressed indices supporting fast search operations. Many compressed indices are based on the Burrows–Wheeler Transform (BWT) and the longest common prefix (LCP) array. Because of the sheer size of the input it is important to build these data structures in external memory and time using in the best possible way the available RAM. Results We propose a space-efficient algorithm to compute the BWT and LCP array for a collection of sequences in the external or semi-external memory setting. Our algorithm splits the input collection into subcollections sufficiently small that it can compute their BWT in RAM using an optimal linear time algorithm. Next, it merges the partial BWTs in external or semi-external memory and in the process it also computes the LCP values. Our algorithm can be modified to output two additional arrays that, combined with the BWT and LCP array, provide simple, scan-based, external memory algorithms for three well known problems in bioinformatics: the computation of maximal repeats, the all pairs suffix–prefix overlaps, and the construction of succinct de Bruijn graphs. Conclusions We prove that our algorithm performs $${\mathcal {O}}(n\, \mathsf {maxlcp})$$ O(nmaxlcp) sequential I/Os, where n is the total length of the collection and $$\mathsf {maxlcp}$$ maxlcp is the maximum LCP value. The experimental results show that our algorithm is only slightly slower than the state of the art for short sequences but it is up to 40 times faster for longer sequences or when the available RAM is at least equal to the size of the input.
Sequencing technologies produce larger and larger collections of biosequences that have to be stored in compressed indices supporting fast search operations. Many compressed indices are based on the Burrows-Wheeler Transform (BWT) and the longest common prefix (LCP) array. Because of the sheer size of the input it is important to build these data structures in external memory and time using in the best possible way the available RAM. We propose a space-efficient algorithm to compute the BWT and LCP array for a collection of sequences in the external or semi-external memory setting. Our algorithm splits the input collection into subcollections sufficiently small that it can compute their BWT in RAM using an optimal linear time algorithm. Next, it merges the partial BWTs in external or semi-external memory and in the process it also computes the LCP values. Our algorithm can be modified to output two additional arrays that, combined with the BWT and LCP array, provide simple, scan-based, external memory algorithms for three well known problems in bioinformatics: the computation of maximal repeats, the all pairs suffix-prefix overlaps, and the construction of succinct de Bruijn graphs. We prove that our algorithm performs sequential I/Os, where is the total length of the collection and is the maximum LCP value. The experimental results show that our algorithm is only slightly slower than the state of the art for short sequences but it is up to 40 times faster for longer sequences or when the available RAM is at least equal to the size of the input.
Background Sequencing technologies produce larger and larger collections of biosequences that have to be stored in compressed indices supporting fast search operations. Many compressed indices are based on the Burrows-Wheeler Transform (BWT) and the longest common prefix (LCP) array. Because of the sheer size of the input it is important to build these data structures in external memory and time using in the best possible way the available RAM. Results We propose a space-efficient algorithm to compute the BWT and LCP array for a collection of sequences in the external or semi-external memory setting. Our algorithm splits the input collection into subcollections sufficiently small that it can compute their BWT in RAM using an optimal linear time algorithm. Next, it merges the partial BWTs in external or semi-external memory and in the process it also computes the LCP values. Our algorithm can be modified to output two additional arrays that, combined with the BWT and LCP array, provide simple, scan-based, external memory algorithms for three well known problems in bioinformatics: the computation of maximal repeats, the all pairs suffix-prefix overlaps, and the construction of succinct de Bruijn graphs. Conclusions We prove that our algorithm performs O(nmaxlcp) $${\mathcal {O}}(n\, \mathsf {maxlcp})$$ sequential I/Os, where n is the total length of the collection and maxlcp $$\mathsf {maxlcp}$$ is the maximum LCP value. The experimental results show that our algorithm is only slightly slower than the state of the art for short sequences but it is up to 40 times faster for longer sequences or when the available RAM is at least equal to the size of the input. Keywords: Burrows-Wheeler Transform, Longest common prefix array, Maximal repeats, All pairs suffix-prefix overlaps, Succinct de Bruijn graph, External memory algorithms
Sequencing technologies produce larger and larger collections of biosequences that have to be stored in compressed indices supporting fast search operations. Many compressed indices are based on the Burrows-Wheeler Transform (BWT) and the longest common prefix (LCP) array. Because of the sheer size of the input it is important to build these data structures in external memory and time using in the best possible way the available RAM.BACKGROUNDSequencing technologies produce larger and larger collections of biosequences that have to be stored in compressed indices supporting fast search operations. Many compressed indices are based on the Burrows-Wheeler Transform (BWT) and the longest common prefix (LCP) array. Because of the sheer size of the input it is important to build these data structures in external memory and time using in the best possible way the available RAM.We propose a space-efficient algorithm to compute the BWT and LCP array for a collection of sequences in the external or semi-external memory setting. Our algorithm splits the input collection into subcollections sufficiently small that it can compute their BWT in RAM using an optimal linear time algorithm. Next, it merges the partial BWTs in external or semi-external memory and in the process it also computes the LCP values. Our algorithm can be modified to output two additional arrays that, combined with the BWT and LCP array, provide simple, scan-based, external memory algorithms for three well known problems in bioinformatics: the computation of maximal repeats, the all pairs suffix-prefix overlaps, and the construction of succinct de Bruijn graphs.RESULTSWe propose a space-efficient algorithm to compute the BWT and LCP array for a collection of sequences in the external or semi-external memory setting. Our algorithm splits the input collection into subcollections sufficiently small that it can compute their BWT in RAM using an optimal linear time algorithm. Next, it merges the partial BWTs in external or semi-external memory and in the process it also computes the LCP values. Our algorithm can be modified to output two additional arrays that, combined with the BWT and LCP array, provide simple, scan-based, external memory algorithms for three well known problems in bioinformatics: the computation of maximal repeats, the all pairs suffix-prefix overlaps, and the construction of succinct de Bruijn graphs.We prove that our algorithm performs O ( n maxlcp ) sequential I/Os, where n is the total length of the collection and maxlcp is the maximum LCP value. The experimental results show that our algorithm is only slightly slower than the state of the art for short sequences but it is up to 40 times faster for longer sequences or when the available RAM is at least equal to the size of the input.CONCLUSIONSWe prove that our algorithm performs O ( n maxlcp ) sequential I/Os, where n is the total length of the collection and maxlcp is the maximum LCP value. The experimental results show that our algorithm is only slightly slower than the state of the art for short sequences but it is up to 40 times faster for longer sequences or when the available RAM is at least equal to the size of the input.
Background Sequencing technologies produce larger and larger collections of biosequences that have to be stored in compressed indices supporting fast search operations. Many compressed indices are based on the Burrows–Wheeler Transform (BWT) and the longest common prefix (LCP) array. Because of the sheer size of the input it is important to build these data structures in external memory and time using in the best possible way the available RAM. Results We propose a space-efficient algorithm to compute the BWT and LCP array for a collection of sequences in the external or semi-external memory setting. Our algorithm splits the input collection into subcollections sufficiently small that it can compute their BWT in RAM using an optimal linear time algorithm. Next, it merges the partial BWTs in external or semi-external memory and in the process it also computes the LCP values. Our algorithm can be modified to output two additional arrays that, combined with the BWT and LCP array, provide simple, scan-based, external memory algorithms for three well known problems in bioinformatics: the computation of maximal repeats, the all pairs suffix–prefix overlaps, and the construction of succinct de Bruijn graphs. Conclusions We prove that our algorithm performs \({\mathcal {O}}(n\, \mathsf {maxlcp})\) sequential I/Os, where n is the total length of the collection and \(\mathsf {maxlcp}\) is the maximum LCP value. The experimental results show that our algorithm is only slightly slower than the state of the art for short sequences but it is up to 40 times faster for longer sequences or when the available RAM is at least equal to the size of the input.
ArticleNumber 6
Audience Academic
Author Louza, Felipe A.
Manzini, Giovanni
Telles, Guilherme P.
Egidi, Lavinia
Author_xml – sequence: 1
  givenname: Lavinia
  surname: Egidi
  fullname: Egidi, Lavinia
  organization: DiSIT, University of Eastern Piedmont
– sequence: 2
  givenname: Felipe A.
  orcidid: 0000-0003-2931-1470
  surname: Louza
  fullname: Louza, Felipe A.
  email: louza@usp.br
  organization: Department of Computing and Mathematics, University of São Paulo
– sequence: 3
  givenname: Giovanni
  surname: Manzini
  fullname: Manzini, Giovanni
  organization: DiSIT, University of Eastern Piedmont, IIT CNR
– sequence: 4
  givenname: Guilherme P.
  surname: Telles
  fullname: Telles, Guilherme P.
  organization: Institute of Computing, University of Campinas
BackLink https://www.ncbi.nlm.nih.gov/pubmed/30899322$$D View this record in MEDLINE/PubMed
BookMark eNqNkk1v1DAQhiNURD_gB3BBkbhwSRnHjuNckNpVgUorgVARR8tx7K1Xjh3shNJ_j3eztN0KELIiW-Nn3sy8nuPswHmnsuwlglOEGH0bEQZUFYCa9BEo4El2hGrCihoxdvDgfJgdx7gGIBUG_Cw7xMCaBpflUfbl4ueoghM271Xvw21-_u0qF67Ll4vPufT9MI1iNN7l2oc8qu-TclKlC2uV3MRjfmPG61wMgzVyS8bn2VMtbFQvdvtJ9vX9xdXiY7H89OFycbYsJAU8FlUqvWkY0bXSGolaYtoCQC0oTcUB6Rpa0o60skSk6xSrWdVphXBVISop6vBJdjnrdl6s-RBML8It98LwbcCHFRdhNNIqXioJ0BIBFSWk1Y3QhNZaa6ZZCw0RSauctSY3iNsbYe2dIAK-MZvPZvNkNt-YzSElvZuThqntVSeVG4Owe5Xs3zhzzVf-B6cEGKMkCbzZCQSfnI0j702UylrhlJ8iL1FDK0QYrhP6-hG69tPm3bYUhprQBt1TK5G6Nk779F-5EeVnFUOE4LJmiTr9A5VWp3oj04hpk-J7Ca8eNnrX4e8xSkA9AzL4GIPSXJp5bpKysf_0ED3K_B_fd48VE-tWKtx78fekX7BQ-Iw
CitedBy_id crossref_primary_10_1007_s40484_019_0181_x
crossref_primary_10_1016_j_ipl_2019_105887
crossref_primary_10_1016_j_tcs_2020_11_024
crossref_primary_10_1093_bioinformatics_btae333
crossref_primary_10_1016_j_jksuci_2022_04_015
crossref_primary_10_1007_s00453_021_00855_2
crossref_primary_10_1186_s13015_023_00232_4
crossref_primary_10_1016_j_tcs_2020_11_041
crossref_primary_10_1016_j_ic_2023_105088
crossref_primary_10_1016_j_tcs_2019_11_002
crossref_primary_10_1016_j_tcs_2019_11_001
crossref_primary_10_1186_s13015_024_00263_5
crossref_primary_10_1007_s00236_024_00467_7
crossref_primary_10_1186_s12859_020_03586_3
crossref_primary_10_1186_s12859_020_03628_w
Cites_doi 10.1109/TCBB.2011.127
10.1007/3-540-44888-8_5
10.1145/1216370.1216372
10.1016/j.tcs.2012.02.002
10.1007/978-3-030-00479-8_13
10.1007/978-3-319-67428-5_15
10.1007/s11786-016-0281-1
10.1145/2493175.2493180
10.1007/3-540-45749-6_61
10.1007/978-3-540-27810-8_32
10.1016/j.ipl.2009.10.015
10.1016/j.tcs.2007.07.014
10.1007/978-3-030-00479-8_23
10.1137/1.9781611974782.26
10.1017/CBO9781139940023
10.1093/bioinformatics/btx067
10.1007/s00453-016-0165-4
10.1145/2591796.2591885
10.1109/DCC.2015.70
10.1145/2649387.2649431
10.1007/978-3-662-49529-2_13
10.1016/j.tcs.2017.03.039
10.1093/bioinformatics/btu584
10.1137/0222058
10.1016/0020-0190(92)90176-V
10.1371/journal.pcbi.1005944
10.1007/978-3-319-02432-5_5
10.1002/spe.844
10.1016/j.jda.2016.03.003
10.1007/978-3-662-44753-6_23
10.1007/s00453-011-9535-0
10.1007/978-3-642-16321-0_36
10.1017/CBO9780511574931
10.1016/j.jda.2016.04.002
10.1007/978-3-642-33122-0_18
10.1145/384192.384193
10.1186/s13015-017-0117-9
ContentType Journal Article
Copyright The Author(s) 2019
COPYRIGHT 2019 BioMed Central Ltd.
Copyright © 2019. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Copyright_xml – notice: The Author(s) 2019
– notice: COPYRIGHT 2019 BioMed Central Ltd.
– notice: Copyright © 2019. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
DBID C6C
AAYXX
CITATION
NPM
3V.
7QO
7QP
7QR
7TK
7X7
7XB
8FD
8FE
8FG
8FH
8FI
8FJ
8FK
ABJCF
ABUWG
AEUYN
AFKRA
ARAPS
AZQEC
BBNVY
BENPR
BGLVJ
BHPHI
CCPQU
DWQXO
FR3
FYUFA
GHDGH
GNUQQ
HCIFZ
K9.
L6V
LK8
M0S
M7P
M7S
P5Z
P62
P64
PHGZM
PHGZT
PIMPY
PJZUB
PKEHL
PPXIY
PQEST
PQGLB
PQQKQ
PQUKI
PRINS
PTHSS
7X8
5PM
ADTOC
UNPAY
DOA
DOI 10.1186/s13015-019-0140-0
DatabaseName Springer Nature OA Free Journals
CrossRef
PubMed
ProQuest Central (Corporate)
Biotechnology Research Abstracts
Calcium & Calcified Tissue Abstracts
Chemoreception Abstracts
Neurosciences Abstracts
Health & Medical Collection
ProQuest Central (purchase pre-March 2016)
Technology Research Database
ProQuest SciTech Collection
ProQuest Technology Collection
ProQuest Natural Science Journals
ProQuest Hospital Collection
Hospital Premium Collection (Alumni Edition)
ProQuest Central (Alumni) (purchase pre-March 2016)
Materials Science & Engineering Collection
ProQuest Central (Alumni)
ProQuest One Sustainability
ProQuest Central UK/Ireland
Advanced Technologies & Computer Science Collection
ProQuest Central Essentials
Biological Science Collection
ProQuest Central
Technology Collection
Natural Science Collection
ProQuest One Community College
ProQuest Central
Engineering Research Database
Health Research Premium Collection (ProQuest)
Health Research Premium Collection (Alumni)
ProQuest Central Student
SciTech Premium Collection
ProQuest Health & Medical Complete (Alumni)
ProQuest Engineering Collection
Biological Sciences
Health & Medical Collection (Alumni Edition)
Biological Science Database
Engineering Database
Advanced Technologies & Aerospace Database
ProQuest Advanced Technologies & Aerospace Collection
Biotechnology and BioEngineering Abstracts
ProQuest Central Premium
ProQuest One Academic
Publicly Available Content Database
ProQuest Health & Medical Research Collection
ProQuest One Academic Middle East (New)
ProQuest One Health & Nursing
ProQuest One Academic Eastern Edition (DO NOT USE)
ProQuest One Applied & Life Sciences
ProQuest One Academic
ProQuest One Academic UKI Edition
ProQuest Central China
Engineering Collection
MEDLINE - Academic
PubMed Central (Full Participant titles)
Unpaywall for CDI: Periodical Content
Unpaywall
Consulter via DOAJ Directory of Open Access Journals
DatabaseTitle CrossRef
PubMed
Publicly Available Content Database
ProQuest Central Student
ProQuest Advanced Technologies & Aerospace Collection
ProQuest Central Essentials
SciTech Premium Collection
ProQuest Central China
ProQuest One Applied & Life Sciences
ProQuest One Sustainability
Health Research Premium Collection
Natural Science Collection
Biological Science Collection
Chemoreception Abstracts
ProQuest Central (New)
Engineering Collection
Advanced Technologies & Aerospace Collection
Engineering Database
ProQuest Biological Science Collection
ProQuest One Academic Eastern Edition
ProQuest Hospital Collection
ProQuest Technology Collection
Health Research Premium Collection (Alumni)
Biological Science Database
Neurosciences Abstracts
ProQuest Hospital Collection (Alumni)
Biotechnology and BioEngineering Abstracts
ProQuest Health & Medical Complete
ProQuest One Academic UKI Edition
Engineering Research Database
ProQuest One Academic
Calcium & Calcified Tissue Abstracts
ProQuest One Academic (New)
Technology Collection
Technology Research Database
ProQuest One Academic Middle East (New)
ProQuest Health & Medical Complete (Alumni)
ProQuest Central (Alumni Edition)
ProQuest One Community College
ProQuest One Health & Nursing
ProQuest Natural Science Collection
ProQuest Central
ProQuest Health & Medical Research Collection
ProQuest Engineering Collection
Biotechnology Research Abstracts
Health and Medicine Complete (Alumni Edition)
ProQuest Central Korea
ProQuest SciTech Collection
Advanced Technologies & Aerospace Database
Materials Science & Engineering Collection
ProQuest Central (Alumni)
MEDLINE - Academic
DatabaseTitleList


PubMed

MEDLINE - Academic
Publicly Available Content Database
Database_xml – sequence: 1
  dbid: C6C
  name: Springer Nature OA Free Journals
  url: http://www.springeropen.com/
  sourceTypes: Publisher
– sequence: 2
  dbid: DOA
  name: DOAJ Directory of Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
– sequence: 3
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 4
  dbid: UNPAY
  name: Unpaywall
  url: https://proxy.k.utb.cz/login?url=https://unpaywall.org/
  sourceTypes: Open Access Repository
– sequence: 5
  dbid: 8FG
  name: ProQuest Technology Collection
  url: https://search.proquest.com/technologycollection1
  sourceTypes: Aggregation Database
DeliveryMethod fulltext_linktorsrc
Discipline Biology
Computer Science
EISSN 1748-7188
EndPage 15
ExternalDocumentID oai_doaj_org_article_2ec00b4a05644bf9af467fff8f8b094a
10.1186/s13015-019-0140-0
PMC6408864
A581443278
30899322
10_1186_s13015_019_0140_0
Genre Journal Article
GrantInformation_xml – fundername: Ministero dell’Istruzione, dell’Università e della Ricerca
  grantid: PRIN 201534HNXC
  funderid: http://dx.doi.org/10.13039/501100003407
– fundername: Fundação de Amparo à Pesquisa do Estado de São Paulo
  grantid: 2017/09105-0
  funderid: http://dx.doi.org/10.13039/501100001807
– fundername: ;
  grantid: PRIN 201534HNXC
– fundername: ;
  grantid: 2017/09105-0
GroupedDBID 0R~
23M
2WC
53G
5GY
5VS
6J9
7X7
8FE
8FG
8FH
8FI
8FJ
AAFWJ
AAJSJ
AASML
ABDBF
ABJCF
ABUWG
ACGFO
ACGFS
ACIHN
ACIWK
ACPRK
ACUHS
ADBBV
ADMLS
ADRAZ
ADUKV
AEAQA
AENEX
AEUYN
AFKRA
AFPKN
AFRAH
AHBYD
AHMBA
AHYZX
ALMA_UNASSIGNED_HOLDINGS
AMKLP
AMTXH
AOIJS
ARAPS
BAPOH
BAWUL
BBNVY
BCNDV
BENPR
BFQNJ
BGLVJ
BHPHI
BMC
BPHCQ
BVXVI
C6C
CCPQU
CS3
DIK
E3Z
EBD
EBLON
EBS
EJD
ESX
F5P
FYUFA
GROUPED_DOAJ
GX1
HCIFZ
HMCUK
HYE
IAO
IGS
IHR
ITC
KQ8
L6V
LK8
M48
M7P
M7S
M~E
O5R
O5S
OK1
OVT
P2P
P62
PGMZT
PHGZM
PHGZT
PIMPY
PQGLB
PQQKQ
PROAC
PTHSS
PUEGO
RBZ
RNS
ROL
RPM
RSV
SBL
SOJ
TR2
TUS
UKHRP
WOQ
WOW
~8M
AAYXX
CITATION
-A0
2VQ
3V.
4.4
ACRMQ
ADINQ
AHSBF
ALIPV
C1A
C24
H13
IPNFZ
NPM
RIG
7QO
7QP
7QR
7TK
7XB
8FD
8FK
AZQEC
DWQXO
FR3
GNUQQ
K9.
P64
PJZUB
PKEHL
PPXIY
PQEST
PQUKI
PRINS
7X8
5PM
ADTOC
UNPAY
ID FETCH-LOGICAL-c603t-50159984f7eff1a7c36b0007a6689904d9626d4bc214dde8785dfe135516c61d3
IEDL.DBID M48
ISSN 1748-7188
IngestDate Tue Oct 14 19:05:01 EDT 2025
Sun Oct 26 03:30:27 EDT 2025
Tue Sep 30 16:42:48 EDT 2025
Sun Aug 24 03:17:10 EDT 2025
Tue Oct 07 06:15:03 EDT 2025
Mon Oct 20 22:34:18 EDT 2025
Mon Oct 20 16:22:21 EDT 2025
Wed Feb 19 02:27:28 EST 2025
Wed Oct 01 03:57:04 EDT 2025
Thu Apr 24 23:09:17 EDT 2025
Sat Sep 06 07:29:42 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 1
Keywords All pairs suffix–prefix overlaps
Maximal repeats
External memory algorithms
Burrows–Wheeler Transform
Longest common prefix array
Succinct de Bruijn graph
Language English
License Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
cc-by
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c603t-50159984f7eff1a7c36b0007a6689904d9626d4bc214dde8785dfe135516c61d3
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ORCID 0000-0003-2931-1470
OpenAccessLink http://journals.scholarsportal.info/openUrl.xqy?doi=10.1186/s13015-019-0140-0
PMID 30899322
PQID 2193074691
PQPubID 55040
PageCount 15
ParticipantIDs doaj_primary_oai_doaj_org_article_2ec00b4a05644bf9af467fff8f8b094a
unpaywall_primary_10_1186_s13015_019_0140_0
pubmedcentral_primary_oai_pubmedcentral_nih_gov_6408864
proquest_miscellaneous_2196514837
proquest_journals_2193074691
gale_infotracmisc_A581443278
gale_infotracacademiconefile_A581443278
pubmed_primary_30899322
crossref_citationtrail_10_1186_s13015_019_0140_0
crossref_primary_10_1186_s13015_019_0140_0
springer_journals_10_1186_s13015_019_0140_0
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2019-03-08
PublicationDateYYYYMMDD 2019-03-08
PublicationDate_xml – month: 03
  year: 2019
  text: 2019-03-08
  day: 08
PublicationDecade 2010
PublicationPlace London
PublicationPlace_xml – name: London
– name: England
PublicationTitle Algorithms for molecular biology
PublicationTitleAbbrev Algorithms Mol Biol
PublicationTitleAlternate Algorithms Mol Biol
PublicationYear 2019
Publisher BioMed Central
BioMed Central Ltd
Springer Nature B.V
BMC
Publisher_xml – name: BioMed Central
– name: BioMed Central Ltd
– name: Springer Nature B.V
– name: BMC
References 140_CR10
U Manber (140_CR3) 1993; 22
140_CR12
FA Louza (140_CR14) 2017; 12
140_CR34
E Ohlebusch (140_CR24) 2010; 110
140_CR36
140_CR9
140_CR37
140_CR16
140_CR38
140_CR17
140_CR39
G Navarro (140_CR5) 2007; 39
140_CR6
140_CR8
MO Külekci (140_CR21) 2012; 9
D Gusfield (140_CR23) 1992; 41
140_CR7
140_CR1
S Gog (140_CR4) 2013; 18
G Marçais (140_CR43) 2018; 14
D Gusfield (140_CR41) 1997
J Kärkkäinen (140_CR13) 2017; 11
G Nong (140_CR33) 2013; 31
MD Muggli (140_CR44) 2017; 33
J Kärkkäinen (140_CR11) 2016; 21
AX Cox (140_CR19) 2016; 37
140_CR40
140_CR20
140_CR22
140_CR45
140_CR46
140_CR47
140_CR26
140_CR27
140_CR28
140_CR29
WHA Tustumi (140_CR25) 2016; 37
J Holt (140_CR35) 2014; 30
V Mäkinen (140_CR2) 2015
R Dementiev (140_CR42) 2008; 38
P Bonizzoni (140_CR30) 2017; 78
S Mantaci (140_CR31) 2007; 387
MJ Bauer (140_CR18) 2013; 483
FA Louza (140_CR32) 2017; 678
J Vitter (140_CR15) 2001; 33
References_xml – volume: 9
  start-page: 421
  issue: 2
  year: 2012
  ident: 140_CR21
  publication-title: IEEE/ACM Trans Comput Biol Bioinform
  doi: 10.1109/TCBB.2011.127
– ident: 140_CR6
  doi: 10.1007/3-540-44888-8_5
– volume: 39
  start-page: 1
  year: 2007
  ident: 140_CR5
  publication-title: ACM Comput Surv
  doi: 10.1145/1216370.1216372
– volume: 483
  start-page: 134
  year: 2013
  ident: 140_CR18
  publication-title: Theor Comput Sci
  doi: 10.1016/j.tcs.2012.02.002
– ident: 140_CR47
  doi: 10.1007/978-3-030-00479-8_13
– ident: 140_CR34
  doi: 10.1007/978-3-319-67428-5_15
– volume: 11
  start-page: 137
  issue: 2
  year: 2017
  ident: 140_CR13
  publication-title: Math Comput Sci
  doi: 10.1007/s11786-016-0281-1
– volume: 31
  start-page: 15
  issue: 3
  year: 2013
  ident: 140_CR33
  publication-title: ACM Trans Inform Syst
  doi: 10.1145/2493175.2493180
– ident: 140_CR8
  doi: 10.1007/3-540-45749-6_61
– ident: 140_CR46
– ident: 140_CR7
  doi: 10.1007/978-3-540-27810-8_32
– ident: 140_CR40
– volume: 110
  start-page: 123
  issue: 3
  year: 2010
  ident: 140_CR24
  publication-title: Inform Process Lett
  doi: 10.1016/j.ipl.2009.10.015
– ident: 140_CR1
– volume: 387
  start-page: 298
  issue: 3
  year: 2007
  ident: 140_CR31
  publication-title: Theor Comput Sci
  doi: 10.1016/j.tcs.2007.07.014
– ident: 140_CR45
  doi: 10.1007/978-3-030-00479-8_23
– ident: 140_CR17
  doi: 10.1137/1.9781611974782.26
– ident: 140_CR39
– ident: 140_CR37
– volume-title: Genome-Scale Algorithm Design: biological sequence analysis in the era of high-throughput sequencing
  year: 2015
  ident: 140_CR2
  doi: 10.1017/CBO9781139940023
– volume: 33
  start-page: 3181
  issue: 20
  year: 2017
  ident: 140_CR44
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btx067
– volume: 78
  start-page: 394
  issue: 2
  year: 2017
  ident: 140_CR30
  publication-title: Algorithmica
  doi: 10.1007/s00453-016-0165-4
– ident: 140_CR16
  doi: 10.1145/2591796.2591885
– ident: 140_CR27
  doi: 10.1109/DCC.2015.70
– ident: 140_CR36
  doi: 10.1145/2649387.2649431
– ident: 140_CR26
  doi: 10.1007/978-3-662-49529-2_13
– volume: 18
  start-page: 2
  year: 2013
  ident: 140_CR4
  publication-title: ACM J Exp Algorith
– ident: 140_CR9
– volume: 678
  start-page: 22
  year: 2017
  ident: 140_CR32
  publication-title: Theor Comput Sci
  doi: 10.1016/j.tcs.2017.03.039
– volume: 30
  start-page: 3524
  issue: 24
  year: 2014
  ident: 140_CR35
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btu584
– volume: 22
  start-page: 935
  issue: 5
  year: 1993
  ident: 140_CR3
  publication-title: SIAM J Comput
  doi: 10.1137/0222058
– ident: 140_CR20
– volume: 41
  start-page: 181
  issue: 4
  year: 1992
  ident: 140_CR23
  publication-title: Inform Process Lett
  doi: 10.1016/0020-0190(92)90176-V
– volume: 14
  start-page: e1005944
  issue: 1
  year: 2018
  ident: 140_CR43
  publication-title: PLoS Comput Biol
  doi: 10.1371/journal.pcbi.1005944
– ident: 140_CR12
  doi: 10.1007/978-3-319-02432-5_5
– volume: 38
  start-page: 589
  issue: 6
  year: 2008
  ident: 140_CR42
  publication-title: Softw Pract Exper
  doi: 10.1002/spe.844
– volume: 37
  start-page: 17
  year: 2016
  ident: 140_CR19
  publication-title: J Discrete Algorith
  doi: 10.1016/j.jda.2016.03.003
– ident: 140_CR38
– ident: 140_CR29
  doi: 10.1007/978-3-662-44753-6_23
– ident: 140_CR10
  doi: 10.1007/s00453-011-9535-0
– volume: 21
  start-page: 1
  issue: 1
  year: 2016
  ident: 140_CR11
  publication-title: ACM J Exp Algorith
– ident: 140_CR22
  doi: 10.1007/978-3-642-16321-0_36
– volume-title: Algorithms on strings, trees, and sequences: computer science and computational biology
  year: 1997
  ident: 140_CR41
  doi: 10.1017/CBO9780511574931
– volume: 37
  start-page: 34
  year: 2016
  ident: 140_CR25
  publication-title: J Discrete Algorith
  doi: 10.1016/j.jda.2016.04.002
– ident: 140_CR28
  doi: 10.1007/978-3-642-33122-0_18
– volume: 33
  start-page: 209
  issue: 2
  year: 2001
  ident: 140_CR15
  publication-title: ACM Comput Surv
  doi: 10.1145/384192.384193
– volume: 12
  start-page: 26
  issue: 1
  year: 2017
  ident: 140_CR14
  publication-title: Algorith Mol Biol
  doi: 10.1186/s13015-017-0117-9
SSID ssj0045303
Score 2.3748362
Snippet Background Sequencing technologies produce larger and larger collections of biosequences that have to be stored in compressed indices supporting fast search...
Sequencing technologies produce larger and larger collections of biosequences that have to be stored in compressed indices supporting fast search operations....
Background Sequencing technologies produce larger and larger collections of biosequences that have to be stored in compressed indices supporting fast search...
Abstract Background Sequencing technologies produce larger and larger collections of biosequences that have to be stored in compressed indices supporting fast...
SourceID doaj
unpaywall
pubmedcentral
proquest
gale
pubmed
crossref
springer
SourceType Open Website
Open Access Repository
Aggregation Database
Index Database
Enrichment Source
Publisher
StartPage 6
SubjectTerms Algorithms
All pairs suffix–prefix overlaps
Arrays
Bioinformatics
Biomedical and Life Sciences
Burrows-Wheeler transform
Cellular and Medical Topics
Collection
Computation
Computational biology
Computational Biology/Bioinformatics
Computer science
Construction
Data collection
Data structures
External memory algorithms
Life Sciences
Longest common prefix array
Maximal repeats
Physiological
Random access memory
Selected papers from WABI 2018
Succinct de Bruijn graph
Technology
SummonAdditionalLinks – databaseName: Consulter via DOAJ Directory of Open Access Journals
  dbid: DOA
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV3da9RAEB-kIOpDqd-xVVYQBEtokv3I5rEtLUVURFrs27LZ7NLCNS3eHeX-e2c2H14U2hcf7uV298js_ObrZjID8CGTOuAnT5WtZYr2VqQ6VD7lgttcBscrT28jf_2mTs7E53N5vjbqi2rCuvbA3cXtFd5lWS0sGmoh6lDZgKIdQtBB1xiaRNco09UQTHU6WEjUzH0OM9dqb46aOqciNaoNonrGiRWKzfr_VclrNunveskxafoEHi3bG7u6tbPZml063oLN3qFk-x0hT-GBb5_Bw27E5Oo5_Djq2zyzK6qpXbGDn6fMtg37cviduTjSIfKGofPKhrpqRuiINVrtnNE_tWw9z_0Czo6PTg9P0n6OQupUxhepRNoxqhKh9CHktnRcRd_AKoXRViaaCqOaRtSuyAVqO11q2QSfc8qhOZU3_CVstNetfw2straogiy09F6EusTTzpbe4w-VBTIngWy4V-P6JuM062JmYrChlelYYZAVJtaVZQl8Go_cdB027tp8QMwaN1Jz7PgFQsb0kDH3QSaBj8RqQyKMD-ds_yYCkkjNsMw-YlcIXpQ6gZ3JThQ9N10ewGJ60Z8bNAGchrhUeQLvx2U6SeVsrb9exj0KPVXNywReddgaSeKUiEU1m0A5Qd2E5ulKe3kRG4MrgTZDiQR2B3z-eaw7rnR3hPD9DHjzPxiwDY-LKIs8zfQObCx-Lf1b9O0W9bsoxr8BBT9Hhg
  priority: 102
  providerName: Directory of Open Access Journals
– databaseName: ProQuest Central
  dbid: BENPR
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwhR1db9Mw8DQ6IeCBj_EVGMhISEhM0ZLYcZwHhNap04SgmqZN7C1yHBuQSlrWVqj_njs3yRqQykNe4nNk-75z5zuAt1GqHD5xKHWZhqhvRahcbkMuuI5TZ3hu6Tbyl7E8vRSfrtKrHRi3d2EorbKViV5QV1ND_8gPkbM49cbI44-zXyF1jaLoattCQzetFaoPvsTYLdhNqDLWAHaHo_HZeSubRYoSu4ltxkoezlGCx5S8RjlDlOfY006-iP-_onpDV_2dR9kFU-_BnWU906vfejLZ0FcnD-F-Y2iyozVlPIIdW-_B7XXrydUePGjbObCGux_D-aipCM1-Uvrtig2_XjBdV-zz8RkzHtyjkaGdy9oUbEaE5NO56jmjn7psMyT-BC5PRhfHp2HTciE0MuKLMMXjQAdMuMw6F-vMcOnNCC0lOmaRqHJ0gCpRmiQWKBhVptLK2ZhTuM3IuOJPYVBPa_scWKl1krs0Uam1wpUZzjY6sxY_lCUu1wFE7VEXpqlHTm0xJoX3S5Qs1tgpEDuFT0GLAnjfTZmti3FsAx4S_jpAqqPtX0yvvxUNWxaJNVFUCo1moBAlrsqh4nDOKadKdHxxke8I-wVxOy7O6ObSAm6R6mYVR0jmQvAkUwHs9yCRS01_uKWfopES8-KGpgN40w3TTMp8q-106WEkGrWKZwE8W5NbtyVOMVuUyAFkPULs7bk_Uv_47muIS4HqRYoADlqSvVnWliM96Kj6_wh4sX3LL-Fu4hmPh5Hah8HiemlfoYG3KF83XPsHg4BK9Q
  priority: 102
  providerName: ProQuest
– databaseName: Springer Nature OA Free Journals
  dbid: C6C
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3da9RAEB-0ItoH8asarbKCIFiCSfYjm8f2aCmiItJi35bNZhcLZ1q8O-T--85sPryoVHy4l9vdsJv5zvx2BuB1JnXAX54qW8sU7a1Idah8ygW3uQyOV55uI3_8pI5PxfszedYXi6a7MJv5-1yrdwvUsTnBywjVQ0jEm3ALbZSKeVk1G5SukKiK-6TlX5dNzE6szv-nDt4wQr8DJMcs6TbcWbWXdv3TzucbhujoPtzrPUi235H8Adzw7UO43fWUXD-CL4d9XWf2nUC0a3bw9YTZtmEfZp-Ziz0cIjEYeqtsAFIzYocIymoXjD7Nss3E9mM4PTo8mR2nfeOE1KmML1OJZ8cwSoTSh5Db0nEVnQGrFIZXmWgqDGMaUbsiF6jedKllE3zOKWnmVN7wHdhqL1r_FFhtbVEFWWjpvQh1iaudLb3HB5VFqGwC2fBejeurilNzi7mJ0YVWpiOFQVKYCCTLEng7LrnsSmpcN_mAiDVOpGrY8Q9kEtMLlym8y7JaWHTmhKhxVwHVfwhBB11j-IqbfEOkNiSzuDln-6sHeESqfmX2kVmF4EWpE9idzERZc9PhgVlML-sLgzqfU9eWKk_g1ThMKwm_1vqLVZyj0DXVvEzgScdb45E4ZV5RryZQTrhucubpSHv-LVYCVwKNhBIJ7A38-Wtb17zSvZGF_02AZ__17Odwt4hCx9NM78LW8sfKv0CvbVm_jPJ6BShwNm8
  priority: 102
  providerName: Springer Nature
– databaseName: Unpaywall
  dbid: UNPAY
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpR39a9Qw9DFviPqDn1OrUyIIgqO3tknT9MfbsTFEx5AN508hTROV3XrH7g45_3pf0g-vUybiDwfl8lLykvfZ9xGA11EqLP7ikKsiDVHfslDY3ISUURWnVtPcuGrkD0f88JS9O0vPNmDc1sKoycW0GK6Xnk-8vMYHfb47K23N5oLvzlH6xi7xzOX7uBzFG7DJUzTIB7B5enQ8-lyXQooQpa9o4pl_nNfTSL5x_-_ieU0_Xc2d7AKod-DWspqp1Xc1mazpqIN7ULbY1akp58PlohjqH1caP_4n-vfhbmPDklFNdA9gw1QP4WZ9q-XqEXzcbzpLkwuXxrsie59OiKpK8n58TLS_RcKTA0F7mbSp3MQRpE8Lq-bEfRwm66H1LTg92D8ZH4bN1Q2h5hFdhCmuDR05ZjNjbawyTbk3RxTn6OBFrMzRkSpZoZOYoYAVmUhLa2LqwnaaxyV9DINqWpmnQAqlktymiUiNYbbIcLZWmTH4oiyxuQogao9P6qavubteYyK9fyO4rLdK4lZJn8oWBfC2mzKrm3pcB7znaKIDdP24_R_Tyy-yYW-ZGB1FBVNoTjJW4KosKiBrrbCiQAcaF_nGUZR0UsOdpWqKHxBF139LjpBdGKNJJgLY7kEit-v-cEuTspE2c4lah7p7Y_I4gFfdsJvpMugqM116GI7GsaBZAE9qEu5Qoi72i5I9gKxH3D2c-yPVt6--FzlnqKY4C2CnZYNfy7pmS3c6Tvn7ATz7J-jncDvxTEHDSGzDYHG5NC_QblwULxu58BP372LR
  priority: 102
  providerName: Unpaywall
Title External memory BWT and LCP computation for sequence collections with applications
URI https://link.springer.com/article/10.1186/s13015-019-0140-0
https://www.ncbi.nlm.nih.gov/pubmed/30899322
https://www.proquest.com/docview/2193074691
https://www.proquest.com/docview/2196514837
https://pubmed.ncbi.nlm.nih.gov/PMC6408864
https://almob.biomedcentral.com/track/pdf/10.1186/s13015-019-0140-0
https://doaj.org/article/2ec00b4a05644bf9af467fff8f8b094a
UnpaywallVersion publishedVersion
Volume 14
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVADU
  databaseName: BioMedCentral
  customDbUrl:
  eissn: 1748-7188
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0045303
  issn: 1748-7188
  databaseCode: RBZ
  dateStart: 20060101
  isFulltext: true
  titleUrlDefault: https://www.biomedcentral.com/search/
  providerName: BioMedCentral
– providerCode: PRVAFT
  databaseName: Open Access Digital Library
  customDbUrl:
  eissn: 1748-7188
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0045303
  issn: 1748-7188
  databaseCode: KQ8
  dateStart: 20060101
  isFulltext: true
  titleUrlDefault: http://grweb.coalliance.org/oadl/oadl.html
  providerName: Colorado Alliance of Research Libraries
– providerCode: PRVAON
  databaseName: DOAJ Directory of Open Access Journals
  customDbUrl:
  eissn: 1748-7188
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0045303
  issn: 1748-7188
  databaseCode: DOA
  dateStart: 20060101
  isFulltext: true
  titleUrlDefault: https://www.doaj.org/
  providerName: Directory of Open Access Journals
– providerCode: PRVEBS
  databaseName: EBSCOhost Academic Search Ultimate
  customDbUrl: https://search.ebscohost.com/login.aspx?authtype=ip,shib&custid=s3936755&profile=ehost&defaultdb=asn
  eissn: 1748-7188
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0045303
  issn: 1748-7188
  databaseCode: ABDBF
  dateStart: 20060101
  isFulltext: true
  titleUrlDefault: https://search.ebscohost.com/direct.asp?db=asn
  providerName: EBSCOhost
– providerCode: PRVEBS
  databaseName: Inspec with Full Text
  customDbUrl:
  eissn: 1748-7188
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0045303
  issn: 1748-7188
  databaseCode: ADMLS
  dateStart: 20060101
  isFulltext: true
  titleUrlDefault: https://www.ebsco.com/products/research-databases/inspec-full-text
  providerName: EBSCOhost
– providerCode: PRVBFR
  databaseName: Free Medical Journals
  customDbUrl:
  eissn: 1748-7188
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0045303
  issn: 1748-7188
  databaseCode: DIK
  dateStart: 20060101
  isFulltext: true
  titleUrlDefault: http://www.freemedicaljournals.com
  providerName: Flying Publisher
– providerCode: PRVFQY
  databaseName: GFMER Free Medical Journals
  customDbUrl:
  eissn: 1748-7188
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0045303
  issn: 1748-7188
  databaseCode: GX1
  dateStart: 20060101
  isFulltext: true
  titleUrlDefault: http://www.gfmer.ch/Medical_journals/Free_medical.php
  providerName: Geneva Foundation for Medical Education and Research
– providerCode: PRVHPJ
  databaseName: ROAD: Directory of Open Access Scholarly Resources
  customDbUrl:
  eissn: 1748-7188
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0045303
  issn: 1748-7188
  databaseCode: M~E
  dateStart: 20060101
  isFulltext: true
  titleUrlDefault: https://road.issn.org
  providerName: ISSN International Centre
– providerCode: PRVAQN
  databaseName: PubMed Central
  customDbUrl:
  eissn: 1748-7188
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0045303
  issn: 1748-7188
  databaseCode: RPM
  dateStart: 20060101
  isFulltext: true
  titleUrlDefault: https://www.ncbi.nlm.nih.gov/pmc/
  providerName: National Library of Medicine
– providerCode: PRVPQU
  databaseName: Health & Medical Collection
  customDbUrl:
  eissn: 1748-7188
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0045303
  issn: 1748-7188
  databaseCode: 7X7
  dateStart: 20090101
  isFulltext: true
  titleUrlDefault: https://search.proquest.com/healthcomplete
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: ProQuest Central (ProQuest)
  customDbUrl: http://www.proquest.com/pqcentral?accountid=15518
  eissn: 1748-7188
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0045303
  issn: 1748-7188
  databaseCode: BENPR
  dateStart: 20090101
  isFulltext: true
  titleUrlDefault: https://www.proquest.com/central
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: ProQuest Technology Collection
  customDbUrl:
  eissn: 1748-7188
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0045303
  issn: 1748-7188
  databaseCode: 8FG
  dateStart: 20090101
  isFulltext: true
  titleUrlDefault: https://search.proquest.com/technologycollection1
  providerName: ProQuest
– providerCode: PRVFZP
  databaseName: Scholars Portal Journals: Open Access
  customDbUrl:
  eissn: 1748-7188
  dateEnd: 20250228
  omitProxy: true
  ssIdentifier: ssj0045303
  issn: 1748-7188
  databaseCode: M48
  dateStart: 20060601
  isFulltext: true
  titleUrlDefault: http://journals.scholarsportal.info
  providerName: Scholars Portal
– providerCode: PRVAVX
  databaseName: Springer Nature HAS Fully OA
  customDbUrl:
  eissn: 1748-7188
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0045303
  issn: 1748-7188
  databaseCode: AAJSJ
  dateStart: 20060101
  isFulltext: true
  titleUrlDefault: https://www.springernature.com
  providerName: Springer Nature
– providerCode: PRVAVX
  databaseName: Springer Nature OA Free Journals
  customDbUrl:
  eissn: 1748-7188
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0045303
  issn: 1748-7188
  databaseCode: C6C
  dateStart: 20060101
  isFulltext: true
  titleUrlDefault: http://www.springeropen.com/
  providerName: Springer Nature
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3di9QwEB_uA1EfxG-r5xJBEDyqbZOm6YPI7rLrId6yHLe4PoW0TVRYu-d-oPvfO8m2davHiQ9b2CYpSeY3mZnOdAbgeRALg7_Q5yqLfZS3zBcm1T5lVIWxyWmq7dfIpyN-MmHvp_F0D-ryVtUGLi817Ww9qcli9urn981bZPg3juEFf73Eczi0IWg28sdGK-7DIQqq1FZyOGWNU4HFeFxXjs1Lh7VEk8vg__c5vSOo_gyibDypN-H6urxQmx9qNtsRVsPbcKvSMkl3C4s7sKfLu3BtW3dycw_OBlXuZ_LNBtpuSO_jOVFlQT70xyR3dR4cwQhqtKQOtiYWMi5wq1wS-_qW7Dq_78NkODjvn_hVcQU_5wFd-TGuHU0tZhJtTKiSnHKnMCjO0QQLWJGiqVOwLI9ChkegSERcGB1S61jLeVjQB3BQzkv9CEimVJSaOBKx1sxkCY7OVaI1PiiJTKo8COp9lXmVedwWwJhJZ4EILrekkEgK6YLNAg9eNkMutmk3rurcs8RqOtqM2e7GfPFZVgwoI50HQcYUKnyMZTgrgyLCGCOMyNDExUm-sKSWFmk4uVxVnyfgEm2GLNlFQDNGo0R4cNTqifyYt5trsMgazhLlArWVXdLQg2dNsx1pY9xKPV-7PhzVV0ETDx5usdUsiVrvLJ69HiQt1LXW3G4pv35x2cI5Q0HCmQfHNT5_T-uKLT1uIPxvAjz-r2c_gRuRYzrqB-IIDlaLtX6Kmt0q68B-Mk3wKobvOnDYG4zGZ_ivz_sd966k4_gZWyajcffTLwCyS-M
linkProvider Scholars Portal
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV3db9MwED-NITR44GN8BQYYCYTEFC2JncR5QGgbmzrWTQh1om_GcWxAKmlZW039p_gbOTsfa0AqT3voS2NHtu_jd5c73wG8CmJu8Bf6icxjH_GW-dxk2qeMyjA2imba3kY-OU16Z-zjMB6uwe_mLoxNq2x0olPUxVjZb-Q7KFnU9sbIwveTX77tGmWjq00LjYotjvXiAl226bujD0jf11F0eDDY7_l1VwFfJQGd-TECIPoYzKTamFCmiiYOKWWSoO8RsCJDG79guYpChrLPUx4XRofURpRUEhYU33sNrjOKugTlJx22Dh6LEQ_qyGnIk50p4kNoU-NsRpLNouxgn2sR8C8QLCHh31mabaj2FmzMy4lcXMjRaAkND-_C7dqMJbsV392DNV1uwo2qseViE-40zSJIrTvuw-eDut40-WmTexdk78uAyLIg_f1PRLnhjkkIWtGkSfAmlk1dslg5JfaTMVkOuD-Asys5-oewXo5L_RhILmWUmTjisdbM5CnOVjLVGl-URiaTHgTNUQtVVzu3TTdGwnk9PBEVdQRSR7gEt8CDt-2USVXqY9XgPUu_dqCt0u3-GJ9_E7XQi0irIMiZRCOTsRxXZRCWjDHc8BzdalzkG0t9YXUJLk7J-koEbtFW5RK7KESM0SjlHmx1RqIOUN3HDf-IWgdNxaXEePCyfWxn2ry6Uo_nbkyCJjOnqQePKnZrt0RtRBj1vQdphxE7e-4-KX98dxXKE4bglTAPthuWvVzWiiPdbrn6_wR4snrLL2CjNzjpi_7R6fFTuBk5IaR-wLdgfXY-18_QlJzlz538Evh61QrjD4sZf64
linkToPdf http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1bb9MwFD6CIW4PiMuAwAAjISExRUviS5zHrawaMKYJbWJvluPYMKlLq7UV6r_n2LnQABrioS-1Hdk595zP5wC8Sbh0-EtjoUseo71lsXSFjSmjOuXO0ML628ifj8TBKft4xs_aPqfzDu3epSSbOw2-SlO92JlVrhFxKXbmqHlTDzrzWB-PT7wONxgaN9_CYCRGnSpmHBV0m8r867KBMQo1-__UzGum6XfYZJ87vQu3l_VMr37oyWTNPI3vw73WryS7DSM8gGu2fgg3m06Tq0fwZb-t9kwuPLR2Rfa-nhBdV-RwdExM6OwQSETQhyUdvJp4JglQrXpO_Adbsp7u3oTT8f7J6CBu2ynERiR0EXM8OwZXzOXWuVTnhorgImghMOhKWFVgcFOx0mQpQ6Unc8krZ1PqU2lGpBV9DBv1tLZPgZRaZ4XjmeTWMlfmuNro3Fp8UJ65QkeQdO9VmbbWuG95MVEh5pBCNaRQSAoV4GVJBO_6JbOm0MZVk_c8sfqJvkZ2-GN6-U21Iqcya5KkZBpdPMZK3JVDo-Cck06WGNTiJt96Uisvybg5o9sLCXhEXxNL7SILM0azXEawNZiJEmiGwx2zqFYDzBVaAup7uRRpBK_7Yb_So9pqO12GOQIdVknzCJ40vNUfifp8LGrbCPIB1w3OPBypz7-H-uACRUEKFsF2x5-_tnXFK93uWfjfBHj2X89-BbeO34_V4YejT8_hThbkj8aJ3IKNxeXSvkC3blG-DKL7Ey5YQaU
linkToUnpaywall http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpR39a9Qw9DFviPqDn1OrUyIIgqO3tknT9MfbsTFEx5AN508hTROV3XrH7g45_3pf0g-vUybiDwfl8lLykvfZ9xGA11EqLP7ikKsiDVHfslDY3ISUURWnVtPcuGrkD0f88JS9O0vPNmDc1sKoycW0GK6Xnk-8vMYHfb47K23N5oLvzlH6xi7xzOX7uBzFG7DJUzTIB7B5enQ8-lyXQooQpa9o4pl_nNfTSL5x_-_ieU0_Xc2d7AKod-DWspqp1Xc1mazpqIN7ULbY1akp58PlohjqH1caP_4n-vfhbmPDklFNdA9gw1QP4WZ9q-XqEXzcbzpLkwuXxrsie59OiKpK8n58TLS_RcKTA0F7mbSp3MQRpE8Lq-bEfRwm66H1LTg92D8ZH4bN1Q2h5hFdhCmuDR05ZjNjbawyTbk3RxTn6OBFrMzRkSpZoZOYoYAVmUhLa2LqwnaaxyV9DINqWpmnQAqlktymiUiNYbbIcLZWmTH4oiyxuQogao9P6qavubteYyK9fyO4rLdK4lZJn8oWBfC2mzKrm3pcB7znaKIDdP24_R_Tyy-yYW-ZGB1FBVNoTjJW4KosKiBrrbCiQAcaF_nGUZR0UsOdpWqKHxBF139LjpBdGKNJJgLY7kEit-v-cEuTspE2c4lah7p7Y_I4gFfdsJvpMugqM116GI7GsaBZAE9qEu5Qoi72i5I9gKxH3D2c-yPVt6--FzlnqKY4C2CnZYNfy7pmS3c6Tvn7ATz7J-jncDvxTEHDSGzDYHG5NC_QblwULxu58BP372LR
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=External+memory+BWT+and+LCP+computation+for+sequence+collections+with+applications&rft.jtitle=Algorithms+for+molecular+biology&rft.au=Egidi%2C+Lavinia&rft.au=Louza%2C+Felipe+A.&rft.au=Manzini%2C+Giovanni&rft.au=Telles%2C+Guilherme+P.&rft.date=2019-03-08&rft.pub=BioMed+Central&rft.eissn=1748-7188&rft.volume=14&rft.issue=1&rft_id=info:doi/10.1186%2Fs13015-019-0140-0&rft.externalDocID=10_1186_s13015_019_0140_0
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1748-7188&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1748-7188&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1748-7188&client=summon