A bioinformatician's guide to the forefront of suffix array construction algorithms
| Published in | Briefings in Bioinformatics, Vol. 15, no. 2, pp. 138-154 |
|---|---|
| Main Authors | , , | 
| Format | Journal Article | 
| Language | English | 
| Published | England: Oxford Publishing Limited (England), 01.03.2014; Oxford University Press |
| ISSN | 1467-5463, 1477-4054 |
| DOI | 10.1093/bib/bbt081 | 
| Summary | The suffix array and its variants are text-indexing data structures that have become indispensable in the field of bioinformatics. With the uninitiated in mind, we provide an accessible exposition of the SA-IS algorithm, which is the state of the art in suffix array construction. We also describe DisLex, a technique that allows standard suffix array construction algorithms to create modified suffix arrays designed to enable a simple form of inexact matching needed to support 'spaced seeds' and 'subset seeds' used in many biological applications. |