PERF: an exhaustive algorithm for ultra-fast and efficient identification of microsatellites from large DNA sequences

Abstract Motivation Microsatellites or Simple Sequence Repeats (SSRs) are short tandem repeats of DNA motifs present in all genomes. They have long been used for a variety of purposes in the areas of population genetics, genotyping, marker-assisted selection and forensics. Numerous studies have high...

Full description

Saved in:
Bibliographic Details
Published inBioinformatics Vol. 34; no. 6; pp. 943 - 948
Main Authors Avvaru, Akshay Kumar, Sowpati, Divya Tej, Mishra, Rakesh Kumar
Format Journal Article
LanguageEnglish
Published England Oxford University Press 15.03.2018
Subjects
Online AccessGet full text
ISSN1367-4803
1367-4811
1460-2059
1367-4811
DOI10.1093/bioinformatics/btx721

Cover

Abstract Abstract Motivation Microsatellites or Simple Sequence Repeats (SSRs) are short tandem repeats of DNA motifs present in all genomes. They have long been used for a variety of purposes in the areas of population genetics, genotyping, marker-assisted selection and forensics. Numerous studies have highlighted their functional roles in genome organization and gene regulation. Though several tools are currently available to identify SSRs from genomic sequences, they have significant limitations. Results We present a novel algorithm called PERF for extremely fast and comprehensive identification of microsatellites from DNA sequences of any size. PERF is several fold faster than existing algorithms and uses up to 5-fold lesser memory. It provides a clean and flexible command-line interface to change the default settings, and produces output in an easily-parseable tab-separated format. In addition, PERF generates an interactive and stand-alone HTML report with charts and tables for easy downstream analysis. Availability and implementation PERF is implemented in the Python programming language. It is freely available on PyPI under the package name perf_ssr, and can be installed directly using pip or easy_install. The documentation of PERF is available at https://github.com/rkmlab/perf. The source code of PERF is deposited in GitHub at https://github.com/rkmlab/perf under an MIT license. Supplementary information Supplementary data are available at Bioinformatics online.
AbstractList Microsatellites or Simple Sequence Repeats (SSRs) are short tandem repeats of DNA motifs present in all genomes. They have long been used for a variety of purposes in the areas of population genetics, genotyping, marker-assisted selection and forensics. Numerous studies have highlighted their functional roles in genome organization and gene regulation. Though several tools are currently available to identify SSRs from genomic sequences, they have significant limitations.MotivationMicrosatellites or Simple Sequence Repeats (SSRs) are short tandem repeats of DNA motifs present in all genomes. They have long been used for a variety of purposes in the areas of population genetics, genotyping, marker-assisted selection and forensics. Numerous studies have highlighted their functional roles in genome organization and gene regulation. Though several tools are currently available to identify SSRs from genomic sequences, they have significant limitations.We present a novel algorithm called PERF for extremely fast and comprehensive identification of microsatellites from DNA sequences of any size. PERF is several fold faster than existing algorithms and uses up to 5-fold lesser memory. It provides a clean and flexible command-line interface to change the default settings, and produces output in an easily-parseable tab-separated format. In addition, PERF generates an interactive and stand-alone HTML report with charts and tables for easy downstream analysis.ResultsWe present a novel algorithm called PERF for extremely fast and comprehensive identification of microsatellites from DNA sequences of any size. PERF is several fold faster than existing algorithms and uses up to 5-fold lesser memory. It provides a clean and flexible command-line interface to change the default settings, and produces output in an easily-parseable tab-separated format. In addition, PERF generates an interactive and stand-alone HTML report with charts and tables for easy downstream analysis.PERF is implemented in the Python programming language. It is freely available on PyPI under the package name perf_ssr, and can be installed directly using pip or easy_install. The documentation of PERF is available at https://github.com/rkmlab/perf. The source code of PERF is deposited in GitHub at https://github.com/rkmlab/perf under an MIT license.Availability and implementationPERF is implemented in the Python programming language. It is freely available on PyPI under the package name perf_ssr, and can be installed directly using pip or easy_install. The documentation of PERF is available at https://github.com/rkmlab/perf. The source code of PERF is deposited in GitHub at https://github.com/rkmlab/perf under an MIT license.tej@ccmb.res.in.Contacttej@ccmb.res.in.Supplementary data are available at Bioinformatics online.Supplementary informationSupplementary data are available at Bioinformatics online.
Microsatellites or Simple Sequence Repeats (SSRs) are short tandem repeats of DNA motifs present in all genomes. They have long been used for a variety of purposes in the areas of population genetics, genotyping, marker-assisted selection and forensics. Numerous studies have highlighted their functional roles in genome organization and gene regulation. Though several tools are currently available to identify SSRs from genomic sequences, they have significant limitations. We present a novel algorithm called PERF for extremely fast and comprehensive identification of microsatellites from DNA sequences of any size. PERF is several fold faster than existing algorithms and uses up to 5-fold lesser memory. It provides a clean and flexible command-line interface to change the default settings, and produces output in an easily-parseable tab-separated format. In addition, PERF generates an interactive and stand-alone HTML report with charts and tables for easy downstream analysis. PERF is implemented in the Python programming language. It is freely available on PyPI under the package name perf_ssr, and can be installed directly using pip or easy_install. The documentation of PERF is available at https://github.com/rkmlab/perf. The source code of PERF is deposited in GitHub at https://github.com/rkmlab/perf under an MIT license. tej@ccmb.res.in. Supplementary data are available at Bioinformatics online.
Abstract Motivation Microsatellites or Simple Sequence Repeats (SSRs) are short tandem repeats of DNA motifs present in all genomes. They have long been used for a variety of purposes in the areas of population genetics, genotyping, marker-assisted selection and forensics. Numerous studies have highlighted their functional roles in genome organization and gene regulation. Though several tools are currently available to identify SSRs from genomic sequences, they have significant limitations. Results We present a novel algorithm called PERF for extremely fast and comprehensive identification of microsatellites from DNA sequences of any size. PERF is several fold faster than existing algorithms and uses up to 5-fold lesser memory. It provides a clean and flexible command-line interface to change the default settings, and produces output in an easily-parseable tab-separated format. In addition, PERF generates an interactive and stand-alone HTML report with charts and tables for easy downstream analysis. Availability and implementation PERF is implemented in the Python programming language. It is freely available on PyPI under the package name perf_ssr, and can be installed directly using pip or easy_install. The documentation of PERF is available at https://github.com/rkmlab/perf. The source code of PERF is deposited in GitHub at https://github.com/rkmlab/perf under an MIT license. Supplementary information Supplementary data are available at Bioinformatics online.
Author Avvaru, Akshay Kumar
Mishra, Rakesh Kumar
Sowpati, Divya Tej
Author_xml – sequence: 1
  givenname: Akshay Kumar
  surname: Avvaru
  fullname: Avvaru, Akshay Kumar
  organization: CSIR - Centre for Cellular and Molecular Biology, Hyderabad, Telangana, India
– sequence: 2
  givenname: Divya Tej
  surname: Sowpati
  fullname: Sowpati, Divya Tej
  email: tej@ccmb.res.in
  organization: CSIR - Centre for Cellular and Molecular Biology, Hyderabad, Telangana, India
– sequence: 3
  givenname: Rakesh Kumar
  surname: Mishra
  fullname: Mishra, Rakesh Kumar
  organization: CSIR - Centre for Cellular and Molecular Biology, Hyderabad, Telangana, India
BackLink https://www.ncbi.nlm.nih.gov/pubmed/29121165$$D View this record in MEDLINE/PubMed
BookMark eNqNkUtvFiEYhUlT05v-BA1LN2O5DfONrpraqkmjxnQ_4fLSYhj4BEbbfy912ia6sRteSM7hHB4O0W5MERB6SckbSkZ-rH3y0aU8q-pNOdb1ZmB0Bx1QIUnHSD_utj2XQyc2hO-jw1K-E9JTIcQe2mcjZZTK_gAtX8--nb_FKmK4uVZLqf4nYBWuUvb1esYtAC-hZtU5VWqTWQzOeeMhVuxtW307tQop4uTw7E1ORVUIwVco2OU046DyFeD3n09wgR8LRAPlOXrmVCjw4n4eocvzs8vTj93Flw-fTk8uOiNGWTujlGDjaHsggxOSEWKJhpFZ21POdBvaDcQwYTZWD04JzqXgVvdKCE0MP0JyvXaJW3X7S4UwbbOfVb6dKJnuME5_Y5xWjM34ejVuc2qVS51mX0x7lYqQljLRUXI2SLHhTfrqXrroGexjwAPiJuhXwR2aksE9ucO7f3zG1z-k23f48F83Wd1p2T4x8DcaHMDG
CitedBy_id crossref_primary_10_1007_s11295_024_01671_9
crossref_primary_10_3390_plants13182619
crossref_primary_10_1007_s00438_024_02190_x
crossref_primary_10_1093_bioinformatics_btz551
crossref_primary_10_1016_j_dib_2019_104545
crossref_primary_10_1534_g3_120_401151
crossref_primary_10_1007_s12041_019_1134_x
crossref_primary_10_1016_j_cbi_2020_109226
crossref_primary_10_1093_bioinformatics_btab124
crossref_primary_10_1007_s44281_024_00043_6
crossref_primary_10_1007_s10577_023_09738_4
crossref_primary_10_1016_j_imu_2020_100356
crossref_primary_10_1016_j_ygeno_2021_04_023
crossref_primary_10_1093_gbe_evae066
crossref_primary_10_1038_s41598_024_53739_0
crossref_primary_10_1111_1755_0998_14062
crossref_primary_10_1007_s12298_021_00989_1
crossref_primary_10_1186_s13024_018_0274_4
crossref_primary_10_3389_fgene_2020_00706
crossref_primary_10_3389_fgene_2024_1474611
crossref_primary_10_3389_fpls_2022_947164
crossref_primary_10_1016_j_fsiae_2024_100084
crossref_primary_10_1093_nar_gkz886
crossref_primary_10_3390_ani10101792
crossref_primary_10_1007_s11227_021_04025_7
crossref_primary_10_1186_s12864_019_5516_5
crossref_primary_10_3389_fdata_2021_727216
crossref_primary_10_1038_s41598_023_50117_0
crossref_primary_10_1093_genetics_iyad198
Cites_doi 10.1016/0168-9525(92)90137-S
10.1093/bioinformatics/btw298
10.1016/S0168-9525(97)01008-1
10.1093/hmg/ddi024
10.1093/nar/27.2.573
10.1093/nar/gkg617
10.1101/gr.184001
10.1093/bioinformatics/btp163
10.1038/nrg1348
10.1038/ncomms2872
10.1093/bioinformatics/btx538
10.1016/j.gene.2014.08.052
10.1007/s00122-002-1031-0
10.1002/bies.200900111
10.4161/rna.24326
10.1093/nar/gkm271
10.1093/bioinformatics/btq033
10.1093/nar/gks881
10.1093/bib/bbs023
10.1101/gr.070409.107
10.1006/geno.1994.1151
ContentType Journal Article
Copyright The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com 2017
Copyright_xml – notice: The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com 2017
DBID AAYXX
CITATION
CGR
CUY
CVF
ECM
EIF
NPM
7X8
ADTOC
UNPAY
DOI 10.1093/bioinformatics/btx721
DatabaseName CrossRef
Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
MEDLINE - Academic
Unpaywall for CDI: Periodical Content
Unpaywall
DatabaseTitle CrossRef
MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
MEDLINE - Academic
DatabaseTitleList MEDLINE - Academic
MEDLINE

Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 2
  dbid: EIF
  name: MEDLINE
  url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search
  sourceTypes: Index Database
– sequence: 3
  dbid: UNPAY
  name: Unpaywall
  url: https://proxy.k.utb.cz/login?url=https://unpaywall.org/
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
Discipline Biology
EISSN 1460-2059
1367-4811
EndPage 948
ExternalDocumentID 10.1093/bioinformatics/btx721
29121165
10_1093_bioinformatics_btx721
Genre Research Support, Non-U.S. Gov't
Journal Article
GrantInformation_xml – fundername: Council of Scientific and Industrial Research
  funderid: 10.13039/501100001412
– fundername: CSIR
  grantid: BSC0118; BSC0121
  funderid: 10.13039/501100001332
GroupedDBID -~X
.2P
5GY
AAMVS
ABJNI
ABPTD
ACGFS
ADZXQ
ALMA_UNASSIGNED_HOLDINGS
F5P
HW0
Q5Y
RD5
ROZ
TLC
TN5
TOX
WH7
---
-E4
.DC
.I3
0R~
23N
2WC
4.4
48X
53G
5WA
70D
AAIJN
AAIMJ
AAJKP
AAJQQ
AAKPC
AAMDB
AAOGV
AAPQZ
AAPXW
AAUQX
AAVAP
AAVLN
AAYXX
ABEJV
ABEUO
ABGNP
ABIXL
ABNKS
ABPQP
ABQLI
ABWST
ABXVV
ABZBJ
ACIWK
ACPRK
ACUFI
ACUXJ
ACYTK
ADBBV
ADEYI
ADEZT
ADFTL
ADGKP
ADGZP
ADHKW
ADHZD
ADMLS
ADOCK
ADPDF
ADRDM
ADRTK
ADVEK
ADYVW
ADZTZ
AECKG
AEGPL
AEJOX
AEKKA
AEKSI
AELWJ
AEMDU
AENEX
AENZO
AEPUE
AETBJ
AEWNT
AFFZL
AFGWE
AFIYH
AFOFC
AFRAH
AGINJ
AGKEF
AGQXC
AGSYK
AHMBA
AHXPO
AIJHB
AJEEA
AJEUX
AKHUL
AKWXX
ALTZX
ALUQC
AMNDL
APIBT
APWMN
ARIXL
ASPBG
AVWKF
AXUDD
AYOIW
AZVOD
BAWUL
BAYMD
BHONS
BQDIO
BQUQU
BSWAC
BTQHN
C45
CDBKE
CITATION
CS3
CZ4
DAKXR
DIK
DILTD
DU5
D~K
EBD
EBS
EE~
EJD
EMOBN
F9B
FEDTE
FHSFR
FLIZI
FLUFQ
FOEOM
FQBLK
GAUVT
GJXCC
GROUPED_DOAJ
GX1
H13
H5~
HAR
HZ~
IOX
J21
JXSIZ
KAQDR
KOP
KQ8
KSI
KSN
M-Z
MK~
ML0
N9A
NGC
NLBLG
NMDNZ
NOMLY
NU-
O9-
OAWHX
ODMLO
OJQWA
OK1
OVD
OVEED
P2P
PAFKI
PEELM
PQQKQ
Q1.
R44
RNS
ROL
RPM
RUSNO
RW1
RXO
SV3
TEORI
TJP
TR2
W8F
WOQ
X7H
YAYTL
YKOAZ
YXANX
ZKX
~91
~KM
ADRIX
AFXEN
BCRHZ
CGR
CUY
CVF
ECM
EIF
M49
NPM
ROX
7X8
.-4
.GJ
1TH
ABEFU
ABNGD
ACUKT
ADTOC
AFFNX
AGQPQ
AI.
AQDSO
ATTQO
AZFZN
C1A
CAG
COF
ELUNK
HVGLF
NTWIH
NVLIB
O0~
O~Y
PB-
RNI
RZF
RZO
UNPAY
VH1
ZGI
ID FETCH-LOGICAL-c496t-caa4299d5e07f46200d0be92dd5132bdd5bf70c24c8db7fa433643db5a44b0c3
IEDL.DBID UNPAY
ISSN 1367-4803
1367-4811
IngestDate Wed Oct 01 16:34:41 EDT 2025
Wed Oct 01 13:51:35 EDT 2025
Wed Feb 19 02:35:55 EST 2025
Tue Jul 01 03:27:23 EDT 2025
Thu Apr 24 22:51:18 EDT 2025
Wed Apr 02 07:03:23 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 6
Language English
License This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/journals/pages/about_us/legal/notices)
https://academic.oup.com/journals/pages/about_us/legal/notices
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c496t-caa4299d5e07f46200d0be92dd5132bdd5bf70c24c8db7fa433643db5a44b0c3
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
OpenAccessLink https://proxy.k.utb.cz/login?url=https://academic.oup.com/bioinformatics/article-pdf/34/6/943/25119307/btx721.pdf
PMID 29121165
PQID 1963276483
PQPubID 23479
PageCount 6
ParticipantIDs unpaywall_primary_10_1093_bioinformatics_btx721
proquest_miscellaneous_1963276483
pubmed_primary_29121165
crossref_primary_10_1093_bioinformatics_btx721
crossref_citationtrail_10_1093_bioinformatics_btx721
oup_primary_10_1093_bioinformatics_btx721
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2018-03-15
PublicationDateYYYYMMDD 2018-03-15
PublicationDate_xml – month: 03
  year: 2018
  text: 2018-03-15
  day: 15
PublicationDecade 2010
PublicationPlace England
PublicationPlace_xml – name: England
PublicationTitle Bioinformatics
PublicationTitleAlternate Bioinformatics
PublicationYear 2018
Publisher Oxford University Press
Publisher_xml – name: Oxford University Press
References Ellegren (2023012712472810200_btx721-B3) 2004; 5
Pickett (2023012712472810200_btx721-B13) 2016; 32
Lim (2023012712472810200_btx721-B11) 2013; 14
Pathak (2023012712472810200_btx721-B12) 2013; 10
Kumar (2023012712472810200_btx721-B10) 2010; 32
Kumar (2023012712472810200_btx721-B9) 2013; 4
Hearne (2023012712472810200_btx721-B6) 1992; 8
Thiel (2023012712472810200_btx721-B19) 2003; 106
Zietkiewicz (2023012712472810200_btx721-B21) 1994; 20
Cock (2023012712472810200_btx721-B2) 2009; 25
Ramamoorthy (2023012712472810200_btx721-B17) 2014; 551
Usdin (2023012712472810200_btx721-B20) 2008; 18
Pietrobono (2023012712472810200_btx721-B15) 2005; 14
Temnykh (2023012712472810200_btx721-B18) 2001; 11
Pickett (2023012712472810200_btx721-B14) 2017; 33
Benson (2023012712472810200_btx721-B1) 1999; 27
Kashi (2023012712472810200_btx721-B7) 1997; 13
Quinlan (2023012712472810200_btx721-B16) 2010; 26
Girgis (2023012712472810200_btx721-B4) 2013; 41
Greene (2023012712472810200_btx721-B5) 2007; 35
Kolpakov (2023012712472810200_btx721-B8) 2003; 31
References_xml – volume: 8
  start-page: 288
  year: 1992
  ident: 2023012712472810200_btx721-B6
  article-title: Microsatellites for linkage analysis of genetic-traits
  publication-title: Trends Genet
  doi: 10.1016/0168-9525(92)90137-S
– volume: 32
  start-page: 2707
  year: 2016
  ident: 2023012712472810200_btx721-B13
  article-title: SA-SSR: a suffix array-based algorithm for exhaustive and efficient SSR discovery in large genetic sequences
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btw298
– volume: 13
  start-page: 74
  year: 1997
  ident: 2023012712472810200_btx721-B7
  article-title: Simple sequence repeats as a source of quantitative genetic variation
  publication-title: Trends Genet
  doi: 10.1016/S0168-9525(97)01008-1
– volume: 14
  start-page: 267
  year: 2005
  ident: 2023012712472810200_btx721-B15
  article-title: Molecular dissection of the events leading to inactivation of the FMR1 gene
  publication-title: Hum. Mol. Genet
  doi: 10.1093/hmg/ddi024
– volume: 27
  start-page: 573
  year: 1999
  ident: 2023012712472810200_btx721-B1
  article-title: Tandem repeats finder: a program to analyze DNA sequences
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/27.2.573
– volume: 31
  start-page: 3672
  year: 2003
  ident: 2023012712472810200_btx721-B8
  article-title: mreps: efficient and flexible detection of tandem repeats in DNA
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gkg617
– volume: 11
  start-page: 1441
  year: 2001
  ident: 2023012712472810200_btx721-B18
  article-title: Computational and experimental analysis of microsatellites in rice (Oryza sativa L.): frequency, length variation, transposon associations, and genetic marker potential
  publication-title: Genome Res
  doi: 10.1101/gr.184001
– volume: 25
  start-page: 1422
  year: 2009
  ident: 2023012712472810200_btx721-B2
  article-title: Biopython: freely available Python tools for computational molecular biology and bioinformatics
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btp163
– volume: 5
  start-page: 435
  year: 2004
  ident: 2023012712472810200_btx721-B3
  article-title: Microsatellites: Simple sequences with complex evolution
  publication-title: Nat. Rev. Genet
  doi: 10.1038/nrg1348
– volume: 4
  start-page: 1844
  year: 2013
  ident: 2023012712472810200_btx721-B9
  article-title: GATA simple sequence repeats function as enhancer blocker boundaries
  publication-title: Nat. Commun
  doi: 10.1038/ncomms2872
– volume: 33
  start-page: 3922
  year: 2017
  ident: 2023012712472810200_btx721-B14
  article-title: Kmer-SSR: a fast and exhaustive SSR Search Algorithm
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btx538
– volume: 551
  start-page: 167
  year: 2014
  ident: 2023012712472810200_btx721-B17
  article-title: Length and sequence dependent accumulation of simple sequence repeats in vertebrates: potential role in genome organization and regulation
  publication-title: Gene
  doi: 10.1016/j.gene.2014.08.052
– volume: 106
  start-page: 411
  year: 2003
  ident: 2023012712472810200_btx721-B19
  article-title: Exploiting EST databases for the development and characterization of gene-derived SSR-markers in barley (Hordeum vulgare L.)
  publication-title: Theor Appl Genet
  doi: 10.1007/s00122-002-1031-0
– volume: 32
  start-page: 165
  year: 2010
  ident: 2023012712472810200_btx721-B10
  article-title: Repeat performance: how do genome packaging and regulation depend on simple sequence repeats?
  publication-title: Bioessays
  doi: 10.1002/bies.200900111
– volume: 10
  start-page: 564
  year: 2013
  ident: 2023012712472810200_btx721-B12
  article-title: AAGAG repeat RNA is an essential component of nuclear matrix in Drosophila
  publication-title: RNA. Biol
  doi: 10.4161/rna.24326
– volume: 35
  start-page: 3383
  year: 2007
  ident: 2023012712472810200_btx721-B5
  article-title: Repeat-induced epigenetic changes in intron 1 of the frataxin gene and its consequences in Friedreich ataxia
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gkm271
– volume: 26
  start-page: 841
  year: 2010
  ident: 2023012712472810200_btx721-B16
  article-title: BEDTools: a flexible suite of utilities for comparing genomic features
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btq033
– volume: 41
  start-page: e22.
  year: 2013
  ident: 2023012712472810200_btx721-B4
  article-title: MsDetector: toward a standard computational tool for DNA microsatellites detection
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gks881
– volume: 14
  start-page: 67
  year: 2013
  ident: 2023012712472810200_btx721-B11
  article-title: Review of tandem repeat search tools: a systematic approach to evaluating algorithmic performance
  publication-title: Brief. Bioinform
  doi: 10.1093/bib/bbs023
– volume: 18
  start-page: 1011
  year: 2008
  ident: 2023012712472810200_btx721-B20
  article-title: The biological effects of simple tandem repeats: Lessons from the repeat expansion diseases
  publication-title: Genome Res
  doi: 10.1101/gr.070409.107
– volume: 20
  start-page: 176
  year: 1994
  ident: 2023012712472810200_btx721-B21
  article-title: Genome fingerprinting by simple sequence repeat (Ssr)-anchored polymerase chain-reaction amplification
  publication-title: Genomics
  doi: 10.1006/geno.1994.1151
SSID ssj0051444
ssj0005056
Score 2.441159
Snippet Abstract Motivation Microsatellites or Simple Sequence Repeats (SSRs) are short tandem repeats of DNA motifs present in all genomes. They have long been used...
Microsatellites or Simple Sequence Repeats (SSRs) are short tandem repeats of DNA motifs present in all genomes. They have long been used for a variety of...
SourceID unpaywall
proquest
pubmed
crossref
oup
SourceType Open Access Repository
Aggregation Database
Index Database
Enrichment Source
Publisher
StartPage 943
SubjectTerms Algorithms
Genome, Human
Genomics - methods
Humans
Microsatellite Repeats
Sequence Analysis, DNA - methods
Software
Title PERF: an exhaustive algorithm for ultra-fast and efficient identification of microsatellites from large DNA sequences
URI https://www.ncbi.nlm.nih.gov/pubmed/29121165
https://www.proquest.com/docview/1963276483
https://academic.oup.com/bioinformatics/article-pdf/34/6/943/25119307/btx721.pdf
UnpaywallVersion publishedVersion
Volume 34
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVAFT
  databaseName: Open Access Digital Library
  customDbUrl:
  eissn: 1460-2059
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0005056
  issn: 1367-4811
  databaseCode: KQ8
  dateStart: 19960101
  isFulltext: true
  titleUrlDefault: http://grweb.coalliance.org/oadl/oadl.html
  providerName: Colorado Alliance of Research Libraries
– providerCode: PRVEBS
  databaseName: Inspec with Full Text
  customDbUrl:
  eissn: 1460-2059
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0005056
  issn: 1367-4811
  databaseCode: ADMLS
  dateStart: 19980101
  isFulltext: true
  titleUrlDefault: https://www.ebsco.com/products/research-databases/inspec-full-text
  providerName: EBSCOhost
– providerCode: PRVBFR
  databaseName: Free Medical Journals
  customDbUrl:
  eissn: 1460-2059
  dateEnd: 20241101
  omitProxy: true
  ssIdentifier: ssj0005056
  issn: 1367-4811
  databaseCode: DIK
  dateStart: 19960101
  isFulltext: true
  titleUrlDefault: http://www.freemedicaljournals.com
  providerName: Flying Publisher
– providerCode: PRVFQY
  databaseName: GFMER Free Medical Journals
  customDbUrl:
  eissn: 1460-2059
  dateEnd: 20241101
  omitProxy: true
  ssIdentifier: ssj0005056
  issn: 1367-4811
  databaseCode: GX1
  dateStart: 19960101
  isFulltext: true
  titleUrlDefault: http://www.gfmer.ch/Medical_journals/Free_medical.php
  providerName: Geneva Foundation for Medical Education and Research
– providerCode: PRVAQN
  databaseName: PubMed Central
  customDbUrl:
  eissn: 1460-2059
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0005056
  issn: 1367-4811
  databaseCode: RPM
  dateStart: 20070101
  isFulltext: true
  titleUrlDefault: https://www.ncbi.nlm.nih.gov/pmc/
  providerName: National Library of Medicine
– providerCode: PRVOVD
  databaseName: Journals@Ovid LWW All Open Access Journal Collection Rolling
  customDbUrl:
  eissn: 1460-2059
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0005056
  issn: 1367-4811
  databaseCode: OVEED
  dateStart: 20010101
  isFulltext: true
  titleUrlDefault: http://ovidsp.ovid.com/
  providerName: Ovid
– providerCode: PRVASL
  databaseName: Oxford Journals Open Access Collection
  customDbUrl:
  eissn: 1460-2059
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0005056
  issn: 1367-4811
  databaseCode: TOX
  dateStart: 19850101
  isFulltext: true
  titleUrlDefault: https://academic.oup.com/journals/
  providerName: Oxford University Press
– providerCode: PRVASL
  databaseName: Oxford Journals Open Access Collection
  customDbUrl:
  eissn: 1460-2059
  dateEnd: 20220930
  omitProxy: true
  ssIdentifier: ssj0005056
  issn: 1367-4811
  databaseCode: TOX
  dateStart: 19850101
  isFulltext: true
  titleUrlDefault: https://academic.oup.com/journals/
  providerName: Oxford University Press
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1bb9MwFD7aOiHgYdwGdIPJSLzwkKRJbKfhrRorExKlQp1UniLfwiqypLSJ2Pj1HOdSrXthSLwkluJLbB_7fJbP-Q7A2yCmNTGco7hKHUol7oMKUzpigvPUKMGsN_LnCT87p5_mbL4D084XRrRW4W7n0iAXRUshammLvXY8naVOvZB63Itp6FmYHKOwerK8whONix93YY8zROc92DufTEffGveryKHDOlhym_b9zqknDm-31NS1pa62XOBuINGHcL_Kl-L6l8iyG9pp_Ah-dv1qjFJ-uFUpXfX7FuXj_-z4Y9hvoSwZNUWewI7Jn8K9Jrjl9TOopqdfx--JyIm5urDsQrivEpF9L1aL8uKSYJukysqVcFKxLjGbJqams0AtSBa6NWKq5YYUKbm0hoNrUfOHIjom1i2GZNaMnXyYjMjGIvwAZuPT2cmZ0wZ5cBSNeekoIaxK1MwMopRyXLR6IE0caM3woCzxJdNooAKqhlpGqaBhiCBKSyZQsAYqfA69vMjNSyCRlL7UnBlhEJbIeCgV1zowCAlT4euwD7SbykS1BOg2DkeWNBfxYbI95EkzqH1wN8WWDQPI3wq8wzm8a943nTQluK7tZY3ITVGtE7szBhGnQ_zvF42YbaoMYkvMx1kfvI3c3a29w38ucQQPECPWbpg-ewW9clWZ14jDSnkMux_nPj5nX-bH7TL7A5igPZQ
linkProvider Unpaywall
linkToUnpaywall http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1La9wwEB7SDaXtIX23mz5QoZcebK9tSV73tjRZQqHLUhJIT0bPZKljb3dtmvTXd-THks0lKfRkgTWWJY00n9DMNwAfo5Q2xHCe4sp6lErcBxWWdMIE59YowVw08rcZPzqhX0_Z6Q7M-1gY0XmF-31Ig1yUHYWooy0OuvH0ltoGMQ14kNI4cDA5RWUNZHWJJxofX96DXc4QnQ9g92Q2n_xow68Sj46bZMldOQz7oJ40vtlS-60tc7UVAncNiT6CB3WxFFe_RZ5fs07Tx_Cr71frlPLTryvpqz83KB__Z8efwF4HZcmkFXkKO6Z4Bvfb5JZXz6GeH36ffiaiIOby3LEL4b5KRH5WrhbV-QXBNkmdVyvhWbGusJompqGzQCtIFrpzYmr0hpSWXDjHwbVo-EMRHRMXFkNy58ZODmYTsvEIfwHH08PjL0del-TBUzTllaeEcCZRMzNKLOW4aPVImjTSmuFBWeJD2mSkIqrGWiZW0DhGEKUlE6hYIxW_hEFRFuY1kETKUGrOjDAIS2Q6loprHRmEhFaEOh4C7acyUx0BusvDkWftRXycbQ951g7qEPyN2LJlALlN4BPO4V3rfui1KcN17S5rRGHKep25nTFKOB3jf79q1WzzySh1xHycDSHY6N3d2tv_Z4k38BAxYhOGGbK3MKhWtXmHOKyS77ul9RfHKTuH
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=PERF%3A+an+exhaustive+algorithm+for+ultra-fast+and+efficient+identification+of+microsatellites+from+large+DNA+sequences&rft.jtitle=Bioinformatics+%28Oxford%2C+England%29&rft.au=Avvaru%2C+Akshay+Kumar&rft.au=Sowpati%2C+Divya+Tej&rft.au=Mishra%2C+Rakesh+Kumar&rft.date=2018-03-15&rft.issn=1367-4811&rft.eissn=1367-4811&rft.volume=34&rft.issue=6&rft.spage=943&rft_id=info:doi/10.1093%2Fbioinformatics%2Fbtx721&rft.externalDBID=NO_FULL_TEXT
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1367-4803&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1367-4803&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1367-4803&client=summon