PERF: an exhaustive algorithm for ultra-fast and efficient identification of microsatellites from large DNA sequences
Abstract Motivation Microsatellites or Simple Sequence Repeats (SSRs) are short tandem repeats of DNA motifs present in all genomes. They have long been used for a variety of purposes in the areas of population genetics, genotyping, marker-assisted selection and forensics. Numerous studies have high...
Saved in:
| Published in | Bioinformatics Vol. 34; no. 6; pp. 943 - 948 |
|---|---|
| Main Authors | , , |
| Format | Journal Article |
| Language | English |
| Published |
England
Oxford University Press
15.03.2018
|
| Subjects | |
| Online Access | Get full text |
| ISSN | 1367-4803 1367-4811 1460-2059 1367-4811 |
| DOI | 10.1093/bioinformatics/btx721 |
Cover
| Abstract | Abstract
Motivation
Microsatellites or Simple Sequence Repeats (SSRs) are short tandem repeats of DNA motifs present in all genomes. They have long been used for a variety of purposes in the areas of population genetics, genotyping, marker-assisted selection and forensics. Numerous studies have highlighted their functional roles in genome organization and gene regulation. Though several tools are currently available to identify SSRs from genomic sequences, they have significant limitations.
Results
We present a novel algorithm called PERF for extremely fast and comprehensive identification of microsatellites from DNA sequences of any size. PERF is several fold faster than existing algorithms and uses up to 5-fold lesser memory. It provides a clean and flexible command-line interface to change the default settings, and produces output in an easily-parseable tab-separated format. In addition, PERF generates an interactive and stand-alone HTML report with charts and tables for easy downstream analysis.
Availability and implementation
PERF is implemented in the Python programming language. It is freely available on PyPI under the package name perf_ssr, and can be installed directly using pip or easy_install. The documentation of PERF is available at https://github.com/rkmlab/perf. The source code of PERF is deposited in GitHub at https://github.com/rkmlab/perf under an MIT license.
Supplementary information
Supplementary data are available at Bioinformatics online. |
|---|---|
| AbstractList | Microsatellites or Simple Sequence Repeats (SSRs) are short tandem repeats of DNA motifs present in all genomes. They have long been used for a variety of purposes in the areas of population genetics, genotyping, marker-assisted selection and forensics. Numerous studies have highlighted their functional roles in genome organization and gene regulation. Though several tools are currently available to identify SSRs from genomic sequences, they have significant limitations.MotivationMicrosatellites or Simple Sequence Repeats (SSRs) are short tandem repeats of DNA motifs present in all genomes. They have long been used for a variety of purposes in the areas of population genetics, genotyping, marker-assisted selection and forensics. Numerous studies have highlighted their functional roles in genome organization and gene regulation. Though several tools are currently available to identify SSRs from genomic sequences, they have significant limitations.We present a novel algorithm called PERF for extremely fast and comprehensive identification of microsatellites from DNA sequences of any size. PERF is several fold faster than existing algorithms and uses up to 5-fold lesser memory. It provides a clean and flexible command-line interface to change the default settings, and produces output in an easily-parseable tab-separated format. In addition, PERF generates an interactive and stand-alone HTML report with charts and tables for easy downstream analysis.ResultsWe present a novel algorithm called PERF for extremely fast and comprehensive identification of microsatellites from DNA sequences of any size. PERF is several fold faster than existing algorithms and uses up to 5-fold lesser memory. It provides a clean and flexible command-line interface to change the default settings, and produces output in an easily-parseable tab-separated format. In addition, PERF generates an interactive and stand-alone HTML report with charts and tables for easy downstream analysis.PERF is implemented in the Python programming language. It is freely available on PyPI under the package name perf_ssr, and can be installed directly using pip or easy_install. The documentation of PERF is available at https://github.com/rkmlab/perf. The source code of PERF is deposited in GitHub at https://github.com/rkmlab/perf under an MIT license.Availability and implementationPERF is implemented in the Python programming language. It is freely available on PyPI under the package name perf_ssr, and can be installed directly using pip or easy_install. The documentation of PERF is available at https://github.com/rkmlab/perf. The source code of PERF is deposited in GitHub at https://github.com/rkmlab/perf under an MIT license.tej@ccmb.res.in.Contacttej@ccmb.res.in.Supplementary data are available at Bioinformatics online.Supplementary informationSupplementary data are available at Bioinformatics online. Microsatellites or Simple Sequence Repeats (SSRs) are short tandem repeats of DNA motifs present in all genomes. They have long been used for a variety of purposes in the areas of population genetics, genotyping, marker-assisted selection and forensics. Numerous studies have highlighted their functional roles in genome organization and gene regulation. Though several tools are currently available to identify SSRs from genomic sequences, they have significant limitations. We present a novel algorithm called PERF for extremely fast and comprehensive identification of microsatellites from DNA sequences of any size. PERF is several fold faster than existing algorithms and uses up to 5-fold lesser memory. It provides a clean and flexible command-line interface to change the default settings, and produces output in an easily-parseable tab-separated format. In addition, PERF generates an interactive and stand-alone HTML report with charts and tables for easy downstream analysis. PERF is implemented in the Python programming language. It is freely available on PyPI under the package name perf_ssr, and can be installed directly using pip or easy_install. The documentation of PERF is available at https://github.com/rkmlab/perf. The source code of PERF is deposited in GitHub at https://github.com/rkmlab/perf under an MIT license. tej@ccmb.res.in. Supplementary data are available at Bioinformatics online. Abstract Motivation Microsatellites or Simple Sequence Repeats (SSRs) are short tandem repeats of DNA motifs present in all genomes. They have long been used for a variety of purposes in the areas of population genetics, genotyping, marker-assisted selection and forensics. Numerous studies have highlighted their functional roles in genome organization and gene regulation. Though several tools are currently available to identify SSRs from genomic sequences, they have significant limitations. Results We present a novel algorithm called PERF for extremely fast and comprehensive identification of microsatellites from DNA sequences of any size. PERF is several fold faster than existing algorithms and uses up to 5-fold lesser memory. It provides a clean and flexible command-line interface to change the default settings, and produces output in an easily-parseable tab-separated format. In addition, PERF generates an interactive and stand-alone HTML report with charts and tables for easy downstream analysis. Availability and implementation PERF is implemented in the Python programming language. It is freely available on PyPI under the package name perf_ssr, and can be installed directly using pip or easy_install. The documentation of PERF is available at https://github.com/rkmlab/perf. The source code of PERF is deposited in GitHub at https://github.com/rkmlab/perf under an MIT license. Supplementary information Supplementary data are available at Bioinformatics online. |
| Author | Avvaru, Akshay Kumar Mishra, Rakesh Kumar Sowpati, Divya Tej |
| Author_xml | – sequence: 1 givenname: Akshay Kumar surname: Avvaru fullname: Avvaru, Akshay Kumar organization: CSIR - Centre for Cellular and Molecular Biology, Hyderabad, Telangana, India – sequence: 2 givenname: Divya Tej surname: Sowpati fullname: Sowpati, Divya Tej email: tej@ccmb.res.in organization: CSIR - Centre for Cellular and Molecular Biology, Hyderabad, Telangana, India – sequence: 3 givenname: Rakesh Kumar surname: Mishra fullname: Mishra, Rakesh Kumar organization: CSIR - Centre for Cellular and Molecular Biology, Hyderabad, Telangana, India |
| BackLink | https://www.ncbi.nlm.nih.gov/pubmed/29121165$$D View this record in MEDLINE/PubMed |
| BookMark | eNqNkUtvFiEYhUlT05v-BA1LN2O5DfONrpraqkmjxnQ_4fLSYhj4BEbbfy912ia6sRteSM7hHB4O0W5MERB6SckbSkZ-rH3y0aU8q-pNOdb1ZmB0Bx1QIUnHSD_utj2XQyc2hO-jw1K-E9JTIcQe2mcjZZTK_gAtX8--nb_FKmK4uVZLqf4nYBWuUvb1esYtAC-hZtU5VWqTWQzOeeMhVuxtW307tQop4uTw7E1ORVUIwVco2OU046DyFeD3n09wgR8LRAPlOXrmVCjw4n4eocvzs8vTj93Flw-fTk8uOiNGWTujlGDjaHsggxOSEWKJhpFZ21POdBvaDcQwYTZWD04JzqXgVvdKCE0MP0JyvXaJW3X7S4UwbbOfVb6dKJnuME5_Y5xWjM34ejVuc2qVS51mX0x7lYqQljLRUXI2SLHhTfrqXrroGexjwAPiJuhXwR2aksE9ucO7f3zG1z-k23f48F83Wd1p2T4x8DcaHMDG |
| CitedBy_id | crossref_primary_10_1007_s11295_024_01671_9 crossref_primary_10_3390_plants13182619 crossref_primary_10_1007_s00438_024_02190_x crossref_primary_10_1093_bioinformatics_btz551 crossref_primary_10_1016_j_dib_2019_104545 crossref_primary_10_1534_g3_120_401151 crossref_primary_10_1007_s12041_019_1134_x crossref_primary_10_1016_j_cbi_2020_109226 crossref_primary_10_1093_bioinformatics_btab124 crossref_primary_10_1007_s44281_024_00043_6 crossref_primary_10_1007_s10577_023_09738_4 crossref_primary_10_1016_j_imu_2020_100356 crossref_primary_10_1016_j_ygeno_2021_04_023 crossref_primary_10_1093_gbe_evae066 crossref_primary_10_1038_s41598_024_53739_0 crossref_primary_10_1111_1755_0998_14062 crossref_primary_10_1007_s12298_021_00989_1 crossref_primary_10_1186_s13024_018_0274_4 crossref_primary_10_3389_fgene_2020_00706 crossref_primary_10_3389_fgene_2024_1474611 crossref_primary_10_3389_fpls_2022_947164 crossref_primary_10_1016_j_fsiae_2024_100084 crossref_primary_10_1093_nar_gkz886 crossref_primary_10_3390_ani10101792 crossref_primary_10_1007_s11227_021_04025_7 crossref_primary_10_1186_s12864_019_5516_5 crossref_primary_10_3389_fdata_2021_727216 crossref_primary_10_1038_s41598_023_50117_0 crossref_primary_10_1093_genetics_iyad198 |
| Cites_doi | 10.1016/0168-9525(92)90137-S 10.1093/bioinformatics/btw298 10.1016/S0168-9525(97)01008-1 10.1093/hmg/ddi024 10.1093/nar/27.2.573 10.1093/nar/gkg617 10.1101/gr.184001 10.1093/bioinformatics/btp163 10.1038/nrg1348 10.1038/ncomms2872 10.1093/bioinformatics/btx538 10.1016/j.gene.2014.08.052 10.1007/s00122-002-1031-0 10.1002/bies.200900111 10.4161/rna.24326 10.1093/nar/gkm271 10.1093/bioinformatics/btq033 10.1093/nar/gks881 10.1093/bib/bbs023 10.1101/gr.070409.107 10.1006/geno.1994.1151 |
| ContentType | Journal Article |
| Copyright | The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com 2017 |
| Copyright_xml | – notice: The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com 2017 |
| DBID | AAYXX CITATION CGR CUY CVF ECM EIF NPM 7X8 ADTOC UNPAY |
| DOI | 10.1093/bioinformatics/btx721 |
| DatabaseName | CrossRef Medline MEDLINE MEDLINE (Ovid) MEDLINE MEDLINE PubMed MEDLINE - Academic Unpaywall for CDI: Periodical Content Unpaywall |
| DatabaseTitle | CrossRef MEDLINE Medline Complete MEDLINE with Full Text PubMed MEDLINE (Ovid) MEDLINE - Academic |
| DatabaseTitleList | MEDLINE - Academic MEDLINE |
| Database_xml | – sequence: 1 dbid: NPM name: PubMed url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 2 dbid: EIF name: MEDLINE url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search sourceTypes: Index Database – sequence: 3 dbid: UNPAY name: Unpaywall url: https://proxy.k.utb.cz/login?url=https://unpaywall.org/ sourceTypes: Open Access Repository |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Biology |
| EISSN | 1460-2059 1367-4811 |
| EndPage | 948 |
| ExternalDocumentID | 10.1093/bioinformatics/btx721 29121165 10_1093_bioinformatics_btx721 |
| Genre | Research Support, Non-U.S. Gov't Journal Article |
| GrantInformation_xml | – fundername: Council of Scientific and Industrial Research funderid: 10.13039/501100001412 – fundername: CSIR grantid: BSC0118; BSC0121 funderid: 10.13039/501100001332 |
| GroupedDBID | -~X .2P 5GY AAMVS ABJNI ABPTD ACGFS ADZXQ ALMA_UNASSIGNED_HOLDINGS F5P HW0 Q5Y RD5 ROZ TLC TN5 TOX WH7 --- -E4 .DC .I3 0R~ 23N 2WC 4.4 48X 53G 5WA 70D AAIJN AAIMJ AAJKP AAJQQ AAKPC AAMDB AAOGV AAPQZ AAPXW AAUQX AAVAP AAVLN AAYXX ABEJV ABEUO ABGNP ABIXL ABNKS ABPQP ABQLI ABWST ABXVV ABZBJ ACIWK ACPRK ACUFI ACUXJ ACYTK ADBBV ADEYI ADEZT ADFTL ADGKP ADGZP ADHKW ADHZD ADMLS ADOCK ADPDF ADRDM ADRTK ADVEK ADYVW ADZTZ AECKG AEGPL AEJOX AEKKA AEKSI AELWJ AEMDU AENEX AENZO AEPUE AETBJ AEWNT AFFZL AFGWE AFIYH AFOFC AFRAH AGINJ AGKEF AGQXC AGSYK AHMBA AHXPO AIJHB AJEEA AJEUX AKHUL AKWXX ALTZX ALUQC AMNDL APIBT APWMN ARIXL ASPBG AVWKF AXUDD AYOIW AZVOD BAWUL BAYMD BHONS BQDIO BQUQU BSWAC BTQHN C45 CDBKE CITATION CS3 CZ4 DAKXR DIK DILTD DU5 D~K EBD EBS EE~ EJD EMOBN F9B FEDTE FHSFR FLIZI FLUFQ FOEOM FQBLK GAUVT GJXCC GROUPED_DOAJ GX1 H13 H5~ HAR HZ~ IOX J21 JXSIZ KAQDR KOP KQ8 KSI KSN M-Z MK~ ML0 N9A NGC NLBLG NMDNZ NOMLY NU- O9- OAWHX ODMLO OJQWA OK1 OVD OVEED P2P PAFKI PEELM PQQKQ Q1. R44 RNS ROL RPM RUSNO RW1 RXO SV3 TEORI TJP TR2 W8F WOQ X7H YAYTL YKOAZ YXANX ZKX ~91 ~KM ADRIX AFXEN BCRHZ CGR CUY CVF ECM EIF M49 NPM ROX 7X8 .-4 .GJ 1TH ABEFU ABNGD ACUKT ADTOC AFFNX AGQPQ AI. AQDSO ATTQO AZFZN C1A CAG COF ELUNK HVGLF NTWIH NVLIB O0~ O~Y PB- RNI RZF RZO UNPAY VH1 ZGI |
| ID | FETCH-LOGICAL-c496t-caa4299d5e07f46200d0be92dd5132bdd5bf70c24c8db7fa433643db5a44b0c3 |
| IEDL.DBID | UNPAY |
| ISSN | 1367-4803 1367-4811 |
| IngestDate | Wed Oct 01 16:34:41 EDT 2025 Wed Oct 01 13:51:35 EDT 2025 Wed Feb 19 02:35:55 EST 2025 Tue Jul 01 03:27:23 EDT 2025 Thu Apr 24 22:51:18 EDT 2025 Wed Apr 02 07:03:23 EDT 2025 |
| IsDoiOpenAccess | true |
| IsOpenAccess | true |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | 6 |
| Language | English |
| License | This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/journals/pages/about_us/legal/notices) https://academic.oup.com/journals/pages/about_us/legal/notices |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-c496t-caa4299d5e07f46200d0be92dd5132bdd5bf70c24c8db7fa433643db5a44b0c3 |
| Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
| OpenAccessLink | https://proxy.k.utb.cz/login?url=https://academic.oup.com/bioinformatics/article-pdf/34/6/943/25119307/btx721.pdf |
| PMID | 29121165 |
| PQID | 1963276483 |
| PQPubID | 23479 |
| PageCount | 6 |
| ParticipantIDs | unpaywall_primary_10_1093_bioinformatics_btx721 proquest_miscellaneous_1963276483 pubmed_primary_29121165 crossref_primary_10_1093_bioinformatics_btx721 crossref_citationtrail_10_1093_bioinformatics_btx721 oup_primary_10_1093_bioinformatics_btx721 |
| ProviderPackageCode | CITATION AAYXX |
| PublicationCentury | 2000 |
| PublicationDate | 2018-03-15 |
| PublicationDateYYYYMMDD | 2018-03-15 |
| PublicationDate_xml | – month: 03 year: 2018 text: 2018-03-15 day: 15 |
| PublicationDecade | 2010 |
| PublicationPlace | England |
| PublicationPlace_xml | – name: England |
| PublicationTitle | Bioinformatics |
| PublicationTitleAlternate | Bioinformatics |
| PublicationYear | 2018 |
| Publisher | Oxford University Press |
| Publisher_xml | – name: Oxford University Press |
| References | Ellegren (2023012712472810200_btx721-B3) 2004; 5 Pickett (2023012712472810200_btx721-B13) 2016; 32 Lim (2023012712472810200_btx721-B11) 2013; 14 Pathak (2023012712472810200_btx721-B12) 2013; 10 Kumar (2023012712472810200_btx721-B10) 2010; 32 Kumar (2023012712472810200_btx721-B9) 2013; 4 Hearne (2023012712472810200_btx721-B6) 1992; 8 Thiel (2023012712472810200_btx721-B19) 2003; 106 Zietkiewicz (2023012712472810200_btx721-B21) 1994; 20 Cock (2023012712472810200_btx721-B2) 2009; 25 Ramamoorthy (2023012712472810200_btx721-B17) 2014; 551 Usdin (2023012712472810200_btx721-B20) 2008; 18 Pietrobono (2023012712472810200_btx721-B15) 2005; 14 Temnykh (2023012712472810200_btx721-B18) 2001; 11 Pickett (2023012712472810200_btx721-B14) 2017; 33 Benson (2023012712472810200_btx721-B1) 1999; 27 Kashi (2023012712472810200_btx721-B7) 1997; 13 Quinlan (2023012712472810200_btx721-B16) 2010; 26 Girgis (2023012712472810200_btx721-B4) 2013; 41 Greene (2023012712472810200_btx721-B5) 2007; 35 Kolpakov (2023012712472810200_btx721-B8) 2003; 31 |
| References_xml | – volume: 8 start-page: 288 year: 1992 ident: 2023012712472810200_btx721-B6 article-title: Microsatellites for linkage analysis of genetic-traits publication-title: Trends Genet doi: 10.1016/0168-9525(92)90137-S – volume: 32 start-page: 2707 year: 2016 ident: 2023012712472810200_btx721-B13 article-title: SA-SSR: a suffix array-based algorithm for exhaustive and efficient SSR discovery in large genetic sequences publication-title: Bioinformatics doi: 10.1093/bioinformatics/btw298 – volume: 13 start-page: 74 year: 1997 ident: 2023012712472810200_btx721-B7 article-title: Simple sequence repeats as a source of quantitative genetic variation publication-title: Trends Genet doi: 10.1016/S0168-9525(97)01008-1 – volume: 14 start-page: 267 year: 2005 ident: 2023012712472810200_btx721-B15 article-title: Molecular dissection of the events leading to inactivation of the FMR1 gene publication-title: Hum. Mol. Genet doi: 10.1093/hmg/ddi024 – volume: 27 start-page: 573 year: 1999 ident: 2023012712472810200_btx721-B1 article-title: Tandem repeats finder: a program to analyze DNA sequences publication-title: Nucleic Acids Res doi: 10.1093/nar/27.2.573 – volume: 31 start-page: 3672 year: 2003 ident: 2023012712472810200_btx721-B8 article-title: mreps: efficient and flexible detection of tandem repeats in DNA publication-title: Nucleic Acids Res doi: 10.1093/nar/gkg617 – volume: 11 start-page: 1441 year: 2001 ident: 2023012712472810200_btx721-B18 article-title: Computational and experimental analysis of microsatellites in rice (Oryza sativa L.): frequency, length variation, transposon associations, and genetic marker potential publication-title: Genome Res doi: 10.1101/gr.184001 – volume: 25 start-page: 1422 year: 2009 ident: 2023012712472810200_btx721-B2 article-title: Biopython: freely available Python tools for computational molecular biology and bioinformatics publication-title: Bioinformatics doi: 10.1093/bioinformatics/btp163 – volume: 5 start-page: 435 year: 2004 ident: 2023012712472810200_btx721-B3 article-title: Microsatellites: Simple sequences with complex evolution publication-title: Nat. Rev. Genet doi: 10.1038/nrg1348 – volume: 4 start-page: 1844 year: 2013 ident: 2023012712472810200_btx721-B9 article-title: GATA simple sequence repeats function as enhancer blocker boundaries publication-title: Nat. Commun doi: 10.1038/ncomms2872 – volume: 33 start-page: 3922 year: 2017 ident: 2023012712472810200_btx721-B14 article-title: Kmer-SSR: a fast and exhaustive SSR Search Algorithm publication-title: Bioinformatics doi: 10.1093/bioinformatics/btx538 – volume: 551 start-page: 167 year: 2014 ident: 2023012712472810200_btx721-B17 article-title: Length and sequence dependent accumulation of simple sequence repeats in vertebrates: potential role in genome organization and regulation publication-title: Gene doi: 10.1016/j.gene.2014.08.052 – volume: 106 start-page: 411 year: 2003 ident: 2023012712472810200_btx721-B19 article-title: Exploiting EST databases for the development and characterization of gene-derived SSR-markers in barley (Hordeum vulgare L.) publication-title: Theor Appl Genet doi: 10.1007/s00122-002-1031-0 – volume: 32 start-page: 165 year: 2010 ident: 2023012712472810200_btx721-B10 article-title: Repeat performance: how do genome packaging and regulation depend on simple sequence repeats? publication-title: Bioessays doi: 10.1002/bies.200900111 – volume: 10 start-page: 564 year: 2013 ident: 2023012712472810200_btx721-B12 article-title: AAGAG repeat RNA is an essential component of nuclear matrix in Drosophila publication-title: RNA. Biol doi: 10.4161/rna.24326 – volume: 35 start-page: 3383 year: 2007 ident: 2023012712472810200_btx721-B5 article-title: Repeat-induced epigenetic changes in intron 1 of the frataxin gene and its consequences in Friedreich ataxia publication-title: Nucleic Acids Res doi: 10.1093/nar/gkm271 – volume: 26 start-page: 841 year: 2010 ident: 2023012712472810200_btx721-B16 article-title: BEDTools: a flexible suite of utilities for comparing genomic features publication-title: Bioinformatics doi: 10.1093/bioinformatics/btq033 – volume: 41 start-page: e22. year: 2013 ident: 2023012712472810200_btx721-B4 article-title: MsDetector: toward a standard computational tool for DNA microsatellites detection publication-title: Nucleic Acids Res doi: 10.1093/nar/gks881 – volume: 14 start-page: 67 year: 2013 ident: 2023012712472810200_btx721-B11 article-title: Review of tandem repeat search tools: a systematic approach to evaluating algorithmic performance publication-title: Brief. Bioinform doi: 10.1093/bib/bbs023 – volume: 18 start-page: 1011 year: 2008 ident: 2023012712472810200_btx721-B20 article-title: The biological effects of simple tandem repeats: Lessons from the repeat expansion diseases publication-title: Genome Res doi: 10.1101/gr.070409.107 – volume: 20 start-page: 176 year: 1994 ident: 2023012712472810200_btx721-B21 article-title: Genome fingerprinting by simple sequence repeat (Ssr)-anchored polymerase chain-reaction amplification publication-title: Genomics doi: 10.1006/geno.1994.1151 |
| SSID | ssj0051444 ssj0005056 |
| Score | 2.441159 |
| Snippet | Abstract
Motivation
Microsatellites or Simple Sequence Repeats (SSRs) are short tandem repeats of DNA motifs present in all genomes. They have long been used... Microsatellites or Simple Sequence Repeats (SSRs) are short tandem repeats of DNA motifs present in all genomes. They have long been used for a variety of... |
| SourceID | unpaywall proquest pubmed crossref oup |
| SourceType | Open Access Repository Aggregation Database Index Database Enrichment Source Publisher |
| StartPage | 943 |
| SubjectTerms | Algorithms Genome, Human Genomics - methods Humans Microsatellite Repeats Sequence Analysis, DNA - methods Software |
| Title | PERF: an exhaustive algorithm for ultra-fast and efficient identification of microsatellites from large DNA sequences |
| URI | https://www.ncbi.nlm.nih.gov/pubmed/29121165 https://www.proquest.com/docview/1963276483 https://academic.oup.com/bioinformatics/article-pdf/34/6/943/25119307/btx721.pdf |
| UnpaywallVersion | publishedVersion |
| Volume | 34 |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVAFT databaseName: Open Access Digital Library customDbUrl: eissn: 1460-2059 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0005056 issn: 1367-4811 databaseCode: KQ8 dateStart: 19960101 isFulltext: true titleUrlDefault: http://grweb.coalliance.org/oadl/oadl.html providerName: Colorado Alliance of Research Libraries – providerCode: PRVEBS databaseName: Inspec with Full Text customDbUrl: eissn: 1460-2059 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0005056 issn: 1367-4811 databaseCode: ADMLS dateStart: 19980101 isFulltext: true titleUrlDefault: https://www.ebsco.com/products/research-databases/inspec-full-text providerName: EBSCOhost – providerCode: PRVBFR databaseName: Free Medical Journals customDbUrl: eissn: 1460-2059 dateEnd: 20241101 omitProxy: true ssIdentifier: ssj0005056 issn: 1367-4811 databaseCode: DIK dateStart: 19960101 isFulltext: true titleUrlDefault: http://www.freemedicaljournals.com providerName: Flying Publisher – providerCode: PRVFQY databaseName: GFMER Free Medical Journals customDbUrl: eissn: 1460-2059 dateEnd: 20241101 omitProxy: true ssIdentifier: ssj0005056 issn: 1367-4811 databaseCode: GX1 dateStart: 19960101 isFulltext: true titleUrlDefault: http://www.gfmer.ch/Medical_journals/Free_medical.php providerName: Geneva Foundation for Medical Education and Research – providerCode: PRVAQN databaseName: PubMed Central customDbUrl: eissn: 1460-2059 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0005056 issn: 1367-4811 databaseCode: RPM dateStart: 20070101 isFulltext: true titleUrlDefault: https://www.ncbi.nlm.nih.gov/pmc/ providerName: National Library of Medicine – providerCode: PRVOVD databaseName: Journals@Ovid LWW All Open Access Journal Collection Rolling customDbUrl: eissn: 1460-2059 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0005056 issn: 1367-4811 databaseCode: OVEED dateStart: 20010101 isFulltext: true titleUrlDefault: http://ovidsp.ovid.com/ providerName: Ovid – providerCode: PRVASL databaseName: Oxford Journals Open Access Collection customDbUrl: eissn: 1460-2059 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0005056 issn: 1367-4811 databaseCode: TOX dateStart: 19850101 isFulltext: true titleUrlDefault: https://academic.oup.com/journals/ providerName: Oxford University Press – providerCode: PRVASL databaseName: Oxford Journals Open Access Collection customDbUrl: eissn: 1460-2059 dateEnd: 20220930 omitProxy: true ssIdentifier: ssj0005056 issn: 1367-4811 databaseCode: TOX dateStart: 19850101 isFulltext: true titleUrlDefault: https://academic.oup.com/journals/ providerName: Oxford University Press |
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1bb9MwFD7aOiHgYdwGdIPJSLzwkKRJbKfhrRorExKlQp1UniLfwiqypLSJ2Pj1HOdSrXthSLwkluJLbB_7fJbP-Q7A2yCmNTGco7hKHUol7oMKUzpigvPUKMGsN_LnCT87p5_mbL4D084XRrRW4W7n0iAXRUshammLvXY8naVOvZB63Itp6FmYHKOwerK8whONix93YY8zROc92DufTEffGveryKHDOlhym_b9zqknDm-31NS1pa62XOBuINGHcL_Kl-L6l8iyG9pp_Ah-dv1qjFJ-uFUpXfX7FuXj_-z4Y9hvoSwZNUWewI7Jn8K9Jrjl9TOopqdfx--JyIm5urDsQrivEpF9L1aL8uKSYJukysqVcFKxLjGbJqams0AtSBa6NWKq5YYUKbm0hoNrUfOHIjom1i2GZNaMnXyYjMjGIvwAZuPT2cmZ0wZ5cBSNeekoIaxK1MwMopRyXLR6IE0caM3woCzxJdNooAKqhlpGqaBhiCBKSyZQsAYqfA69vMjNSyCRlL7UnBlhEJbIeCgV1zowCAlT4euwD7SbykS1BOg2DkeWNBfxYbI95EkzqH1wN8WWDQPI3wq8wzm8a943nTQluK7tZY3ITVGtE7szBhGnQ_zvF42YbaoMYkvMx1kfvI3c3a29w38ucQQPECPWbpg-ewW9clWZ14jDSnkMux_nPj5nX-bH7TL7A5igPZQ |
| linkProvider | Unpaywall |
| linkToUnpaywall | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1La9wwEB7SDaXtIX23mz5QoZcebK9tSV73tjRZQqHLUhJIT0bPZKljb3dtmvTXd-THks0lKfRkgTWWJY00n9DMNwAfo5Q2xHCe4sp6lErcBxWWdMIE59YowVw08rcZPzqhX0_Z6Q7M-1gY0XmF-31Ig1yUHYWooy0OuvH0ltoGMQ14kNI4cDA5RWUNZHWJJxofX96DXc4QnQ9g92Q2n_xow68Sj46bZMldOQz7oJ40vtlS-60tc7UVAncNiT6CB3WxFFe_RZ5fs07Tx_Cr71frlPLTryvpqz83KB__Z8efwF4HZcmkFXkKO6Z4Bvfb5JZXz6GeH36ffiaiIOby3LEL4b5KRH5WrhbV-QXBNkmdVyvhWbGusJompqGzQCtIFrpzYmr0hpSWXDjHwbVo-EMRHRMXFkNy58ZODmYTsvEIfwHH08PjL0del-TBUzTllaeEcCZRMzNKLOW4aPVImjTSmuFBWeJD2mSkIqrGWiZW0DhGEKUlE6hYIxW_hEFRFuY1kETKUGrOjDAIS2Q6loprHRmEhFaEOh4C7acyUx0BusvDkWftRXycbQ951g7qEPyN2LJlALlN4BPO4V3rfui1KcN17S5rRGHKep25nTFKOB3jf79q1WzzySh1xHycDSHY6N3d2tv_Z4k38BAxYhOGGbK3MKhWtXmHOKyS77ul9RfHKTuH |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=PERF%3A+an+exhaustive+algorithm+for+ultra-fast+and+efficient+identification+of+microsatellites+from+large+DNA+sequences&rft.jtitle=Bioinformatics+%28Oxford%2C+England%29&rft.au=Avvaru%2C+Akshay+Kumar&rft.au=Sowpati%2C+Divya+Tej&rft.au=Mishra%2C+Rakesh+Kumar&rft.date=2018-03-15&rft.issn=1367-4811&rft.eissn=1367-4811&rft.volume=34&rft.issue=6&rft.spage=943&rft_id=info:doi/10.1093%2Fbioinformatics%2Fbtx721&rft.externalDBID=NO_FULL_TEXT |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1367-4803&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1367-4803&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1367-4803&client=summon |