HMMSplicer: A Tool for Efficient and Sensitive Discovery of Known and Novel Splice Junctions in RNA-Seq Data
High-throughput sequencing of an organism's transcriptome, or RNA-Seq, is a valuable and versatile new strategy for capturing snapshots of gene expression. However, transcriptome sequencing creates a new class of alignment problem: mapping short reads that span exon-exon junctions back to the r...
        Saved in:
      
    
          | Published in | PloS one Vol. 5; no. 11; p. e13875 | 
|---|---|
| Main Authors | , , | 
| Format | Journal Article | 
| Language | English | 
| Published | 
        United States
          Public Library of Science
    
        08.11.2010
     Public Library of Science (PLoS)  | 
| Subjects | |
| Online Access | Get full text | 
| ISSN | 1932-6203 1932-6203  | 
| DOI | 10.1371/journal.pone.0013875 | 
Cover
| Abstract | High-throughput sequencing of an organism's transcriptome, or RNA-Seq, is a valuable and versatile new strategy for capturing snapshots of gene expression. However, transcriptome sequencing creates a new class of alignment problem: mapping short reads that span exon-exon junctions back to the reference genome, especially in the case where a splice junction is previously unknown.
Here we introduce HMMSplicer, an accurate and efficient algorithm for discovering canonical and non-canonical splice junctions in short read datasets. HMMSplicer identifies more splice junctions than currently available algorithms when tested on publicly available A. thaliana, P. falciparum, and H. sapiens datasets without a reduction in specificity.
HMMSplicer was found to perform especially well in compact genomes and on genes with low expression levels, alternative splice isoforms, or non-canonical splice junctions. Because HHMSplicer does not rely on pre-built gene models, the products of inexact splicing are also detected. For H. sapiens, we find 3.6% of 3' splice sites and 1.4% of 5' splice sites are inexact, typically differing by 3 bases in either direction. In addition, HMMSplicer provides a score for every predicted junction allowing the user to set a threshold to tune false positive rates depending on the needs of the experiment. HMMSplicer is implemented in Python. Code and documentation are freely available at http://derisilab.ucsf.edu/software/hmmsplicer. | 
    
|---|---|
| AbstractList | High-throughput sequencing of an organism's transcriptome, or RNA-Seq, is a valuable and versatile new strategy for capturing snapshots of gene expression. However, transcriptome sequencing creates a new class of alignment problem: mapping short reads that span exon-exon junctions back to the reference genome, especially in the case where a splice junction is previously unknown. Here we introduce HMMSplicer, an accurate and efficient algorithm for discovering canonical and non-canonical splice junctions in short read datasets. HMMSplicer identifies more splice junctions than currently available algorithms when tested on publicly available A. thaliana, P. falciparum, and H. sapiens datasets without a reduction in specificity. HMMSplicer was found to perform especially well in compact genomes and on genes with low expression levels, alternative splice isoforms, or non-canonical splice junctions. Because HHMSplicer does not rely on pre-built gene models, the products of inexact splicing are also detected. For H. sapiens, we find 3.6% of 3' splice sites and 1.4% of 5' splice sites are inexact, typically differing by 3 bases in either direction. In addition, HMMSplicer provides a score for every predicted junction allowing the user to set a threshold to tune false positive rates depending on the needs of the experiment. HMMSplicer is implemented in Python. Code and documentation are freely available at http://derisilab.ucsf.edu/software/hmmsplicer. High-throughput sequencing of an organism's transcriptome, or RNA-Seq, is a valuable and versatile new strategy for capturing snapshots of gene expression. However, transcriptome sequencing creates a new class of alignment problem: mapping short reads that span exon-exon junctions back to the reference genome, especially in the case where a splice junction is previously unknown.BACKGROUNDHigh-throughput sequencing of an organism's transcriptome, or RNA-Seq, is a valuable and versatile new strategy for capturing snapshots of gene expression. However, transcriptome sequencing creates a new class of alignment problem: mapping short reads that span exon-exon junctions back to the reference genome, especially in the case where a splice junction is previously unknown.Here we introduce HMMSplicer, an accurate and efficient algorithm for discovering canonical and non-canonical splice junctions in short read datasets. HMMSplicer identifies more splice junctions than currently available algorithms when tested on publicly available A. thaliana, P. falciparum, and H. sapiens datasets without a reduction in specificity.METHODOLOGY/PRINCIPAL FINDINGSHere we introduce HMMSplicer, an accurate and efficient algorithm for discovering canonical and non-canonical splice junctions in short read datasets. HMMSplicer identifies more splice junctions than currently available algorithms when tested on publicly available A. thaliana, P. falciparum, and H. sapiens datasets without a reduction in specificity.HMMSplicer was found to perform especially well in compact genomes and on genes with low expression levels, alternative splice isoforms, or non-canonical splice junctions. Because HHMSplicer does not rely on pre-built gene models, the products of inexact splicing are also detected. For H. sapiens, we find 3.6% of 3' splice sites and 1.4% of 5' splice sites are inexact, typically differing by 3 bases in either direction. In addition, HMMSplicer provides a score for every predicted junction allowing the user to set a threshold to tune false positive rates depending on the needs of the experiment. HMMSplicer is implemented in Python. Code and documentation are freely available at http://derisilab.ucsf.edu/software/hmmsplicer.CONCLUSIONS/SIGNIFICANCEHMMSplicer was found to perform especially well in compact genomes and on genes with low expression levels, alternative splice isoforms, or non-canonical splice junctions. Because HHMSplicer does not rely on pre-built gene models, the products of inexact splicing are also detected. For H. sapiens, we find 3.6% of 3' splice sites and 1.4% of 5' splice sites are inexact, typically differing by 3 bases in either direction. In addition, HMMSplicer provides a score for every predicted junction allowing the user to set a threshold to tune false positive rates depending on the needs of the experiment. HMMSplicer is implemented in Python. Code and documentation are freely available at http://derisilab.ucsf.edu/software/hmmsplicer. Background High-throughput sequencing of an organism's transcriptome, or RNA-Seq, is a valuable and versatile new strategy for capturing snapshots of gene expression. However, transcriptome sequencing creates a new class of alignment problem: mapping short reads that span exon-exon junctions back to the reference genome, especially in the case where a splice junction is previously unknown. Methodology/Principal Findings Here we introduce HMMSplicer, an accurate and efficient algorithm for discovering canonical and non-canonical splice junctions in short read datasets. HMMSplicer identifies more splice junctions than currently available algorithms when tested on publicly available A. thaliana, P. falciparum, and H. sapiens datasets without a reduction in specificity. Conclusions/Significance HMMSplicer was found to perform especially well in compact genomes and on genes with low expression levels, alternative splice isoforms, or non-canonical splice junctions. Because HHMSplicer does not rely on pre-built gene models, the products of inexact splicing are also detected. For H. sapiens, we find 3.6% of 3' splice sites and 1.4% of 5' splice sites are inexact, typically differing by 3 bases in either direction. In addition, HMMSplicer provides a score for every predicted junction allowing the user to set a threshold to tune false positive rates depending on the needs of the experiment. HMMSplicer is implemented in Python. Code and documentation are freely available at High-throughput sequencing of an organism's transcriptome, or RNA-Seq, is a valuable and versatile new strategy for capturing snapshots of gene expression. However, transcriptome sequencing creates a new class of alignment problem: mapping short reads that span exon-exon junctions back to the reference genome, especially in the case where a splice junction is previously unknown. Here we introduce HMMSplicer, an accurate and efficient algorithm for discovering canonical and non-canonical splice junctions in short read datasets. HMMSplicer identifies more splice junctions than currently available algorithms when tested on publicly available A. thaliana, P. falciparum, and H. sapiens datasets without a reduction in specificity. HMMSplicer was found to perform especially well in compact genomes and on genes with low expression levels, alternative splice isoforms, or non-canonical splice junctions. Because HHMSplicer does not rely on pre-built gene models, the products of inexact splicing are also detected. For H. sapiens, we find 3.6% of 3' splice sites and 1.4% of 5' splice sites are inexact, typically differing by 3 bases in either direction. In addition, HMMSplicer provides a score for every predicted junction allowing the user to set a threshold to tune false positive rates depending on the needs of the experiment. HMMSplicer is implemented in Python. Code and documentation are freely available at http://derisilab.ucsf.edu/software/hmmsplicer. Background High-throughput sequencing of an organism's transcriptome, or RNA-Seq, is a valuable and versatile new strategy for capturing snapshots of gene expression. However, transcriptome sequencing creates a new class of alignment problem: mapping short reads that span exon-exon junctions back to the reference genome, especially in the case where a splice junction is previously unknown. Methodology/Principal Findings Here we introduce HMMSplicer, an accurate and efficient algorithm for discovering canonical and non-canonical splice junctions in short read datasets. HMMSplicer identifies more splice junctions than currently available algorithms when tested on publicly available A. thaliana, P. falciparum, and H. sapiens datasets without a reduction in specificity. Conclusions/Significance HMMSplicer was found to perform especially well in compact genomes and on genes with low expression levels, alternative splice isoforms, or non-canonical splice junctions. Because HHMSplicer does not rely on pre-built gene models, the products of inexact splicing are also detected. For H. sapiens, we find 3.6% of 3′ splice sites and 1.4% of 5′ splice sites are inexact, typically differing by 3 bases in either direction. In addition, HMMSplicer provides a score for every predicted junction allowing the user to set a threshold to tune false positive rates depending on the needs of the experiment. HMMSplicer is implemented in Python. Code and documentation are freely available at http://derisilab.ucsf.edu/software/hmmsplicer. Background High-throughput sequencing of an organism's transcriptome, or RNA-Seq, is a valuable and versatile new strategy for capturing snapshots of gene expression. However, transcriptome sequencing creates a new class of alignment problem: mapping short reads that span exon-exon junctions back to the reference genome, especially in the case where a splice junction is previously unknown. Methodology/Principal Findings Here we introduce HMMSplicer, an accurate and efficient algorithm for discovering canonical and non-canonical splice junctions in short read datasets. HMMSplicer identifies more splice junctions than currently available algorithms when tested on publicly available A. thaliana, P. falciparum, and H. sapiens datasets without a reduction in specificity. Conclusions/Significance HMMSplicer was found to perform especially well in compact genomes and on genes with low expression levels, alternative splice isoforms, or non-canonical splice junctions. Because HHMSplicer does not rely on pre-built gene models, the products of inexact splicing are also detected. For H. sapiens, we find 3.6% of 3′ splice sites and 1.4% of 5′ splice sites are inexact, typically differing by 3 bases in either direction. In addition, HMMSplicer provides a score for every predicted junction allowing the user to set a threshold to tune false positive rates depending on the needs of the experiment. HMMSplicer is implemented in Python. Code and documentation are freely available at http://derisilab.ucsf.edu/software/hmmsplicer.  | 
    
| Audience | Academic | 
    
| Author | Dimon, Michelle T. Sorber, Katherine DeRisi, Joseph L.  | 
    
| AuthorAffiliation | 2 Biological and Medical Informatics Program, University of California San Francisco, San Francisco, California, United States of America 3 Howard Hughes Medical Institute, Bethesda, Maryland, United States of America University of North Carolina at Charlotte, United States of America 1 Department of Biochemistry and Biophysics, University of California San Francisco, San Francisco, California, United States of America  | 
    
| AuthorAffiliation_xml | – name: 3 Howard Hughes Medical Institute, Bethesda, Maryland, United States of America – name: 1 Department of Biochemistry and Biophysics, University of California San Francisco, San Francisco, California, United States of America – name: University of North Carolina at Charlotte, United States of America – name: 2 Biological and Medical Informatics Program, University of California San Francisco, San Francisco, California, United States of America  | 
    
| Author_xml | – sequence: 1 givenname: Michelle T. surname: Dimon fullname: Dimon, Michelle T. – sequence: 2 givenname: Katherine surname: Sorber fullname: Sorber, Katherine – sequence: 3 givenname: Joseph L. surname: DeRisi fullname: DeRisi, Joseph L.  | 
    
| BackLink | https://www.ncbi.nlm.nih.gov/pubmed/21079731$$D View this record in MEDLINE/PubMed | 
    
| BookMark | eNqNk11v0zAUhiM0xD7gHyCwhATiosUfcRLvAqnaBivsQ1oHt5bj2J0r1-7ipKP_HqftpnaaYMqFrZPnvD55X2c_2XHeqSR5i2AfkRx9mfi2dsL2Z7HchxCRIqcvkj3ECO5lGJKdjf1ush_CBEJKiix7lexiBHOWE7SX2NPz89HMGqnqQzAA195boH0NTrQ20ijXAOEqMFIumMbMFTg2Qfq5qhfAa_DT-Tu3BC5izYKVEPjROtkY7wIwDlxdDHojdQuORSNeJy-1sEG9Wa8Hya9vJ9dHp72zy-_Do8FZT-aYND2CUJlmkhaUZqxEuoQFKgiTotSQSFixnEJBWCp1RaWCWmKJWaRYpqDKsCYHyfuV7sz6wNdGBY4wo4SmmMFIDFdE5cWEz2ozFfWCe2H4suDrMRd1Y6RVvMQVhRlFKcpRmmWMSY0pKbtdLMoyatGVVutmYnEnrH0QRJB3Wd2PwLus-Dqr2Pd1PWVbTlUlo9m1sFvDbL9x5oaP_ZzjaEBGcBT4tBao_W2rQsOnMR1lrXDKt4EXWcooZOj_ZM5Y3nnTkR8ekU_bt6bGIjpknPZxQNlp8kGakyJ-HySR6j9BxadSUyOjFdrE-lbD562GyDTqTzMWbQh8OLp6Pnv5e5v9uMHeKGGbm-Btu7yk2-C7zUgesrj_XyKQrgBZ-xBqpZ8b9eGjNmka0R0fHTH2381_AYk6Ol8 | 
    
| CitedBy_id | crossref_primary_10_4252_wjsc_v13_i10_1394 crossref_primary_10_1186_1471_2164_15_150 crossref_primary_10_1186_1471_2164_12_82 crossref_primary_10_1093_nar_gkr1171 crossref_primary_10_7554_eLife_01179 crossref_primary_10_1093_nar_gks1311 crossref_primary_10_1371_journal_ppat_1003847 crossref_primary_10_1093_nar_gkq1223 crossref_primary_10_1002_wrna_1121 crossref_primary_10_5808_GI_2012_10_1_58 crossref_primary_10_1038_srep10940 crossref_primary_10_1093_nar_gkr1248 crossref_primary_10_1371_journal_pone_0095057 crossref_primary_10_1093_bioinformatics_btr712 crossref_primary_10_1016_j_cmpb_2015_02_004 crossref_primary_10_1111_j_1365_3024_2011_01340_x crossref_primary_10_1371_journal_pcbi_1003517 crossref_primary_10_1109_TCBB_2014_2304294 crossref_primary_10_1016_j_ygeno_2011_12_003 crossref_primary_10_1590_1678_4324_2016160240 crossref_primary_10_1016_j_biopsych_2013_08_028 crossref_primary_10_1128_MCB_06267_11 crossref_primary_10_1007_s11427_011_4255_x crossref_primary_10_1093_nar_gkv242 crossref_primary_10_1155_2015_831352 crossref_primary_10_1101_gad_304600_117 crossref_primary_10_1002_0471250953_bi1101s36 crossref_primary_10_1093_gbe_evz009 crossref_primary_10_1093_nar_gks1271 crossref_primary_10_1093_bioinformatics_btr427  | 
    
| Cites_doi | 10.1101/gr.078212.108 10.1186/1471-2164-8-255 10.1038/nature07002 10.1093/bioinformatics/btp120 10.1038/nmeth.1226 10.1016/j.cell.2008.03.029 10.1093/bioinformatics/btp336 10.1186/gb-2010-11-3-r34 10.1093/nar/gki025 10.1126/science.1158441 10.1038/nature07509 10.1016/j.neuron.2010.05.007 10.1214/aoms/1177697196 10.1186/gb-2009-10-3-r25 10.1093/nar/gkh045 10.1016/S0092-8674(00)81360-4 10.1038/nature01097 10.1016/j.gene.2005.07.027 10.1093/nar/gkn425 10.1093/hmg/ddp473 10.1186/gb-2009-10-10-242 10.1093/nar/27.15.3219 10.1016/S0092-8674(01)00611-0 10.1093/bioinformatics/btn300 10.1093/nar/gkq041 10.1038/ng.259 10.1093/bioinformatics/btq206 10.1038/nbt.1621 10.1371/journal.pcbi.1000598 10.1371/journal.pone.0003495 10.1101/gr.229102. Article published online before print in May 2002 10.1038/nature08909 10.1101/gr.849004 10.1016/S0092-8674(00)81361-6 10.1016/j.cell.2009.02.009 10.1101/gr.229202. Article published online before March 2002 10.1038/35048692 10.1038/nrg2484 10.1093/nar/gkj031 10.1093/bioinformatics/btp324  | 
    
| ContentType | Journal Article | 
    
| Copyright | COPYRIGHT 2010 Public Library of Science 2010 Dimon et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License: https://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. Dimon et al. 2010  | 
    
| Copyright_xml | – notice: COPYRIGHT 2010 Public Library of Science – notice: 2010 Dimon et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License: https://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. – notice: Dimon et al. 2010  | 
    
| DBID | AAYXX CITATION CGR CUY CVF ECM EIF NPM IOV ISR 3V. 7QG 7QL 7QO 7RV 7SN 7SS 7T5 7TG 7TM 7U9 7X2 7X7 7XB 88E 8AO 8C1 8FD 8FE 8FG 8FH 8FI 8FJ 8FK ABJCF ABUWG AEUYN AFKRA ARAPS ATCPS AZQEC BBNVY BENPR BGLVJ BHPHI C1K CCPQU D1I DWQXO FR3 FYUFA GHDGH GNUQQ H94 HCIFZ K9. KB. KB0 KL. L6V LK8 M0K M0S M1P M7N M7P M7S NAPCQ P5Z P62 P64 PATMY PDBOC PHGZM PHGZT PIMPY PJZUB PKEHL PPXIY PQEST PQGLB PQQKQ PQUKI PRINS PTHSS PYCSY RC3 7X8 5PM ADTOC UNPAY DOA  | 
    
| DOI | 10.1371/journal.pone.0013875 | 
    
| DatabaseName | CrossRef Medline MEDLINE MEDLINE (Ovid) MEDLINE MEDLINE PubMed Gale In Context: Opposing Viewpoints Gale in Context: Science ProQuest Central (Corporate) Animal Behavior Abstracts Bacteriology Abstracts (Microbiology B) Biotechnology Research Abstracts Nursing & Allied Health Database Ecology Abstracts Entomology Abstracts (Full archive) Immunology Abstracts Meteorological & Geoastrophysical Abstracts Nucleic Acids Abstracts Virology and AIDS Abstracts ProQuest Agricultural Science Health & Medical Collection (Proquest) ProQuest Central (purchase pre-March 2016) Medical Database (Alumni Edition) ProQuest Pharma Collection Public Health Database (ProQuest) Technology Research Database ProQuest SciTech Collection ProQuest Technology Collection ProQuest Natural Science Journals Hospital Premium Collection Hospital Premium Collection (Alumni Edition) ProQuest Central (Alumni) (purchase pre-March 2016) ProQuest Materials Science & Engineering ProQuest Central (Alumni) ProQuest One Sustainability ProQuest Central UK/Ireland Advanced Technologies & Computer Science Collection Agricultural & Environmental Science Collection ProQuest Central Essentials Biological Science Database ProQuest Central Technology collection Natural Science Collection Environmental Sciences and Pollution Management ProQuest One Community College ProQuest Materials Science Collection ProQuest Central Engineering Research Database Health Research Premium Collection Health Research Premium Collection (Alumni) ProQuest Central Student AIDS and Cancer Research Abstracts SciTech Premium Collection (Proquest) ProQuest Health & Medical Complete (Alumni) Materials Science Database (Proquest) Nursing & Allied Health Database (Alumni Edition) Meteorological & Geoastrophysical Abstracts - Academic ProQuest Engineering Collection Biological Sciences Agricultural Science Database Health & Medical Collection (Alumni Edition) Medical Database Algology Mycology and Protozoology Abstracts (Microbiology C) Biological Science Database Engineering Database (Proquest) Nursing & Allied Health Premium ProQuest Central Advanced Technologies & Aerospace Database (via ProQuest) ProQuest Advanced Technologies & Aerospace Collection Biotechnology and BioEngineering Abstracts Environmental Science Database Materials Science Collection ProQuest Central Premium ProQuest One Academic Publicly Available Content Database ProQuest Health & Medical Research Collection ProQuest One Academic Middle East (New) ProQuest One Health & Nursing ProQuest One Academic Eastern Edition (DO NOT USE) ProQuest One Applied & Life Sciences ProQuest One Academic ProQuest One Academic UKI Edition ProQuest Central China Engineering Collection Environmental Science Collection Genetics Abstracts MEDLINE - Academic PubMed Central (Full Participant titles) Unpaywall for CDI: Periodical Content Unpaywall DOAJ Directory of Open Access Journals  | 
    
| DatabaseTitle | CrossRef MEDLINE Medline Complete MEDLINE with Full Text PubMed MEDLINE (Ovid) Agricultural Science Database Publicly Available Content Database ProQuest Central Student ProQuest Advanced Technologies & Aerospace Collection ProQuest Central Essentials Nucleic Acids Abstracts SciTech Premium Collection ProQuest Central China Environmental Sciences and Pollution Management ProQuest One Applied & Life Sciences ProQuest One Sustainability Health Research Premium Collection Meteorological & Geoastrophysical Abstracts Natural Science Collection Health & Medical Research Collection Biological Science Collection ProQuest Central (New) ProQuest Medical Library (Alumni) Engineering Collection Advanced Technologies & Aerospace Collection Engineering Database Virology and AIDS Abstracts ProQuest Biological Science Collection ProQuest One Academic Eastern Edition Agricultural Science Collection ProQuest Hospital Collection ProQuest Technology Collection Health Research Premium Collection (Alumni) Biological Science Database Ecology Abstracts ProQuest Hospital Collection (Alumni) Biotechnology and BioEngineering Abstracts Environmental Science Collection Entomology Abstracts Nursing & Allied Health Premium ProQuest Health & Medical Complete ProQuest One Academic UKI Edition Environmental Science Database ProQuest Nursing & Allied Health Source (Alumni) Engineering Research Database ProQuest One Academic Meteorological & Geoastrophysical Abstracts - Academic ProQuest One Academic (New) Technology Collection Technology Research Database ProQuest One Academic Middle East (New) Materials Science Collection ProQuest Health & Medical Complete (Alumni) ProQuest Central (Alumni Edition) ProQuest One Community College ProQuest One Health & Nursing ProQuest Natural Science Collection ProQuest Pharma Collection ProQuest Central ProQuest Health & Medical Research Collection Genetics Abstracts ProQuest Engineering Collection Biotechnology Research Abstracts Health and Medicine Complete (Alumni Edition) ProQuest Central Korea Bacteriology Abstracts (Microbiology B) Algology Mycology and Protozoology Abstracts (Microbiology C) Agricultural & Environmental Science Collection AIDS and Cancer Research Abstracts Materials Science Database ProQuest Materials Science Collection ProQuest Public Health ProQuest Nursing & Allied Health Source ProQuest SciTech Collection Advanced Technologies & Aerospace Database ProQuest Medical Library Animal Behavior Abstracts Materials Science & Engineering Collection Immunology Abstracts ProQuest Central (Alumni) MEDLINE - Academic  | 
    
| DatabaseTitleList | MEDLINE - Academic Nucleic Acids Abstracts MEDLINE Agricultural Science Database  | 
    
| Database_xml | – sequence: 1 dbid: DOA name: DOAJ Open Access Full Text url: https://www.doaj.org/ sourceTypes: Open Website – sequence: 2 dbid: NPM name: PubMed url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 3 dbid: EIF name: MEDLINE url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search sourceTypes: Index Database – sequence: 4 dbid: UNPAY name: Unpaywall url: https://proxy.k.utb.cz/login?url=https://unpaywall.org/ sourceTypes: Open Access Repository – sequence: 5 dbid: 8FG name: ProQuest Technology Collection url: https://search.proquest.com/technologycollection1 sourceTypes: Aggregation Database  | 
    
| DeliveryMethod | fulltext_linktorsrc | 
    
| Discipline | Sciences (General) | 
    
| DocumentTitleAlternate | HMMSplicer | 
    
| EISSN | 1932-6203 | 
    
| ExternalDocumentID | 1295354290 oai_doaj_org_article_b2d50651417146699cf253b6699514cb 10.1371/journal.pone.0013875 PMC2975632 2898643651 A473838703 21079731 10_1371_journal_pone_0013875  | 
    
| Genre | Research Support, Non-U.S. Gov't Journal Article  | 
    
| GeographicLocations | United States--US San Francisco California California  | 
    
| GeographicLocations_xml | – name: San Francisco California – name: United States--US – name: California  | 
    
| GrantInformation_xml | – fundername: Howard Hughes Medical Institute | 
    
| GroupedDBID | --- 123 29O 2WC 53G 5VS 7RV 7X2 7X7 7XC 88E 8AO 8C1 8CJ 8FE 8FG 8FH 8FI 8FJ A8Z AAFWJ AAUCC AAWOE AAYXX ABDBF ABIVO ABJCF ABUWG ACGFO ACIHN ACIWK ACPRK ACUHS ADBBV ADRAZ AEAQA AENEX AEUYN AFKRA AFPKN AFRAH AHMBA ALMA_UNASSIGNED_HOLDINGS AOIJS APEBS ARAPS ATCPS BAWUL BBNVY BCNDV BENPR BGLVJ BHPHI BKEYQ BPHCQ BVXVI BWKFM CCPQU CITATION CS3 D1I D1J D1K DIK DU5 E3Z EAP EAS EBD EMOBN ESTFP ESX EX3 F5P FPL FYUFA GROUPED_DOAJ GX1 HCIFZ HH5 HMCUK HYE IAO IEA IGS IHR IHW INH INR IOV IPNFZ IPY ISE ISR ITC K6- KB. KQ8 L6V LK5 LK8 M0K M1P M48 M7P M7R M7S M~E NAPCQ O5R O5S OK1 OVT P2P P62 PATMY PDBOC PHGZM PHGZT PIMPY PJZUB PPXIY PQGLB PQQKQ PROAC PSQYO PTHSS PUEGO PYCSY RIG RNS RPM SV3 TR2 UKHRP WOQ WOW ~02 ~KM 3V. ALIPV BBORY CGR CUY CVF ECM EIF NPM 7QG 7QL 7QO 7SN 7SS 7T5 7TG 7TM 7U9 7XB 8FD 8FK AZQEC C1K DWQXO FR3 GNUQQ H94 K9. KL. M7N P64 PKEHL PQEST PQUKI PRINS RC3 7X8 5PM ACCTH ADTOC AFFHD BBTPI PV9 RZL UNPAY AAPBV ABPTK N95  | 
    
| ID | FETCH-LOGICAL-c723t-311b46c585569b1fb081839cabf03c0d9750a394cfd5ce0fc2c29fb096e0e62f3 | 
    
| IEDL.DBID | M48 | 
    
| ISSN | 1932-6203 | 
    
| IngestDate | Sun Jul 02 11:04:05 EDT 2023 Fri Oct 03 12:52:37 EDT 2025 Wed Oct 29 11:58:37 EDT 2025 Wed Sep 10 06:00:52 EDT 2025 Thu Oct 02 06:24:55 EDT 2025 Wed Oct 01 14:59:59 EDT 2025 Tue Oct 07 06:35:20 EDT 2025 Mon Oct 20 22:19:02 EDT 2025 Mon Oct 20 16:27:11 EDT 2025 Thu Oct 16 15:12:25 EDT 2025 Thu Oct 16 15:33:09 EDT 2025 Thu May 22 21:20:39 EDT 2025 Wed Feb 19 01:49:05 EST 2025 Thu Apr 24 22:58:54 EDT 2025 Wed Oct 01 02:29:03 EDT 2025  | 
    
| IsDoiOpenAccess | true | 
    
| IsOpenAccess | true | 
    
| IsPeerReviewed | true | 
    
| IsScholarly | true | 
    
| Issue | 11 | 
    
| Language | English | 
    
| License | This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. cc-by Creative Commons Attribution License  | 
    
| LinkModel | DirectLink | 
    
| MergedId | FETCHMERGED-LOGICAL-c723t-311b46c585569b1fb081839cabf03c0d9750a394cfd5ce0fc2c29fb096e0e62f3 | 
    
| Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23 ObjectType-Article-2 ObjectType-Feature-1 Conceived and designed the experiments: MTD KS JLD. Performed the experiments: MTD KS. Analyzed the data: MTD. Wrote the paper: MTD KS JLD.  | 
    
| OpenAccessLink | http://journals.scholarsportal.info/openUrl.xqy?doi=10.1371/journal.pone.0013875 | 
    
| PMID | 21079731 | 
    
| PQID | 1295354290 | 
    
| PQPubID | 1436336 | 
    
| PageCount | e13875 | 
    
| ParticipantIDs | plos_journals_1295354290 doaj_primary_oai_doaj_org_article_b2d50651417146699cf253b6699514cb unpaywall_primary_10_1371_journal_pone_0013875 pubmedcentral_primary_oai_pubmedcentral_nih_gov_2975632 proquest_miscellaneous_864950912 proquest_miscellaneous_799795352 proquest_journals_1295354290 gale_infotracmisc_A473838703 gale_infotracacademiconefile_A473838703 gale_incontextgauss_ISR_A473838703 gale_incontextgauss_IOV_A473838703 gale_healthsolutions_A473838703 pubmed_primary_21079731 crossref_primary_10_1371_journal_pone_0013875 crossref_citationtrail_10_1371_journal_pone_0013875  | 
    
| ProviderPackageCode | CITATION AAYXX  | 
    
| PublicationCentury | 2000 | 
    
| PublicationDate | 2010-11-08 | 
    
| PublicationDateYYYYMMDD | 2010-11-08 | 
    
| PublicationDate_xml | – month: 11 year: 2010 text: 2010-11-08 day: 08  | 
    
| PublicationDecade | 2010 | 
    
| PublicationPlace | United States | 
    
| PublicationPlace_xml | – name: United States – name: San Francisco – name: San Francisco, USA  | 
    
| PublicationTitle | PloS one | 
    
| PublicationTitleAlternate | PLoS One | 
    
| PublicationYear | 2010 | 
    
| Publisher | Public Library of Science Public Library of Science (PLoS)  | 
    
| Publisher_xml | – name: Public Library of Science – name: Public Library of Science (PLoS)  | 
    
| References | U Nagalakshmi (ref20) 2008; 320 DA Benson (ref36) 2004; 32 WJ Kent (ref19) 2002; 12 KF Au (ref25) 2010 F Lu (ref33) 2007; 8 JC Dohm (ref26) 2008; 36 (ref30) 2000; 408 KD Pruitt (ref35) 2005; 33 WJ Kent (ref40) 2002; 12 GA Heap (ref27) 2010; 19 BT Wilhelm (ref2) 2008; 453 DW Bryant (ref24) 2010 A Ameur (ref23) 2010; 11 M Yano (ref17) 2010; 66 C Trapnell (ref21) 2009; 25 ET Wang (ref13) 2008; 456 H Nagasaki (ref15) 2005; 364 H Li (ref3) 2009; 25 M Deutsch (ref31) 1999; 27 D Ramsköld (ref37) 2009; 5 R Lister (ref29) 2008; 133 H Li (ref28) 2008; 18 MJ Gardner (ref32) 2002; 419 GE Crooks (ref39) 2004; 14 JS Cox (ref9) 1996; 87 H Yoshida (ref8) 2001; 107 MC Wahl (ref6) 2009; 136 F De Bona (ref42) 2008; 24 R Li (ref4) 2009; 25 Q Pan (ref14) 2008; 40 PJ Shepard (ref12) 2009; 10 Z Wang (ref1) 2009; 10 S Stamm (ref7) 2006; 34 C Trapnell (ref22) 2010; 28 L Baum (ref41) 1970; 41 C Sidrauski (ref10) 1996; 87 S Sen (ref16) 2010 TW Nilsen (ref11) 2010; 463 K Sorber (ref34) 2008; 3 A Mortazavi (ref18) 2008; 5 H Richard (ref38) 2010 B Langmead (ref5) 2009; 10 19825846 - Hum Mol Genet. 2010 Jan 1;19(1):122-34 19451168 - Bioinformatics. 2009 Jul 15;25(14):1754-60 18978789 - Nat Genet. 2008 Dec;40(12):1413-5 20519504 - J Biol Chem. 2010 Aug 13;285(33):25426-37 8898193 - Cell. 1996 Nov 1;87(3):391-404 19015660 - Nat Rev Genet. 2009 Jan;10(1):57-63 11130711 - Nature. 2000 Dec 14;408(6814):796-815 18660515 - Nucleic Acids Res. 2008 Sep;36(16):e105 18978772 - Nature. 2008 Nov 27;456(7221):470-6 18423832 - Cell. 2008 May 2;133(3):523-36 19289445 - Bioinformatics. 2009 May 1;25(9):1105-11 19857271 - Genome Biol. 2009;10(10):242 12045153 - Genome Res. 2002 Jun;12(6):996-1006 18689821 - Bioinformatics. 2008 Aug 15;24(16):i174-80 18488015 - Nature. 2008 Jun 26;453(7199):1239-43 11779464 - Cell. 2001 Dec 28;107(7):881-91 15173120 - Genome Res. 2004 Jun;14(6):1188-90 20371516 - Nucleic Acids Res. 2010 Aug;38(14):4570-8 18451266 - Science. 2008 Jun 6;320(5881):1344-9 19239890 - Cell. 2009 Feb 20;136(4):701-18 20110989 - Nature. 2010 Jan 28;463(7280):457-63 20150413 - Nucleic Acids Res. 2010 Jun;38(10):e112 18941527 - PLoS One. 2008;3(10):e3495 8898194 - Cell. 1996 Nov 1;87(3):405-13 16381912 - Nucleic Acids Res. 2006 Jan 1;34(Database issue):D46-55 20011106 - PLoS Comput Biol. 2009 Dec;5(12):e1000598 17662120 - BMC Genomics. 2007;8:255 19261174 - Genome Biol. 2009;10(3):R25 19497933 - Bioinformatics. 2009 Aug 1;25(15):1966-7 20620871 - Neuron. 2010 Jun 24;66(6):848-58 12368864 - Nature. 2002 Oct 3;419(6906):498-511 18516045 - Nat Methods. 2008 Jul;5(7):621-8 14681350 - Nucleic Acids Res. 2004 Jan 1;32(Database issue):D23-6 18714091 - Genome Res. 2008 Nov;18(11):1851-8 20436464 - Nat Biotechnol. 2010 May;28(5):511-5 15608248 - Nucleic Acids Res. 2005 Jan 1;33(Database issue):D501-4 20236510 - Genome Biol. 2010;11(3):R34 10454621 - Nucleic Acids Res. 1999 Aug 1;27(15):3219-28 16219431 - Gene. 2005 Dec 30;364:53-62 11932250 - Genome Res. 2002 Apr;12(4):656-64 20410051 - Bioinformatics. 2010 Jun 15;26(12):1500-5  | 
    
| References_xml | – volume: 18 start-page: 1851 year: 2008 ident: ref28 article-title: Mapping short DNA sequencing reads and calling variants using mapping quality scores. publication-title: Genome Res doi: 10.1101/gr.078212.108 – volume: 8 start-page: 255 year: 2007 ident: ref33 article-title: cDNA sequences reveal considerable gene prediction inaccuracy in the Plasmodium falciparum genome. publication-title: BMC Genomics doi: 10.1186/1471-2164-8-255 – volume: 453 start-page: 1239 year: 2008 ident: ref2 article-title: Dynamic repertoire of a eukaryotic transcriptome surveyed at single-nucleotide resolution. publication-title: Nature doi: 10.1038/nature07002 – volume: 25 start-page: 1105 year: 2009 ident: ref21 article-title: TopHat: discovering splice junctions with RNA-Seq. publication-title: Bioinformatics doi: 10.1093/bioinformatics/btp120 – volume: 5 start-page: 621 year: 2008 ident: ref18 article-title: Mapping and quantifying mammalian transcriptomes by RNA-Seq. publication-title: Nat Methods doi: 10.1038/nmeth.1226 – volume: 133 start-page: 523 year: 2008 ident: ref29 article-title: Highly integrated single-base resolution maps of the epigenome in Arabidopsis. publication-title: Cell doi: 10.1016/j.cell.2008.03.029 – year: 2010 ident: ref16 article-title: Muscleblind-like 1 (Mbnl1) promotes insulin receptor exon 11 inclusion via binding to a downstream evolutionarily conserved intronic enhancer. publication-title: J Biol Chem – volume: 25 start-page: 1966 year: 2009 ident: ref4 article-title: SOAP2: an improved ultrafast tool for short read alignment. publication-title: Bioinformatics doi: 10.1093/bioinformatics/btp336 – volume: 11 start-page: R34 year: 2010 ident: ref23 article-title: Global and unbiased detection of splice junctions from RNA-seq data. publication-title: Genome Biol doi: 10.1186/gb-2010-11-3-r34 – volume: 33 start-page: D501 year: 2005 ident: ref35 article-title: NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. publication-title: Nucl Acids Res doi: 10.1093/nar/gki025 – volume: 320 start-page: 1344 year: 2008 ident: ref20 article-title: The transcriptional landscape of the yeast genome defined by RNA sequencing. publication-title: Science doi: 10.1126/science.1158441 – year: 2010 ident: ref25 article-title: Detection of splice junctions from paired-end RNA-seq data by SpliceMap. – volume: 456 start-page: 470 year: 2008 ident: ref13 article-title: Alternative isoform regulation in human tissue transcriptomes. publication-title: Nature doi: 10.1038/nature07509 – volume: 66 start-page: 848 year: 2010 ident: ref17 article-title: Nova2 Regulates Neuronal Migration through an RNA Switch in Disabled-1 Signaling. publication-title: Neuron doi: 10.1016/j.neuron.2010.05.007 – volume: 41 start-page: 164 year: 1970 ident: ref41 article-title: A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains. publication-title: The Annals of Mathematical Statistics doi: 10.1214/aoms/1177697196 – volume: 10 start-page: R25 year: 2009 ident: ref5 article-title: Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. publication-title: Genome Biol doi: 10.1186/gb-2009-10-3-r25 – volume: 32 start-page: D23 year: 2004 ident: ref36 article-title: GenBank: update. publication-title: Nucl Acids Res doi: 10.1093/nar/gkh045 – volume: 87 start-page: 391 year: 1996 ident: ref9 article-title: A Novel Mechanism for Regulating Activity of a Transcription Factor That Controls the Unfolded Protein Response. publication-title: Cell doi: 10.1016/S0092-8674(00)81360-4 – volume: 419 start-page: 498 year: 2002 ident: ref32 article-title: Genome sequence of the human malaria parasite Plasmodium falciparum. publication-title: Nature doi: 10.1038/nature01097 – volume: 364 start-page: 53 year: 2005 ident: ref15 article-title: Species-specific variation of alternative splicing and transcriptional initiation in six eukaryotes. publication-title: Gene doi: 10.1016/j.gene.2005.07.027 – volume: 36 start-page: e105 year: 2008 ident: ref26 article-title: Substantial biases in ultra-short read data sets from high-throughput DNA sequencing. publication-title: Nucleic Acids Res doi: 10.1093/nar/gkn425 – volume: 19 start-page: 122 year: 2010 ident: ref27 article-title: Genome-wide analysis of allelic expression imbalance in human primary cells by high-throughput transcriptome resequencing. publication-title: Hum Mol Genet doi: 10.1093/hmg/ddp473 – volume: 10 start-page: 242 year: 2009 ident: ref12 article-title: The SR protein family. publication-title: Genome Biol doi: 10.1186/gb-2009-10-10-242 – volume: 27 start-page: 3219 year: 1999 ident: ref31 article-title: Intron-exon structures of eukaryotic model organisms. publication-title: Nucleic Acids Res doi: 10.1093/nar/27.15.3219 – volume: 107 start-page: 881 year: 2001 ident: ref8 article-title: XBP1 mRNA is induced by ATF6 and spliced by IRE1 in response to ER stress to produce a highly active transcription factor. publication-title: Cell doi: 10.1016/S0092-8674(01)00611-0 – volume: 24 start-page: i174 year: 2008 ident: ref42 article-title: Optimal spliced alignments of short sequence reads. publication-title: Bioinformatics doi: 10.1093/bioinformatics/btn300 – year: 2010 ident: ref38 article-title: Prediction of alternative isoforms from exon expression levels in RNA-Seq experiments. doi: 10.1093/nar/gkq041 – volume: 40 start-page: 1413 year: 2008 ident: ref14 article-title: Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencing. publication-title: Nat Genet doi: 10.1038/ng.259 – year: 2010 ident: ref24 article-title: Supersplat – spliced RNA-seq alignment. doi: 10.1093/bioinformatics/btq206 – volume: 28 start-page: 511 year: 2010 ident: ref22 article-title: Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. publication-title: Nat Biotechnol doi: 10.1038/nbt.1621 – volume: 5 start-page: e1000598 year: 2009 ident: ref37 article-title: An abundance of ubiquitously expressed genes revealed by tissue transcriptome sequence data. publication-title: PLoS Comput Biol doi: 10.1371/journal.pcbi.1000598 – volume: 3 start-page: e3495 year: 2008 ident: ref34 article-title: The long march: a sample preparation technique that enhances contig length and coverage by high-throughput short-read sequencing. publication-title: PLoS ONE doi: 10.1371/journal.pone.0003495 – volume: 12 start-page: 996 year: 2002 ident: ref40 article-title: The Human Genome Browser at UCSC. publication-title: Genome Research doi: 10.1101/gr.229102. Article published online before print in May 2002 – volume: 463 start-page: 457 year: 2010 ident: ref11 article-title: Expansion of the eukaryotic proteome by alternative splicing. publication-title: Nature doi: 10.1038/nature08909 – volume: 14 start-page: 1188 year: 2004 ident: ref39 article-title: WebLogo: a sequence logo generator. publication-title: Genome Res doi: 10.1101/gr.849004 – volume: 87 start-page: 405 year: 1996 ident: ref10 article-title: tRNA ligase is required for regulated mRNA splicing in the unfolded protein response. publication-title: Cell doi: 10.1016/S0092-8674(00)81361-6 – volume: 136 start-page: 701 year: 2009 ident: ref6 article-title: The Spliceosome: Design Principles of a Dynamic RNP Machine. publication-title: Cell doi: 10.1016/j.cell.2009.02.009 – volume: 12 start-page: 656 year: 2002 ident: ref19 article-title: BLAT–the BLAST-like alignment tool. publication-title: Genome Res doi: 10.1101/gr.229202. Article published online before March 2002 – volume: 408 start-page: 796 year: 2000 ident: ref30 article-title: Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. publication-title: Nature doi: 10.1038/35048692 – volume: 10 start-page: 57 year: 2009 ident: ref1 article-title: RNA-Seq: a revolutionary tool for transcriptomics. publication-title: Nat Rev Genet doi: 10.1038/nrg2484 – volume: 34 start-page: D46 year: 2006 ident: ref7 article-title: ASD: a bioinformatics resource on alternative splicing. publication-title: Nucleic Acids Res doi: 10.1093/nar/gkj031 – volume: 25 start-page: 1754 year: 2009 ident: ref3 article-title: Fast and accurate short read alignment with Burrows-Wheeler transform. publication-title: Bioinformatics doi: 10.1093/bioinformatics/btp324 – reference: 20110989 - Nature. 2010 Jan 28;463(7280):457-63 – reference: 20150413 - Nucleic Acids Res. 2010 Jun;38(10):e112 – reference: 12045153 - Genome Res. 2002 Jun;12(6):996-1006 – reference: 20620871 - Neuron. 2010 Jun 24;66(6):848-58 – reference: 20371516 - Nucleic Acids Res. 2010 Aug;38(14):4570-8 – reference: 18714091 - Genome Res. 2008 Nov;18(11):1851-8 – reference: 15608248 - Nucleic Acids Res. 2005 Jan 1;33(Database issue):D501-4 – reference: 16219431 - Gene. 2005 Dec 30;364:53-62 – reference: 18978772 - Nature. 2008 Nov 27;456(7221):470-6 – reference: 19857271 - Genome Biol. 2009;10(10):242 – reference: 18516045 - Nat Methods. 2008 Jul;5(7):621-8 – reference: 10454621 - Nucleic Acids Res. 1999 Aug 1;27(15):3219-28 – reference: 18423832 - Cell. 2008 May 2;133(3):523-36 – reference: 19497933 - Bioinformatics. 2009 Aug 1;25(15):1966-7 – reference: 11932250 - Genome Res. 2002 Apr;12(4):656-64 – reference: 18488015 - Nature. 2008 Jun 26;453(7199):1239-43 – reference: 18978789 - Nat Genet. 2008 Dec;40(12):1413-5 – reference: 18660515 - Nucleic Acids Res. 2008 Sep;36(16):e105 – reference: 12368864 - Nature. 2002 Oct 3;419(6906):498-511 – reference: 17662120 - BMC Genomics. 2007;8:255 – reference: 18941527 - PLoS One. 2008;3(10):e3495 – reference: 19825846 - Hum Mol Genet. 2010 Jan 1;19(1):122-34 – reference: 11130711 - Nature. 2000 Dec 14;408(6814):796-815 – reference: 8898194 - Cell. 1996 Nov 1;87(3):405-13 – reference: 18689821 - Bioinformatics. 2008 Aug 15;24(16):i174-80 – reference: 20436464 - Nat Biotechnol. 2010 May;28(5):511-5 – reference: 19289445 - Bioinformatics. 2009 May 1;25(9):1105-11 – reference: 18451266 - Science. 2008 Jun 6;320(5881):1344-9 – reference: 19015660 - Nat Rev Genet. 2009 Jan;10(1):57-63 – reference: 16381912 - Nucleic Acids Res. 2006 Jan 1;34(Database issue):D46-55 – reference: 20011106 - PLoS Comput Biol. 2009 Dec;5(12):e1000598 – reference: 20236510 - Genome Biol. 2010;11(3):R34 – reference: 11779464 - Cell. 2001 Dec 28;107(7):881-91 – reference: 19261174 - Genome Biol. 2009;10(3):R25 – reference: 20410051 - Bioinformatics. 2010 Jun 15;26(12):1500-5 – reference: 19451168 - Bioinformatics. 2009 Jul 15;25(14):1754-60 – reference: 8898193 - Cell. 1996 Nov 1;87(3):391-404 – reference: 19239890 - Cell. 2009 Feb 20;136(4):701-18 – reference: 20519504 - J Biol Chem. 2010 Aug 13;285(33):25426-37 – reference: 14681350 - Nucleic Acids Res. 2004 Jan 1;32(Database issue):D23-6 – reference: 15173120 - Genome Res. 2004 Jun;14(6):1188-90  | 
    
| SSID | ssj0053866 | 
    
| Score | 2.2175462 | 
    
| Snippet | High-throughput sequencing of an organism's transcriptome, or RNA-Seq, is a valuable and versatile new strategy for capturing snapshots of gene expression.... Background High-throughput sequencing of an organism's transcriptome, or RNA-Seq, is a valuable and versatile new strategy for capturing snapshots of gene... Background High-throughput sequencing of an organism's transcriptome, or RNA-Seq, is a valuable and versatile new strategy for capturing snapshots of gene...  | 
    
| SourceID | plos doaj unpaywall pubmedcentral proquest gale pubmed crossref  | 
    
| SourceType | Open Website Open Access Repository Aggregation Database Index Database Enrichment Source  | 
    
| StartPage | e13875 | 
    
| SubjectTerms | Acids Algorithms Alternative splicing Animals Arabidopsis - genetics Arabidopsis thaliana Biochemistry Biophysics Computational Biology - methods Computational Biology/Alternative Splicing Computational Biology/Genomics Datasets Erythrocytes Gene expression Gene Expression Profiling Gene mapping Gene sequencing Genes Genomes Genomics Humans Isoforms Markov analysis Medical research Molecular Biology/Bioinformatics Molecular Biology/RNA Splicing Next-generation sequencing Organisms Plasmodium falciparum Plasmodium falciparum - genetics Proteins Reproducibility of Results Ribonucleic acid RNA RNA sequencing RNA Splice Sites - genetics RNA Splicing - genetics Splice junctions Statistical analysis Studies  | 
    
| SummonAdditionalLinks | – databaseName: DOAJ Directory of Open Access Journals dbid: DOA link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV3db9MwELdQX-AFMb4WGGAhJOAhXRLHTsxbQUwFqUNaGdqbFTs2DEVJWVqh_ffcOWnUiKHtgbfI_sVV7s535_o-CHllHDjBYNfCIrMsTEvOQ5kXZShdAtYyLcrcYYLz4ljMT9PPZ_xsp9UXxoR15YE7wh3qpORgJuMUO3ULIaVxCWcan2DQaNS-US63h6lOB8MuFqJPlGNZfNjzZbpqajv1l3MYV7hjiHy9_kErT1ZV017lcv4dOXl7U6-Ky99FVe2YpaN75G7vT9JZ9x175Jat75O9fse29E1fVvrtA1LNF4sl3lbbi3e0oOumqSh4rNT6IhLwS7SoS9piQDuqQIr5uhjfeUkbR_Gvt9oDahiraOsXoj_BKnrBpec1PTmehUv7i2LQ6UNyevTx64d52PdaCE2WsDWo4linwsDhgQupY6ex1B2TptAuYiYqJXgWBZOpcSU3NnImMYkElBQ2siJx7BGZ1EDdfUK5tSyxWurSwfGDcy3hIRc511HqYiYCwraEV6YvRI79MCrlb9cyOJB0tFPILtWzKyDh8NaqK8RxDf498nTAYhltPwDCpXrhUtcJV0BeoESoLid1UAZqlmZwsgdVxwLy0iOwlEaNsTrfi03bqk9fvt0AtDwZgV73INcAOUzR50fAN2GJrhHyYIQEhWBG0_sov1uqtApcOs6wL1kEb25l-uppOkzjohh_V9tm06pMygxRyb8huQBmg_cJkMfdJhlon8RRhi3SApKNts-IOeOZ-vyHr3WOid-CwZrTYaPdiP1P_gf7n5I7PljE3yIckMn6YmOfgQ-61s-9uvkDyXeC_w priority: 102 providerName: Directory of Open Access Journals – databaseName: ProQuest Central dbid: BENPR link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3db9MwELdG9wAviPG1wgALIQEP2dI4cWokhDrWqUxqQe2G9hY5jj0qRUnXtEL777lznbCIAXuL7F9c5c734fo-CHmjDDjBYNc8GWvmhVkUeaIvM0-YAKxlKLO-wQTn8YSPzsKT8-h8i0zqXBgMq6x1olXUWanwP_IDsEsRw-ZK_qfFpYddo_B2tW6hIV1rheyjLTF2h2wHWBmrQ7YPh5Nv01o3g3Rz7hLoWNw7cPzaX5SF3reXdhhveM1A2Tr-jbbuLPKyuskV_TOi8u66WMirnzLPr5mr4wfkvvMz6WCzMXbIli4ekh0nyRV958pNv39E8tF4PMNbbL38QAf0tCxzCp4sHdriEvBLVBYZnWGgO6pGejSvFMZ9XtHSUOyaXVjABMZyulmInoC1tBuazgs6nQy8mb6kR3IlH5Oz4-Hp55HnejB4Kg7YClR0Lw25gkNFxEXaMymWwGNCydT4TPmZAI9DMhEqk0VK-0YFKhCAElz7mgeGPSGdAqi7S2ikNQt0KtLMwLEkilIBD33ej1I_ND3Gu4TVhE-UK1COfTLyxN66xXBQ2dAuQXYljl1d4jVvLTYFOv6DP0SeNlgsr20HyuVF4qQ1SYMsAt-sF2J7eM6FUCaIWIpPMKjSLnmFOyLZ5Ko2SiIZhDGc-EEFsi55bRFYYqPAGJ4Lua6q5MvX77cAzaYt0FsHMiWQQ0mXNwHfhKW7Wsi9FhIUhWpN7-L-ralSJb9FCt6s9_TN07SZxkUxLq_Q5bpKYiFiRAV_h_Q5MBu8UoA83QhJQ3uQ0Bhbp3VJ3BKfFnPaM8X8h62BjgnhnMGa-42g3Yr9z_79pc_JPRseYu8N9khntVzrF-B1rtKXTpX8AqAAgXQ priority: 102 providerName: ProQuest – databaseName: Unpaywall dbid: UNPAY link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV3Nb9MwFLdGd4ALML5WGGAhxMchXRLHTs2tsE1lUgtaN7QdUBQ7MUxEyVhaoXHgb-c9x40IbGIcuFn2z278bL_33PdhQp5qA0owyDUvjXPmRRnnnhymmSdNCNIySrOhwQDnyVSMD6LdQ364Qj4uY2EcBeGOWFS1teRjoSrzTUfJTcxX1FhPBwGLg2WPwQmABtbwFvNnNuMQ_jM2xwCkK2RVcFDVe2T1YPp-dNRYmkNPhD5z4XQXjdQRVzarf8u7e_hl5ymmf_pXXl2UJ-nZt7QofhFeOzfIj-W0G5-VL4PFXA30998yQv43utwk153aS0fNKGtkJS9vkTXHWGr6wmW_fnmbFOPJZIZG9fz0FR3R_aoqKCjWdNvmuoCp0rTM6Az97pFT063jWqMb6hmtDMVHvEsLmEJdQZuB6C4Ib3u-6HFJ96Yjb5Z_pVvpPL1DDna299-MPfckhKfjkM1BYgQqEhruOFxIFRiFGfmY1KkyPtN-JkEBSpmMtMm4zn2jQx1KQEmR-7kIDbtLeiUQZJ1QnucszJVUmYFbEudKQmEohlz5kQmY6BO2XPlEu3zp-GxHkVgjYAz3poZ2CVI4cRTuE6_tddLkC_kL_jVuqhaL2b5tBSxx4pY2UWHGQVUMInytXggptQk5U1iCSq365DFuyaQJnW15VjKKYjaEX_FZnzyxCMz4UaJL0ad0UdfJ23cfLgGa7XVAzx3IVEAOnbowDpgT7sAOcqODBL6lO83ruIWXVKkT0Dw5w-fTfOi5PFTnN9O2GQdFN8EyrxZ1EksZIyq8GDIUsNigJAPkXnNKW9qHgR_jS259EnfOb2dxui3l8Webkh3j0wWDMQftSb_U8t__1w4PyDXrv2INGxukNz9d5A9BLZ6rR465_QTVHrqs priority: 102 providerName: Unpaywall  | 
    
| Title | HMMSplicer: A Tool for Efficient and Sensitive Discovery of Known and Novel Splice Junctions in RNA-Seq Data | 
    
| URI | https://www.ncbi.nlm.nih.gov/pubmed/21079731 https://www.proquest.com/docview/1295354290 https://www.proquest.com/docview/799795352 https://www.proquest.com/docview/864950912 https://pubmed.ncbi.nlm.nih.gov/PMC2975632 https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0013875&type=printable https://doaj.org/article/b2d50651417146699cf253b6699514cb http://dx.doi.org/10.1371/journal.pone.0013875  | 
    
| UnpaywallVersion | publishedVersion | 
    
| Volume | 5 | 
    
| hasFullText | 1 | 
    
| inHoldings | 1 | 
    
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVFSB databaseName: Free Full-Text Journals in Chemistry customDbUrl: eissn: 1932-6203 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0053866 issn: 1932-6203 databaseCode: HH5 dateStart: 20060101 isFulltext: true titleUrlDefault: http://abc-chemistry.org/ providerName: ABC ChemistRy – providerCode: PRVAFT databaseName: Open Access Digital Library customDbUrl: eissn: 1932-6203 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0053866 issn: 1932-6203 databaseCode: KQ8 dateStart: 20060101 isFulltext: true titleUrlDefault: http://grweb.coalliance.org/oadl/oadl.html providerName: Colorado Alliance of Research Libraries – providerCode: PRVAFT databaseName: Open Access Digital Library customDbUrl: eissn: 1932-6203 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0053866 issn: 1932-6203 databaseCode: KQ8 dateStart: 20061001 isFulltext: true titleUrlDefault: http://grweb.coalliance.org/oadl/oadl.html providerName: Colorado Alliance of Research Libraries – providerCode: PRVAON databaseName: DOAJ Open Access Full Text customDbUrl: eissn: 1932-6203 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0053866 issn: 1932-6203 databaseCode: DOA dateStart: 20060101 isFulltext: true titleUrlDefault: https://www.doaj.org/ providerName: Directory of Open Access Journals – providerCode: PRVEBS databaseName: EBSCO Food Science Source customDbUrl: eissn: 1932-6203 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0053866 issn: 1932-6203 databaseCode: A8Z dateStart: 20080101 isFulltext: true titleUrlDefault: https://search.ebscohost.com/login.aspx?authtype=ip,uid&profile=ehost&defaultdb=fsr providerName: EBSCOhost – providerCode: PRVEBS databaseName: EBSCOhost Academic Search Ultimate customDbUrl: https://search.ebscohost.com/login.aspx?authtype=ip,shib&custid=s3936755&profile=ehost&defaultdb=asn eissn: 1932-6203 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0053866 issn: 1932-6203 databaseCode: ABDBF dateStart: 20080101 isFulltext: true titleUrlDefault: https://search.ebscohost.com/direct.asp?db=asn providerName: EBSCOhost – providerCode: PRVBFR databaseName: Free Medical Journals customDbUrl: eissn: 1932-6203 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0053866 issn: 1932-6203 databaseCode: DIK dateStart: 20060101 isFulltext: true titleUrlDefault: http://www.freemedicaljournals.com providerName: Flying Publisher – providerCode: PRVFQY databaseName: GFMER Free Medical Journals customDbUrl: eissn: 1932-6203 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0053866 issn: 1932-6203 databaseCode: GX1 dateStart: 20060101 isFulltext: true titleUrlDefault: http://www.gfmer.ch/Medical_journals/Free_medical.php providerName: Geneva Foundation for Medical Education and Research – providerCode: PRVHPJ databaseName: ROAD: Directory of Open Access Scholarly Resources customDbUrl: eissn: 1932-6203 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0053866 issn: 1932-6203 databaseCode: M~E dateStart: 20060101 isFulltext: true titleUrlDefault: https://road.issn.org providerName: ISSN International Centre – providerCode: PRVAQN databaseName: PubMed Central customDbUrl: eissn: 1932-6203 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0053866 issn: 1932-6203 databaseCode: RPM dateStart: 20060101 isFulltext: true titleUrlDefault: https://www.ncbi.nlm.nih.gov/pmc/ providerName: National Library of Medicine – providerCode: PRVPQU databaseName: Health & Medical Collection (Proquest) customDbUrl: eissn: 1932-6203 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0053866 issn: 1932-6203 databaseCode: 7X7 dateStart: 20061201 isFulltext: true titleUrlDefault: https://search.proquest.com/healthcomplete providerName: ProQuest – providerCode: PRVPQU databaseName: ProQuest Central customDbUrl: http://www.proquest.com/pqcentral?accountid=15518 eissn: 1932-6203 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0053866 issn: 1932-6203 databaseCode: BENPR dateStart: 20061201 isFulltext: true titleUrlDefault: https://www.proquest.com/central providerName: ProQuest – providerCode: PRVPQU databaseName: ProQuest Technology Collection customDbUrl: eissn: 1932-6203 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0053866 issn: 1932-6203 databaseCode: 8FG dateStart: 20061201 isFulltext: true titleUrlDefault: https://search.proquest.com/technologycollection1 providerName: ProQuest – providerCode: PRVPQU databaseName: Public Health Database (ProQuest) customDbUrl: eissn: 1932-6203 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0053866 issn: 1932-6203 databaseCode: 8C1 dateStart: 20061201 isFulltext: true titleUrlDefault: https://search.proquest.com/publichealth providerName: ProQuest – providerCode: PRVFZP databaseName: Scholars Portal Journals: Open Access customDbUrl: eissn: 1932-6203 dateEnd: 20250930 omitProxy: true ssIdentifier: ssj0053866 issn: 1932-6203 databaseCode: M48 dateStart: 20061201 isFulltext: true titleUrlDefault: http://journals.scholarsportal.info providerName: Scholars Portal  | 
    
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3db9MwELe27gFeEONrhVEshAQ8pErj2KmREOq2ljKpZVpXNJ6ixLHHpCjpmlbQ_547N42I6MTESxTZPzvqne_D9fmOkDfKgBMMds2JAs0cP-Hckd0ocaTxwFr6UdI1eMF5NBbDqX96yS93yKZma0nAYuvWDutJTedp-9fN6hMI_EdbtSHobAa1Z3mm2_boLeC7ZA9slcRiDiO_OlcA6RaivEB320ibHtgNsKRTzVbZlP6V4m7M0rzY5pX-HVx5b5nNotXPKE3_sFyDh-RB6XLS3nqN7JMdnT0i-6VQF_RdmXn6_WOSDkejCR5o6_kH2qMXeZ5ScGpp3-aZgC_RKEvoBGPeUUvSk-tCYQjoiuaGYgHtzALG0JbS9UT0FAynXdv0OqPn454z0Tf0JFpET8h00L84HjplOQZHBR5bgLbuxL5QsL_gQsYdE2M2PCZVFBuXKTeR4HxETPrKJFxp1yhPeRJQUmhXC8-wp6SRAaEPCOVaM0_HMk4M7FA4jyW8dEWXx65vOkw0CdsQPlRlrnIsmZGG9gAugD3LmnYhci4sOdckTjVqts7V8Q_8EfK0wmKmbduQz6_CUnDD2Es4uGkdHyvFCyGlMh5nMb5Bo4qb5BWuiHB9bbXSF2HPD2DzD9qQNclri8BsGxmG81xFy6IIv3z9dgfQ5LwGeluCTA7kUFF5hQJ-E2bxqiEPa0jQGarWfYDrd0OVIgSvjzMsXebCyM2a3t5Nq26cFEP0Mp0vizCQMkCUdzukK4DZ4KAC5NlaSCrab0SuSYKa-NSYU-_Jrn_YdOh4N1wwmLNdCdqd2P_8vz_1gty3QST2dOGQNBbzpX4JvukibpHd4DKAZ_e4g8_B5xbZO-qPz85b9t-ellVH0DYdn_W-_wZrVJTt | 
    
| linkProvider | Scholars Portal | 
    
| linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV3db9MwELdGeRgviPG1wmAWAgEP6dI4cWokhArd1G5rkdYN9S0kjj0qRUnXtJr6T_E3cuekYRED9rK3KP7FUe7Od-f4Pgh5LTU4wWDXrNBXzHJjz7NEJ4wtoR2wlm4YdzQmOA9HvH_mHk68yQb5uc6FwbDKtU40ijrOJP4j3wO75DFsrmR_ml1Y2DUKT1fXLTQKsThSq0vYsuUfBz3g7xvHOdg__dK3yq4ClvQdtgCl045cLsFN9riI2jrCom5MyDDSNpN2LMCGhky4UseeVLaWjnQEoARXtuKOZjDvHXLXZaBLYP34k2qDB7qD8zI9j_ntvVIaWrMsVS1zJIjRjFfMn-kSUNmCxizJ8usc3T_jNTeX6SxcXYZJcsUYHjwg90svlnYLsdsiGyp9SLZKPZHTd2Ux6_ePSNIfDsd4Rq7mH2iXnmZZQsFPpvumdAW8iYZpTMcYRo-Kl_amucSo0hXNNMWe3KkBjOBeQouJ6CHYYrNc6DSlJ6OuNVYXtBcuwsfk7FZ48YQ0UqDuNqGeUsxRkYhiDZsez4sEXHR4x4tsV7cZbxK2Jnwgy_Ln2IUjCcyZng_boIJ2AbIrKNnVJFb11Kwo__Ef_GfkaYXF4t3mRjY_D0pdEERO7IHn13ax-TznQkjteCzCK7gpoybZRYkIikzYSgUFXddnHXiLzZrklUFgAY8UI4TOw2WeB4Ov324AGp_UQG9LkM6AHDIsszLgm7AwWA25U0OCGpK14W2U3zVV8uD3goUn1zJ9_TCthnFSjPpLVbbMA18IH1HO3yEdDswGnxcgT4tFUtHeads-NmZrEr-2fGrMqY-k0x-mwjqmm3MGc7aqhXYj9j_795fuks3-6fA4OB6Mjp6TeyYQxZxQ7JDGYr5UL8C_XUQvjVKh5Ptta7FfBGS33w | 
    
| linkToPdf | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV3db9MwELdGkYAXxPhaYTALgYCHtGmcODUSQoWuWjdW0LqhvoXEsUelKOmaVlP_Nf467pw0LGLAXvYWxb84yt357hzfByEvpQYnGOyaFfqKWW7seZbohrEltAPW0g3jrsYE58MR3ztx9yfeZIP8XOfCYFjlWicaRR1nEv-Rt8EueQybK9ltXYZFfO0PPszOLOwghSet63YahYgcqNU5bN_y98M-8PqV4wx2jz_tWWWHAUv6DluAAupELpfgMntcRB0dYYE3JmQYaZtJOxZgT0MmXKljTypbS0c6AlCCK1txRzOY9wa56TMmMJzQn1SbPdAjnJepeszvtEvJaM2yVLXM8SBGNl4whaZjQGUXGrMkyy9zev-M3by9TGfh6jxMkguGcXCP3C09WtorRHCTbKj0PtksdUZO35SFrd8-IMne4eEYz8vV_B3t0eMsSyj4zHTXlLGAN9EwjekYQ-pRCdP-NJcYYbqimabYnzs1gBHcS2gxEd0Hu2yWDp2m9GjUs8bqjPbDRfiQnFwLLx6RRgrU3SLUU4o5KhJRrGED5HmRgIsu73qR7eoO403C1oQPZFkKHTtyJIE53_NhS1TQLkB2BSW7msSqnpoVpUD-g_-IPK2wWMjb3Mjmp0GpF4LIiT3wAjsuNqLnXAipHY9FeAU3ZdQkOygRQZEVW6mjoOf6rAtvsVmTvDAILOaR4rI4DZd5Hgy_fLsCaHxUA70uQToDcsiwzNCAb8IiYTXkdg0JKknWhrdQftdUyYPfixeeXMv05cO0GsZJMQIwVdkyD3whfEQ5f4d0OTAb_F-APC4WSUV7p2P72KStSfza8qkxpz6STn-YauuYes4ZzNmqFtqV2P_k31-6Q26B_go-D0cHT8kdE5NiDiu2SWMxX6pn4OououdGp1Dy_bqV2C8Mhbwi | 
    
| linkToUnpaywall | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV3Nb9MwFLdGd4ALML5WGGAhxMchXRLHTs2tsE1lUgtaN7QdUBQ7MUxEyVhaoXHgb-c9x40IbGIcuFn2z278bL_33PdhQp5qA0owyDUvjXPmRRnnnhymmSdNCNIySrOhwQDnyVSMD6LdQ364Qj4uY2EcBeGOWFS1teRjoSrzTUfJTcxX1FhPBwGLg2WPwQmABtbwFvNnNuMQ_jM2xwCkK2RVcFDVe2T1YPp-dNRYmkNPhD5z4XQXjdQRVzarf8u7e_hl5ymmf_pXXl2UJ-nZt7QofhFeOzfIj-W0G5-VL4PFXA30998yQv43utwk153aS0fNKGtkJS9vkTXHWGr6wmW_fnmbFOPJZIZG9fz0FR3R_aoqKCjWdNvmuoCp0rTM6Az97pFT063jWqMb6hmtDMVHvEsLmEJdQZuB6C4Ib3u-6HFJ96Yjb5Z_pVvpPL1DDna299-MPfckhKfjkM1BYgQqEhruOFxIFRiFGfmY1KkyPtN-JkEBSpmMtMm4zn2jQx1KQEmR-7kIDbtLeiUQZJ1QnucszJVUmYFbEudKQmEohlz5kQmY6BO2XPlEu3zp-GxHkVgjYAz3poZ2CVI4cRTuE6_tddLkC_kL_jVuqhaL2b5tBSxx4pY2UWHGQVUMInytXggptQk5U1iCSq365DFuyaQJnW15VjKKYjaEX_FZnzyxCMz4UaJL0ad0UdfJ23cfLgGa7XVAzx3IVEAOnbowDpgT7sAOcqODBL6lO83ruIWXVKkT0Dw5w-fTfOi5PFTnN9O2GQdFN8EyrxZ1EksZIyq8GDIUsNigJAPkXnNKW9qHgR_jS259EnfOb2dxui3l8Webkh3j0wWDMQftSb_U8t__1w4PyDXrv2INGxukNz9d5A9BLZ6rR465_QTVHrqs | 
    
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=HMMSplicer%3A+A+Tool+for+Efficient+and+Sensitive+Discovery+of+Known+and+Novel+Splice+Junctions+in+RNA-Seq+Data&rft.jtitle=PloS+one&rft.au=Dimon%2C+Michelle+T.&rft.au=Sorber%2C+Katherine&rft.au=DeRisi%2C+Joseph+L.&rft.date=2010-11-08&rft.pub=Public+Library+of+Science&rft.eissn=1932-6203&rft.volume=5&rft.issue=11&rft_id=info:doi/10.1371%2Fjournal.pone.0013875&rft_id=info%3Apmid%2F21079731&rft.externalDocID=PMC2975632 | 
    
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1932-6203&client=summon | 
    
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1932-6203&client=summon | 
    
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1932-6203&client=summon |