Discovery of numerous novel small genes in the intergenic regions of the Escherichia coli O157:H7 Sakai genome

In the past, short protein-coding genes were often disregarded by genome annotation pipelines. Transcriptome sequencing (RNAseq) signals outside of annotated genes have usually been interpreted to indicate either ncRNA or pervasive transcription. Therefore, in addition to the transcriptome, the tran...

Full description

Saved in:
Bibliographic Details
Published inPloS one Vol. 12; no. 9; p. e0184119
Main Authors Hücker, Sarah M., Ardern, Zachary, Goldberg, Tatyana, Schafferhans, Andrea, Bernhofer, Michael, Vestergaard, Gisle, Nelson, Chase W., Schloter, Michael, Rost, Burkhard, Scherer, Siegfried, Neuhaus, Klaus
Format Journal Article
LanguageEnglish
Published United States Public Library of Science 13.09.2017
Public Library of Science (PLoS)
Subjects
Online AccessGet full text
ISSN1932-6203
1932-6203
DOI10.1371/journal.pone.0184119

Cover

Abstract In the past, short protein-coding genes were often disregarded by genome annotation pipelines. Transcriptome sequencing (RNAseq) signals outside of annotated genes have usually been interpreted to indicate either ncRNA or pervasive transcription. Therefore, in addition to the transcriptome, the translatome (RIBOseq) of the enteric pathogen Escherichia coli O157:H7 strain Sakai was determined at two optimal growth conditions and a severe stress condition combining low temperature and high osmotic pressure. All intergenic open reading frames potentially encoding a protein of ≥ 30 amino acids were investigated with regard to coverage by transcription and translation signals and their translatability expressed by the ribosomal coverage value. This led to discovery of 465 unique, putative novel genes not yet annotated in this E. coli strain, which are evenly distributed over both DNA strands of the genome. For 255 of the novel genes, annotated homologs in other bacteria were found, and a machine-learning algorithm, trained on small protein-coding E. coli genes, predicted that 89% of these translated open reading frames represent bona fide genes. The remaining 210 putative novel genes without annotated homologs were compared to the 255 novel genes with homologs and to 250 short annotated genes of this E. coli strain. All three groups turned out to be similar with respect to their translatability distribution, fractions of differentially regulated genes, secondary structure composition, and the distribution of evolutionary constraint, suggesting that both novel groups represent legitimate genes. However, the machine-learning algorithm only recognized a small fraction of the 210 genes without annotated homologs. It is possible that these genes represent a novel group of genes, which have unusual features dissimilar to the genes of the machine-learning algorithm training set.
AbstractList In the past, short protein-coding genes were often disregarded by genome annotation pipelines. Transcriptome sequencing (RNAseq) signals outside of annotated genes have usually been interpreted to indicate either ncRNA or pervasive transcription. Therefore, in addition to the transcriptome, the translatome (RIBOseq) of the enteric pathogen Escherichia coli O157:H7 strain Sakai was determined at two optimal growth conditions and a severe stress condition combining low temperature and high osmotic pressure. All intergenic open reading frames potentially encoding a protein of ≥ 30 amino acids were investigated with regard to coverage by transcription and translation signals and their translatability expressed by the ribosomal coverage value. This led to discovery of 465 unique, putative novel genes not yet annotated in this E. coli strain, which are evenly distributed over both DNA strands of the genome. For 255 of the novel genes, annotated homologs in other bacteria were found, and a machine-learning algorithm, trained on small protein-coding E. coli genes, predicted that 89% of these translated open reading frames represent bona fide genes. The remaining 210 putative novel genes without annotated homologs were compared to the 255 novel genes with homologs and to 250 short annotated genes of this E. coli strain. All three groups turned out to be similar with respect to their translatability distribution, fractions of differentially regulated genes, secondary structure composition, and the distribution of evolutionary constraint, suggesting that both novel groups represent legitimate genes. However, the machine-learning algorithm only recognized a small fraction of the 210 genes without annotated homologs. It is possible that these genes represent a novel group of genes, which have unusual features dissimilar to the genes of the machine-learning algorithm training set.
In the past, short protein-coding genes were often disregarded by genome annotation pipelines. Transcriptome sequencing (RNAseq) signals outside of annotated genes have usually been interpreted to indicate either ncRNA or pervasive transcription. Therefore, in addition to the transcriptome, the translatome (RIBOseq) of the enteric pathogen Escherichia coli O157:H7 strain Sakai was determined at two optimal growth conditions and a severe stress condition combining low temperature and high osmotic pressure. All intergenic open reading frames potentially encoding a protein of ≥ 30 amino acids were investigated with regard to coverage by transcription and translation signals and their translatability expressed by the ribosomal coverage value. This led to discovery of 465 unique, putative novel genes not yet annotated in this E. coli strain, which are evenly distributed over both DNA strands of the genome. For 255 of the novel genes, annotated homologs in other bacteria were found, and a machine-learning algorithm, trained on small protein-coding E. coli genes, predicted that 89% of these translated open reading frames represent bona fide genes. The remaining 210 putative novel genes without annotated homologs were compared to the 255 novel genes with homologs and to 250 short annotated genes of this E. coli strain. All three groups turned out to be similar with respect to their translatability distribution, fractions of differentially regulated genes, secondary structure composition, and the distribution of evolutionary constraint, suggesting that both novel groups represent legitimate genes. However, the machine-learning algorithm only recognized a small fraction of the 210 genes without annotated homologs. It is possible that these genes represent a novel group of genes, which have unusual features dissimilar to the genes of the machine-learning algorithm training set.In the past, short protein-coding genes were often disregarded by genome annotation pipelines. Transcriptome sequencing (RNAseq) signals outside of annotated genes have usually been interpreted to indicate either ncRNA or pervasive transcription. Therefore, in addition to the transcriptome, the translatome (RIBOseq) of the enteric pathogen Escherichia coli O157:H7 strain Sakai was determined at two optimal growth conditions and a severe stress condition combining low temperature and high osmotic pressure. All intergenic open reading frames potentially encoding a protein of ≥ 30 amino acids were investigated with regard to coverage by transcription and translation signals and their translatability expressed by the ribosomal coverage value. This led to discovery of 465 unique, putative novel genes not yet annotated in this E. coli strain, which are evenly distributed over both DNA strands of the genome. For 255 of the novel genes, annotated homologs in other bacteria were found, and a machine-learning algorithm, trained on small protein-coding E. coli genes, predicted that 89% of these translated open reading frames represent bona fide genes. The remaining 210 putative novel genes without annotated homologs were compared to the 255 novel genes with homologs and to 250 short annotated genes of this E. coli strain. All three groups turned out to be similar with respect to their translatability distribution, fractions of differentially regulated genes, secondary structure composition, and the distribution of evolutionary constraint, suggesting that both novel groups represent legitimate genes. However, the machine-learning algorithm only recognized a small fraction of the 210 genes without annotated homologs. It is possible that these genes represent a novel group of genes, which have unusual features dissimilar to the genes of the machine-learning algorithm training set.
Author Hücker, Sarah M.
Rost, Burkhard
Goldberg, Tatyana
Bernhofer, Michael
Ardern, Zachary
Nelson, Chase W.
Vestergaard, Gisle
Neuhaus, Klaus
Scherer, Siegfried
Schafferhans, Andrea
Schloter, Michael
AuthorAffiliation 2 ZIEL - Institute for Food & Health, Technische Universität München, Freising, Germany
3 Department of Informatics—Bioinformatics & TUM-IAS, Technische Universität München, Garching, Germany
5 Sackler Institute for Comparative Genomics, American Museum of Natural History New York, New York, United States of America
6 Core Facility Microbiome/NGS, ZIEL - Institute for Food & Health, Technische Universität München, Freising, Germany
4 Research Unit Environmental Genomics, Helmholtz Zentrum München, Neuherberg, Germany
Centre for Research and Technology-Hellas, GREECE
1 Chair for Microbial Ecology, Technische Universität München, Freising, Germany
AuthorAffiliation_xml – name: 1 Chair for Microbial Ecology, Technische Universität München, Freising, Germany
– name: 6 Core Facility Microbiome/NGS, ZIEL - Institute for Food & Health, Technische Universität München, Freising, Germany
– name: Centre for Research and Technology-Hellas, GREECE
– name: 4 Research Unit Environmental Genomics, Helmholtz Zentrum München, Neuherberg, Germany
– name: 3 Department of Informatics—Bioinformatics & TUM-IAS, Technische Universität München, Garching, Germany
– name: 2 ZIEL - Institute for Food & Health, Technische Universität München, Freising, Germany
– name: 5 Sackler Institute for Comparative Genomics, American Museum of Natural History New York, New York, United States of America
Author_xml – sequence: 1
  givenname: Sarah M.
  surname: Hücker
  fullname: Hücker, Sarah M.
– sequence: 2
  givenname: Zachary
  surname: Ardern
  fullname: Ardern, Zachary
– sequence: 3
  givenname: Tatyana
  surname: Goldberg
  fullname: Goldberg, Tatyana
– sequence: 4
  givenname: Andrea
  surname: Schafferhans
  fullname: Schafferhans, Andrea
– sequence: 5
  givenname: Michael
  surname: Bernhofer
  fullname: Bernhofer, Michael
– sequence: 6
  givenname: Gisle
  surname: Vestergaard
  fullname: Vestergaard, Gisle
– sequence: 7
  givenname: Chase W.
  surname: Nelson
  fullname: Nelson, Chase W.
– sequence: 8
  givenname: Michael
  surname: Schloter
  fullname: Schloter, Michael
– sequence: 9
  givenname: Burkhard
  surname: Rost
  fullname: Rost, Burkhard
– sequence: 10
  givenname: Siegfried
  surname: Scherer
  fullname: Scherer, Siegfried
– sequence: 11
  givenname: Klaus
  orcidid: 0000-0002-6020-2814
  surname: Neuhaus
  fullname: Neuhaus, Klaus
BackLink https://www.ncbi.nlm.nih.gov/pubmed/28902868$$D View this record in MEDLINE/PubMed
BookMark eNqNUk1v1DAQjVAR_YB_gMASFy672E7sOD0goVJopUo90LvlOONdL4692EnR_nucblq1FUKcxpp58-bNPB8XBz54KIq3BC9JWZNPmzBGr9xym9NLTERFSPOiOCJNSRec4vLg0fuwOE5pgzErBeevikMqGkwFF0eF_2qTDrcQdygY5MceYhgT8jnlUOqVc2gFHhKyHg1ryGGAmDNWowgrG3ya-qbKedJriFavrUI6OIuuCatPL2r0Q_1UdmIJPbwuXhrlEryZ40lx8-385uxicXX9_fLsy9VCM8qHBSetVmAablrdEUxZq7guAQNrWmMob0vBiGG6JhxqA7jlmhnamEYoxStTnhTv97RbF5KcL5VkvodgtCaUZsTlHtEFtZHbaHsVdzIoK-8SIa6kioPVDmQHohPAidBtU3VUN8Cg4nl2VxphujJzsT3X6Ldq9zvf7IGQYDmZdS9BTmbJ2azc93lWObY9dBr8EJV7IuZpxdu1XIVbyVhTUywywceZIIZfI6RB9tlNcE55yC7e7SuYwGKa9eEZ9O9XefdY0YOU-_-SAdUeoGNIKYL5301Pn7VpO6ghf5-8l3X_bv4DuObtjA
CitedBy_id crossref_primary_10_1093_bioinformatics_btz714
crossref_primary_10_3389_fmicb_2018_00931
crossref_primary_10_1080_07391102_2020_1808531
crossref_primary_10_1074_jbc_RA118_001996
crossref_primary_10_1080_10409238_2021_2006599
crossref_primary_10_1093_femsre_fuad063
crossref_primary_10_1128_mBio_02819_18
crossref_primary_10_3389_fmicb_2020_00377
crossref_primary_10_15252_msb_20188290
crossref_primary_10_1038_s41598_018_35756_y
crossref_primary_10_1016_j_molcel_2025_01_025
crossref_primary_10_1186_s12866_023_02988_6
crossref_primary_10_1002_pmic_202200421
crossref_primary_10_1128_spectrum_02351_22
crossref_primary_10_1093_molbev_msab329
crossref_primary_10_1016_j_isci_2022_103844
crossref_primary_10_1093_femsml_uqac005
crossref_primary_10_3389_fmolb_2020_00187
crossref_primary_10_1093_dnares_dsab007
crossref_primary_10_1074_jbc_RA119_012161
crossref_primary_10_1186_s12862_018_1134_0
Cites_doi 10.1186/1471-2105-11-131
10.1093/bioinformatics/btq093
10.1002/prot.340190108
10.1186/1471-2164-15-353
10.1111/j.1751-7915.2011.00261.x
10.1093/dnares/8.1.11
10.7554/eLife.09977
10.1261/rna.053561.115
10.1073/pnas.1508380112
10.1186/1471-2105-10-421
10.1128/IAI.71.3.1505-1512.2003
10.1590/S1984-29612013000200052
10.1093/bioinformatics/16.10.944
10.1186/1471-2164-9-75
10.1073/pnas.1113521108
10.1128/genomeA.00821-14
10.1073/pnas.0905029106
10.1093/bioinformatics/btq010
10.1128/mBio.00024-10
10.1038/nmeth.1226
10.1371/journal.pone.0023191
10.1073/pnas.92.19.8793
10.1002/prot.25155
10.7554/eLife.03523
10.1186/s12866-016-0896-z
10.1186/1471-2164-12-332
10.1093/gbe/evt028
10.1126/science.aad9822
10.1093/nar/gkw343
10.1038/nrmicro787
10.1093/nar/gkw822
10.1146/annurev-biochem-070611-102400
10.1016/S0022-2836(05)80360-2
10.1534/g3.116.036939
10.1093/bioinformatics/btm349
10.1186/1471-2164-14-620
10.1038/nrm3359
10.1093/nar/gks1219
10.1186/1477-5956-11-47
10.1186/s13062-015-0104-3
10.1093/femsle/fnw262
10.1016/j.cell.2008.05.042
10.1016/j.mib.2013.01.009
10.1126/science.1142490
10.1371/journal.pgen.1000569
10.1186/1471-2164-12-361
10.1038/srep41425
10.1016/j.celrep.2014.05.023
10.1002/embj.201488411
10.7554/eLife.08890
10.1186/1471-2199-8-100
10.1016/j.fm.2013.03.002
10.4014/jmb.0908.08007
10.1038/nmeth.3688
10.1016/S0168-9525(00)02024-2
10.1038/nrneph.2012.196
10.1186/s12864-016-2456-1
10.7554/eLife.03528
10.1016/S1672-0229(10)60008-3
10.1515/hsz-2014-0213
10.1093/bioinformatics/btt184
10.1186/s13104-016-1900-2
10.1073/pnas.1613055113
10.1093/bioinformatics/btq243
10.1101/gr.092759.109
10.1128/JB.184.20.5733-5745.2002
10.1093/gbe/evr099
10.1128/IAI.02928-14
10.1038/nchembio.1120
10.1021/acs.jproteome.6b00569
10.1093/nar/gku396
10.1093/nar/gkl266
10.1101/gr.4759706
10.1093/bioinformatics/btm009
10.1093/bioinformatics/bts611
10.1093/nar/gku671
10.1093/bioinformatics/btp616
10.1038/nature08756
10.1093/nar/gkh377
10.1186/1741-7007-12-4
10.1093/nar/gku366
10.1093/nar/gkw944
10.1186/s12864-017-3586-9
10.1128/JB.01368-13
10.7717/peerj.2584
10.1038/ncomms11605
10.1016/j.celrep.2015.12.073
10.1186/gm264
10.1111/j.1365-2958.2008.06495.x
10.1038/nrmicro3316
10.1126/science.1168978
10.1016/j.celrep.2014.07.045
10.1073/pnas.1309739110
ContentType Journal Article
Copyright 2017 Hücker et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
2017 Hücker et al 2017 Hücker et al
Copyright_xml – notice: 2017 Hücker et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
– notice: 2017 Hücker et al 2017 Hücker et al
DBID AAYXX
CITATION
CGR
CUY
CVF
ECM
EIF
NPM
3V.
7QG
7QL
7QO
7RV
7SN
7SS
7T5
7TG
7TM
7U9
7X2
7X7
7XB
88E
8AO
8C1
8FD
8FE
8FG
8FH
8FI
8FJ
8FK
ABJCF
ABUWG
AEUYN
AFKRA
ARAPS
ATCPS
AZQEC
BBNVY
BENPR
BGLVJ
BHPHI
C1K
CCPQU
D1I
DWQXO
FR3
FYUFA
GHDGH
GNUQQ
H94
HCIFZ
K9.
KB.
KB0
KL.
L6V
LK8
M0K
M0S
M1P
M7N
M7P
M7S
NAPCQ
P5Z
P62
P64
PATMY
PDBOC
PHGZM
PHGZT
PIMPY
PJZUB
PKEHL
PPXIY
PQEST
PQGLB
PQQKQ
PQUKI
PRINS
PTHSS
PYCSY
RC3
7X8
5PM
ADTOC
UNPAY
DOA
DOI 10.1371/journal.pone.0184119
DatabaseName CrossRef
Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
ProQuest Central (Corporate)
Animal Behavior Abstracts
Bacteriology Abstracts (Microbiology B)
Biotechnology Research Abstracts
Nursing & Allied Health Database
Ecology Abstracts
Entomology Abstracts (Full archive)
Immunology Abstracts
Meteorological & Geoastrophysical Abstracts
Nucleic Acids Abstracts
Virology and AIDS Abstracts
Agricultural Science Collection
Health & Medical Collection
ProQuest Central (purchase pre-March 2016)
Medical Database (Alumni Edition)
ProQuest Pharma Collection
Public Health Database
Technology Research Database
ProQuest SciTech Collection
ProQuest Technology Collection
ProQuest Natural Science Journals
Hospital Premium Collection
Hospital Premium Collection (Alumni Edition)
ProQuest Central (Alumni) (purchase pre-March 2016)
Materials Science & Engineering Collection
ProQuest Central (Alumni)
ProQuest One Sustainability (subscription)
ProQuest Central UK/Ireland
Advanced Technologies & Computer Science Collection
Agricultural & Environmental Science Collection
ProQuest Central Essentials
Biological Science Collection
ProQuest Central
Technology Collection
Natural Science Collection
Environmental Sciences and Pollution Management
ProQuest One Community College
ProQuest Materials Science Collection
ProQuest Central
Engineering Research Database
Health Research Premium Collection
Health Research Premium Collection (Alumni)
ProQuest Central Student
AIDS and Cancer Research Abstracts
SciTech Premium Collection
ProQuest Health & Medical Complete (Alumni)
Materials Science Database
Nursing & Allied Health Database (Alumni Edition)
Meteorological & Geoastrophysical Abstracts - Academic
ProQuest Engineering Collection
Biological Sciences
Agricultural Science Database
Health & Medical Collection (Alumni Edition)
Medical Database
Algology Mycology and Protozoology Abstracts (Microbiology C)
Biological Science Database
Engineering Database
Nursing & Allied Health Premium
Advanced Technologies & Aerospace Database
ProQuest Advanced Technologies & Aerospace Collection
Biotechnology and BioEngineering Abstracts
Environmental Science Database (subscripiton)
Materials Science Collection
ProQuest Central Premium
ProQuest One Academic
Publicly Available Content Database
ProQuest Health & Medical Research Collection
ProQuest One Academic Middle East (New)
ProQuest One Health & Nursing
ProQuest One Academic Eastern Edition (DO NOT USE)
ProQuest One Applied & Life Sciences
ProQuest One Academic
ProQuest One Academic UKI Edition
ProQuest Central China
Engineering Collection
Environmental Science Collection
Genetics Abstracts
MEDLINE - Academic
PubMed Central (Full Participant titles)
Unpaywall for CDI: Periodical Content
Unpaywall
DOAJ Directory of Open Access Journals
DatabaseTitle CrossRef
MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
Agricultural Science Database
Publicly Available Content Database
ProQuest Central Student
ProQuest Advanced Technologies & Aerospace Collection
ProQuest Central Essentials
Nucleic Acids Abstracts
SciTech Premium Collection
ProQuest Central China
Environmental Sciences and Pollution Management
ProQuest One Applied & Life Sciences
ProQuest One Sustainability
Health Research Premium Collection
Meteorological & Geoastrophysical Abstracts
Natural Science Collection
Health & Medical Research Collection
Biological Science Collection
ProQuest Central (New)
ProQuest Medical Library (Alumni)
Engineering Collection
Advanced Technologies & Aerospace Collection
Engineering Database
Virology and AIDS Abstracts
ProQuest Biological Science Collection
ProQuest One Academic Eastern Edition
Agricultural Science Collection
ProQuest Hospital Collection
ProQuest Technology Collection
Health Research Premium Collection (Alumni)
Biological Science Database
Ecology Abstracts
ProQuest Hospital Collection (Alumni)
Biotechnology and BioEngineering Abstracts
Environmental Science Collection
Entomology Abstracts
Nursing & Allied Health Premium
ProQuest Health & Medical Complete
ProQuest One Academic UKI Edition
Environmental Science Database
ProQuest Nursing & Allied Health Source (Alumni)
Engineering Research Database
ProQuest One Academic
Meteorological & Geoastrophysical Abstracts - Academic
ProQuest One Academic (New)
Technology Collection
Technology Research Database
ProQuest One Academic Middle East (New)
Materials Science Collection
ProQuest Health & Medical Complete (Alumni)
ProQuest Central (Alumni Edition)
ProQuest One Community College
ProQuest One Health & Nursing
ProQuest Natural Science Collection
ProQuest Pharma Collection
ProQuest Central
ProQuest Health & Medical Research Collection
Genetics Abstracts
ProQuest Engineering Collection
Biotechnology Research Abstracts
Health and Medicine Complete (Alumni Edition)
ProQuest Central Korea
Bacteriology Abstracts (Microbiology B)
Algology Mycology and Protozoology Abstracts (Microbiology C)
Agricultural & Environmental Science Collection
AIDS and Cancer Research Abstracts
Materials Science Database
ProQuest Materials Science Collection
ProQuest Public Health
ProQuest Nursing & Allied Health Source
ProQuest SciTech Collection
Advanced Technologies & Aerospace Database
ProQuest Medical Library
Animal Behavior Abstracts
Materials Science & Engineering Collection
Immunology Abstracts
ProQuest Central (Alumni)
MEDLINE - Academic
DatabaseTitleList
MEDLINE - Academic


MEDLINE
Agricultural Science Database
Database_xml – sequence: 1
  dbid: DOA
  name: DOAJ Directory of Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
– sequence: 2
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 3
  dbid: EIF
  name: MEDLINE
  url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search
  sourceTypes: Index Database
– sequence: 4
  dbid: UNPAY
  name: Unpaywall
  url: https://proxy.k.utb.cz/login?url=https://unpaywall.org/
  sourceTypes: Open Access Repository
– sequence: 5
  dbid: 8FG
  name: ProQuest Technology Collection
  url: https://search.proquest.com/technologycollection1
  sourceTypes: Aggregation Database
DeliveryMethod fulltext_linktorsrc
Discipline Sciences (General)
Ecology
DocumentTitleAlternate Novel genes in EHEC
EISSN 1932-6203
ExternalDocumentID 1938527122
oai_doaj_org_article_de8d8e618cb94d2c9e5e46c71d3f8fd3
10.1371/journal.pone.0184119
PMC5597208
28902868
10_1371_journal_pone_0184119
Genre Journal Article
GeographicLocations Germany
GeographicLocations_xml – name: Germany
GrantInformation_xml – fundername: ;
  grantid: Gerstner Scholars Fellowship
– fundername: ;
  grantid: SCHE316/3-2
– fundername: ;
  grantid: Alexander von Humboldt foundation
– fundername: ;
  grantid: KE740/13-2
GroupedDBID ---
123
29O
2WC
53G
5VS
7RV
7X2
7X7
7XC
88E
8AO
8C1
8CJ
8FE
8FG
8FH
8FI
8FJ
A8Z
AAFWJ
AAUCC
AAWOE
AAYXX
ABDBF
ABIVO
ABJCF
ABUWG
ACGFO
ACIHN
ACIWK
ACPRK
ACUHS
ADBBV
ADRAZ
AEAQA
AENEX
AEUYN
AFKRA
AFPKN
AFRAH
AHMBA
ALMA_UNASSIGNED_HOLDINGS
AOIJS
APEBS
ARAPS
ATCPS
BAWUL
BBNVY
BCNDV
BENPR
BGLVJ
BHPHI
BKEYQ
BPHCQ
BVXVI
BWKFM
CCPQU
CITATION
CS3
D1I
D1J
D1K
DIK
DU5
E3Z
EAP
EAS
EBD
EMOBN
ESTFP
ESX
EX3
F5P
FPL
FYUFA
GROUPED_DOAJ
GX1
HCIFZ
HH5
HMCUK
HYE
IAO
IEA
IGS
IHR
IHW
INH
INR
IOV
IPY
ISE
ISR
ITC
K6-
KB.
KQ8
L6V
LK5
LK8
M0K
M1P
M48
M7P
M7R
M7S
M~E
NAPCQ
O5R
O5S
OK1
OVT
P2P
P62
PATMY
PDBOC
PHGZM
PHGZT
PIMPY
PJZUB
PPXIY
PQGLB
PQQKQ
PROAC
PSQYO
PTHSS
PUEGO
PV9
PYCSY
RNS
RPM
RZL
SV3
TR2
UKHRP
WOQ
WOW
~02
~KM
ALIPV
BBORY
CGR
CUY
CVF
ECM
EIF
IPNFZ
NPM
RIG
3V.
7QG
7QL
7QO
7SN
7SS
7T5
7TG
7TM
7U9
7XB
8FD
8FK
AZQEC
C1K
DWQXO
FR3
GNUQQ
H94
K9.
KL.
M7N
P64
PKEHL
PQEST
PQUKI
PRINS
RC3
7X8
5PM
ADTOC
UNPAY
-
02
AAPBV
ABPTK
ADACO
BBAFP
KM
ID FETCH-LOGICAL-c526t-61bcaef96fbcd1025ba6c3e0e59bff26b3851f5c716e7fe0b6c5f29f98aa64f3
IEDL.DBID M48
ISSN 1932-6203
IngestDate Fri Nov 26 17:14:35 EST 2021
Tue Oct 14 19:05:45 EDT 2025
Sun Oct 26 03:48:37 EDT 2025
Tue Sep 30 16:54:29 EDT 2025
Fri Sep 05 07:12:25 EDT 2025
Tue Oct 07 07:37:16 EDT 2025
Thu Apr 03 07:02:11 EDT 2025
Wed Oct 01 04:09:28 EDT 2025
Thu Apr 24 23:12:39 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 9
Language English
License This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
cc-by
Creative Commons Attribution License
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c526t-61bcaef96fbcd1025ba6c3e0e59bff26b3851f5c716e7fe0b6c5f29f98aa64f3
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
Competing Interests: The authors have declared that no competing interests exist.
ORCID 0000-0002-6020-2814
OpenAccessLink http://journals.scholarsportal.info/openUrl.xqy?doi=10.1371/journal.pone.0184119
PMID 28902868
PQID 1938527122
PQPubID 1436336
ParticipantIDs plos_journals_1938527122
doaj_primary_oai_doaj_org_article_de8d8e618cb94d2c9e5e46c71d3f8fd3
unpaywall_primary_10_1371_journal_pone_0184119
pubmedcentral_primary_oai_pubmedcentral_nih_gov_5597208
proquest_miscellaneous_1938858089
proquest_journals_1938527122
pubmed_primary_28902868
crossref_primary_10_1371_journal_pone_0184119
crossref_citationtrail_10_1371_journal_pone_0184119
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2017-09-13
PublicationDateYYYYMMDD 2017-09-13
PublicationDate_xml – month: 09
  year: 2017
  text: 2017-09-13
  day: 13
PublicationDecade 2010
PublicationPlace United States
PublicationPlace_xml – name: United States
– name: San Francisco
– name: San Francisco, CA USA
PublicationTitle PloS one
PublicationTitleAlternate PLoS One
PublicationYear 2017
Publisher Public Library of Science
Public Library of Science (PLoS)
Publisher_xml – name: Public Library of Science
– name: Public Library of Science (PLoS)
References G Yachdav (ref46) 2014; 42
AS Warren (ref16) 2010; 11
MD Robinson (ref44) 2009; 26
DF Browning (ref48) 2004; 2
JY Lim (ref2) 2010; 20
BG Castro (ref6) 2013; 22
J Kans (ref93) 2013
KS Wilson (ref49) 1995; 92
M Brylinski (ref19) 2013; 11
J Ruiz-Orera (ref36) 2014; 3
D Wang (ref87) 2010; 8
Y Jeong (ref57) 2016; 7
CM Sharma (ref23) 2010; 464
F Mohammad (ref60) 2016; 14
I Yomtovian (ref73) 2010; 26
J Carlevaro-Fita (ref56) 2016; 22
L Calviello (ref38) 2016; 13
G Storz (ref15) 2014; 83
S Xue (ref61) 2012; 13
J Marks (ref64) 2016; 113
R Neme (ref25) 2016; 5
Z Ji (ref55) 2015; 4
JY Hwang (ref66) 2017; 45
T Bitard-Feildel (ref21) 2017; 7
BL Flaherty (ref24) 2011; 12
SS Oheigeartaigh (ref75) 2014; 196
T Carver (ref81) 2010; 26
C Camacho (ref89) 2009; 10
NT Ingolia (ref35) 2014; 8
G Kemp (ref18) 2014; 395
YF Lin (ref28) 2013; 14
AA Bazzini (ref34) 2014; 33
E Kopylova (ref84) 2012; 28
N Perdigão (ref20) 2015; 112
A Ceroni (ref96) 2006; 34
A Schlessinger (ref47) 2007; 23
MR Hemm (ref10) 2008; 70
J Baek (ref40) 2017; 7
J Boekhorst (ref14) 2011; 4
H Zur (ref78) 2016; 6
AL Delcher (ref12) 2007; 23
SW Naylor (ref7) 2003; 71
H Latif (ref42) 2014; 2
BA Wilson (ref53) 2011; 3
PB O'Connor (ref59) 2013; 29
T Goldberg (ref97) 2014; 42
JL Aspden (ref54) 2014; 3
M Levitt (ref72) 2009; 106
SB Lewis (ref3) 2015; 83
J Ma (ref51) 2002; 184
DA Drummond (ref71) 2008; 134
D Graur (ref30) 2013; 5
C Quast (ref91) 2013; 41
E Afgan (ref80) 2016; 44
W Zhang (ref98) 2006; 16
M Bernhofer (ref95) 2016; 84
D Dar (ref77) 2016; 352
F Aeschimann (ref32) 2015
J Ma (ref4) 2011; 6
JC Dunning Hotopp (ref43) 2007; 317
M Schubert (ref83) 2016; 9
K Neuhaus (ref39) 2017; 18
D Baumgartner (ref67) 2016; 16
P Rice (ref90) 2000; 16
JE Dornenburg (ref27) 2010; 1
S Banerjee (ref69) 2006; 44
X Zheng (ref70) 2011; 12
PY Hsu (ref37) 2016
Z Hou (ref5) 2013; 35
JT Wade (ref29) 2014; 12
RK Aziz (ref13) 2008; 9
K Byrgazov (ref62) 2013; 16
TV Tatarinova (ref74) 2016; 11
SF Altschul (ref88) 1990; 215
I Lasa (ref26) 2011; 108
A Mortazavi (ref82) 2008; 5
K Neuhaus (ref11) 2016; 17
JE Smith (ref33) 2014; 7
AL Hughes (ref52) 1999
MV Gerashchenko (ref63) 2014; 42
K Rutherford (ref41) 2000; 16
M Krzywinski (ref99) 2009; 19
T Rognes (ref85) 2016; 4
L Cassidy (ref79) 2016; 15
V Vimberg (ref50) 2007; 8
T Junier (ref92) 2010; 26
SA Slavoff (ref17) 2013; 9
R Landstorfer (ref8) 2014; 15
MV Gerashchenko (ref65) 2017; 45
H Trachtman (ref9) 2012; 8
B Rost (ref45) 2004; 32
T Hayashi (ref1) 2001; 8
SM Hücker (ref76) 2017; 364
VV Solovyev (ref86) 2011; 3
NT Ingolia (ref31) 2009; 324
X Liu (ref58) 2013; 110
TT Perkins (ref22) 2009; 5
B Rost (ref94) 1994; 19
BK Cho (ref68) 2014; 12
24705786 - EMBO J. 2014 May 2;33(9):981-93
27791002 - Proc Natl Acad Sci U S A. 2016 Oct 25;113(43):12150-12155
19121005 - Mol Microbiol. 2008 Dec;70(6):1487-501
27856567 - FEMS Microbiol Lett. 2017 Jan;364(2)
20003500 - BMC Bioinformatics. 2009 Dec 15;10:421
18662548 - Cell. 2008 Jul 25;134(2):341-52
19541617 - Proc Natl Acad Sci U S A. 2009 Jul 7;106(27):11079-84
28245801 - BMC Genomics. 2017 Feb 28;18(1):216
25534942 - Infect Immun. 2015 Mar;83(3):942-9
18261238 - BMC Genomics. 2008 Feb 08;9:75
27894276 - BMC Microbiol. 2016 Nov 28;16(1):285
24799431 - Nucleic Acids Res. 2014 Jul;42(Web Server issue):W337-43
17709338 - Bioinformatics. 2007 Sep 15;23(18):2376-84
25056308 - Nucleic Acids Res. 2014;42(17):e134
21711558 - BMC Genomics. 2011 Jun 28;12:332
27781170 - PeerJ. 2016 Oct 18;4:e2584
18516045 - Nat Methods. 2008 Jul;5(7):621-8
27924019 - Nucleic Acids Res. 2017 Jan 9;45(1):327-336
17973990 - BMC Mol Biol. 2007 Oct 31;8:100
27251447 - Nat Commun. 2016 Jun 02;7:11605
23856725 - Rev Bras Parasitol Vet. 2013 Apr-Jun;22(2):318-21
26102273 - Methods. 2015 Sep 1;85:75-89
7568019 - Proc Natl Acad Sci U S A. 1995 Sep 12;92(19):8793-7
26868221 - BMC Res Notes. 2016 Feb 12;9:88
27120414 - Science. 2016 Apr 8;352(6282):aad9822
26898226 - Sci Rep. 2016 Feb 22;6:21635
25144939 - Elife. 2014 Aug 21;3:e03528
23071270 - Bioinformatics. 2012 Dec 15;28(24):3211-7
24606146 - Annu Rev Biochem. 2014;83:753-77
28122954 - G3 (Bethesda). 2017 Mar 10;7(3):983-989
17237039 - Bioinformatics. 2007 Mar 15;23(6):673-9
24461193 - BMC Biol. 2014 Jan 24;12:4
12270832 - J Bacteriol. 2002 Oct;184(20):5733-45
20472542 - Bioinformatics. 2010 Jul 1;26(13):1669-70
21867574 - Genome Med. 2011 Jul 27;3(7):48
25153378 - Biol Chem. 2014 Dec;395(12):1365-77
10827456 - Trends Genet. 2000 Jun;16(6):276-7
22617470 - Nat Rev Mol Cell Biol. 2012 May 23;13(6):355-69
24321360 - Proteome Sci. 2013 Dec 09;11(1):47
27090285 - RNA. 2016 Jun;22(6):867-82
26657557 - Nat Methods. 2016 Feb;13(2):165-70
21948395 - Genome Biol Evol. 2011;3:1245-52
21518296 - Microb Biotechnol. 2011 May;4(3):308-13
20230630 - BMC Bioinformatics. 2010 Mar 15;11:131
8066087 - Proteins. 1994 May;19(1):55-72
23431001 - Genome Biol Evol. 2013;5(3):578-90
2231712 - J Mol Biol. 1990 Oct 5;215(3):403-10
15035009 - Nat Rev Microbiol. 2004 Jan;2(1):57-65
23193283 - Nucleic Acids Res. 2013 Jan;41(Database issue):D590-6
24931603 - Cell Rep. 2014 Jun 26;7(6):1858-66
27566436 - Proteins. 2016 Nov;84(11):1706-1716
23603333 - Bioinformatics. 2013 Jun 15;29(12):1488-91
20689751 - MBio. 2010 May 18;1(1):null
25069631 - Nat Rev Microbiol. 2014 Sep;12 (9):647-53
19541911 - Genome Res. 2009 Sep;19(9):1639-45
17761848 - Science. 2007 Sep 21;317(5845):1753-6
21749696 - BMC Genomics. 2011 Jul 12;12:361
23160002 - Nat Chem Biol. 2013 Jan;9(1):59-64
26836309 - Elife. 2016 Feb 02;5:e09977
16844986 - Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W177-81
24034785 - BMC Genomics. 2013 Sep 14;14:620
23664265 - Food Microbiol. 2013 Sep;35(2):136-42
22123973 - Proc Natl Acad Sci U S A. 2011 Dec 13;108(50):20172-7
26747447 - Biol Direct. 2016 Jan 08;11(1):2
26911138 - BMC Genomics. 2016 Feb 24;17 :133
16554712 - J Microbiol. 2006 Feb;44(1):11-22
12595469 - Infect Immun. 2003 Mar;71(3):1505-12
20451164 - Genomics Proteomics Bioinformatics. 2010 Mar;8(1):77-80
21826238 - PLoS One. 2011;6(8):e23191
26687005 - Elife. 2015 Dec 19;4:e08890
16606700 - Genome Res. 2006 Jun;16(6):757-67
26578815 - Proc Natl Acad Sci U S A. 2015 Dec 29;112(52):15898-903
25125650 - Genome Announc. 2014 Aug 14;2(4):null
24659774 - J Bacteriol. 2014 Jun;196(11):2030-42
26776510 - Cell Rep. 2016 Feb 2;14 (4):686-94
23415603 - Curr Opin Microbiol. 2013 Apr;16(2):133-9
27638886 - Nucleic Acids Res. 2017 Jan 25;45(2):e6
20164839 - Nature. 2010 Mar 11;464(7286):250-5
20134227 - J Microbiol Biotechnol. 2010 Jan;20(1):5-14
19213877 - Science. 2009 Apr 10;324(5924):218-23
24885796 - BMC Genomics. 2014 May 09;15:353
11258796 - DNA Res. 2001 Feb 28;8(1):11-22
25159147 - Cell Rep. 2014 Sep 11;8(5):1365-79
25233276 - Elife. 2014 Sep 16;3:e03523
27791167 - Proc Natl Acad Sci U S A. 2016 Oct 21
19910308 - Bioinformatics. 2010 Jan 1;26(1):139-40
23812753 - Proc Natl Acad Sci U S A. 2013 Jul 16;110(29):11928-33
22986362 - Nat Rev Nephrol. 2012 Nov;8(11):658-69
15215403 - Nucleic Acids Res. 2004 Jul 1;32(Web Server issue):W321-6
11120685 - Bioinformatics. 2000 Oct;16(10):944-5
20071372 - Bioinformatics. 2010 Mar 1;26(5):676-7
28134276 - Sci Rep. 2017 Jan 30;7:41425
27557128 - J Proteome Res. 2016 Oct 7;15(10 ):3773-3783
27137889 - Nucleic Acids Res. 2016 Jul 8;44(W1):W3-W10
20231229 - Bioinformatics. 2010 Apr 15;26(8):996-9
19609351 - PLoS Genet. 2009 Jul;5(7):e1000569
24848019 - Nucleic Acids Res. 2014 Jul;42(Web Server issue):W350-5
References_xml – volume: 11
  start-page: 131
  year: 2010
  ident: ref16
  article-title: Missing genes in the annotation of prokaryotic genomes
  publication-title: BMC Bioinformatics
  doi: 10.1186/1471-2105-11-131
– volume: 26
  start-page: 996
  issue: 8
  year: 2010
  ident: ref73
  article-title: Composition bias and the origin of ORFan genes
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btq093
– volume: 19
  start-page: 55
  issue: 1
  year: 1994
  ident: ref94
  article-title: Combining evolutionary information and neural networks to predict protein secondary structure
  publication-title: Proteins
  doi: 10.1002/prot.340190108
– volume: 15
  start-page: 353
  year: 2014
  ident: ref8
  article-title: Comparison of strand-specific transcriptomes of enterohemorrhagic Escherichia coli O157:H7 EDL933 (EHEC) under eleven different environmental conditions including radish sprouts and cattle feces
  publication-title: BMC Genomics
  doi: 10.1186/1471-2164-15-353
– volume: 44
  start-page: 11
  issue: 1
  year: 2006
  ident: ref69
  article-title: Rho-dependent transcription termination: more questions than answers
  publication-title: J Microbiol
– volume: 4
  start-page: 308
  issue: 3
  year: 2011
  ident: ref14
  article-title: Searching in microbial genomes for encoded small proteins
  publication-title: Microb Biotechnol
  doi: 10.1111/j.1751-7915.2011.00261.x
– volume: 8
  start-page: 11
  issue: 1
  year: 2001
  ident: ref1
  article-title: Complete genome sequence of enterohemorrhagic Escherichia coli O157:H7 and genomic comparison with a laboratory strain K-12
  publication-title: DNA Res
  doi: 10.1093/dnares/8.1.11
– volume: 5
  start-page: e09977
  year: 2016
  ident: ref25
  article-title: Fast turnover of genome transcription across evolutionary time exposes entire non-coding DNA to de novo gene emergence
  publication-title: eLife
  doi: 10.7554/eLife.09977
– volume: 22
  start-page: 867
  issue: 6
  year: 2016
  ident: ref56
  article-title: Cytoplasmic long noncoding RNAs are frequently bound to and degraded at ribosomes in human cells
  publication-title: RNA
  doi: 10.1261/rna.053561.115
– volume: 112
  start-page: 15898
  issue: 52
  year: 2015
  ident: ref20
  article-title: Unexpected features of the dark proteome
  publication-title: Proc Natl Acad Sci U S A
  doi: 10.1073/pnas.1508380112
– volume: 10
  start-page: 421
  year: 2009
  ident: ref89
  article-title: BLAST+: architecture and applications
  publication-title: BMC Bioinformatics
  doi: 10.1186/1471-2105-10-421
– volume: 71
  start-page: 1505
  issue: 3
  year: 2003
  ident: ref7
  article-title: Lymphoid follicle-dense mucosa at the terminal rectum is the principal site of colonization of enterohemorrhagic Escherichia coli O157:H7 in the bovine host
  publication-title: Infect Immun
  doi: 10.1128/IAI.71.3.1505-1512.2003
– volume: 22
  start-page: 318
  issue: 2
  year: 2013
  ident: ref6
  article-title: Occurrence of Shiga-toxigenic Escherichia coli in Stomoxys calcitrans (Diptera: Muscidae)
  publication-title: Rev Bras Parasitol Vet
  doi: 10.1590/S1984-29612013000200052
– volume: 16
  start-page: 944
  issue: 10
  year: 2000
  ident: ref41
  article-title: Artemis: sequence visualization and annotation
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/16.10.944
– volume: 9
  start-page: 75
  year: 2008
  ident: ref13
  article-title: The RAST Server: rapid annotations using subsystems technology
  publication-title: BMC Genomics
  doi: 10.1186/1471-2164-9-75
– volume: 108
  start-page: 20172
  issue: 50
  year: 2011
  ident: ref26
  article-title: Genome-wide antisense transcription drives mRNA processing in bacteria
  publication-title: Proc Natl Acad Sci U S A
  doi: 10.1073/pnas.1113521108
– volume: 2
  start-page: e00821
  issue: 4
  year: 2014
  ident: ref42
  article-title: A gapless, unambiguous genome sequence of the enterohemorrhagic Escherichia coli O157: H7 strain EDL933
  publication-title: Genome Announc
  doi: 10.1128/genomeA.00821-14
– volume: 106
  start-page: 11079
  issue: 27
  year: 2009
  ident: ref72
  article-title: Nature of the protein universe
  publication-title: Proc Natl Acad Sci U S A
  doi: 10.1073/pnas.0905029106
– year: 2016
  ident: ref37
  article-title: Super-resolution ribosome profiling reveals unannotated translation events in Arabidopsis
  publication-title: Proc Natl Acad Sci U S A
– volume: 26
  start-page: 676
  issue: 5
  year: 2010
  ident: ref81
  article-title: BamView: viewing mapped read alignment data in the context of the reference sequence
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btq010
– volume: 1
  issue: 1
  year: 2010
  ident: ref27
  article-title: Widespread Antisense Transcription in Escherichia coli
  publication-title: mBio
  doi: 10.1128/mBio.00024-10
– volume: 5
  start-page: 621
  issue: 7
  year: 2008
  ident: ref82
  article-title: Mapping and quantifying mammalian transcriptomes by RNA-Seq
  publication-title: Nat Methods
  doi: 10.1038/nmeth.1226
– volume: 6
  start-page: e23191
  issue: 8
  year: 2011
  ident: ref4
  article-title: Persistence of Escherichia coli O157:H7 and its mutants in soils
  publication-title: PLoS One
  doi: 10.1371/journal.pone.0023191
– volume: 92
  start-page: 8793
  issue: 19
  year: 1995
  ident: ref49
  article-title: Transcription termination at intrinsic terminators: the role of the RNA hairpin
  publication-title: Proc Natl Acad Sci U S A
  doi: 10.1073/pnas.92.19.8793
– volume: 84
  start-page: 1706
  issue: 11
  year: 2016
  ident: ref95
  article-title: TMSEG: Novel prediction of transmembrane helices
  publication-title: Proteins
  doi: 10.1002/prot.25155
– volume: 3
  start-page: e03523
  year: 2014
  ident: ref36
  article-title: Long non-coding RNAs as a source of new peptides
  publication-title: eLife
  doi: 10.7554/eLife.03523
– volume: 16
  start-page: 285
  issue: 1
  year: 2016
  ident: ref67
  article-title: Small proteins in cyanobacteria provide a paradigm for the functional analysis of the bacterial micro-proteome
  publication-title: BMC Microbiol
  doi: 10.1186/s12866-016-0896-z
– volume: 12
  start-page: 332
  year: 2011
  ident: ref24
  article-title: Directional RNA deep sequencing sheds new light on the transcriptional response of Anabaena sp. strain PCC 7120 to combined-nitrogen deprivation
  publication-title: BMC Genomics
  doi: 10.1186/1471-2164-12-332
– volume: 5
  start-page: 578
  issue: 3
  year: 2013
  ident: ref30
  article-title: On the immortality of television sets: “function” in the human genome according to the evolution-free gospel of ENCODE
  publication-title: Genome Biol Evol
  doi: 10.1093/gbe/evt028
– volume: 352
  start-page: aad9822
  issue: 6282
  year: 2016
  ident: ref77
  article-title: Term-seq reveals abundant ribo-regulation of antibiotics resistance in bacteria
  publication-title: Science
  doi: 10.1126/science.aad9822
– volume: 44
  start-page: W3
  issue: W1
  year: 2016
  ident: ref80
  article-title: The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2016 update
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gkw343
– volume: 2
  start-page: 57
  issue: 1
  year: 2004
  ident: ref48
  article-title: The regulation of bacterial transcription initiation
  publication-title: Nat Rev Microbiol
  doi: 10.1038/nrmicro787
– volume: 45
  start-page: e6
  issue: 2
  year: 2017
  ident: ref65
  article-title: Ribonuclease selection for ribosome profiling
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gkw822
– volume: 83
  start-page: 753
  year: 2014
  ident: ref15
  article-title: Small proteins can no longer be ignored
  publication-title: Annu Rev Biochem
  doi: 10.1146/annurev-biochem-070611-102400
– volume: 215
  start-page: 403
  issue: 3
  year: 1990
  ident: ref88
  article-title: Basic local alignment search tool
  publication-title: J Mol Biol
  doi: 10.1016/S0022-2836(05)80360-2
– volume: 7
  start-page: 983
  issue: 3
  year: 2017
  ident: ref40
  article-title: Identification of Unannotated Small Genes in Salmonella
  publication-title: G3
  doi: 10.1534/g3.116.036939
– volume: 23
  start-page: 2376
  issue: 18
  year: 2007
  ident: ref47
  article-title: Natively unstructured regions in proteins identified from contact predictions
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btm349
– volume: 14
  start-page: 620
  year: 2013
  ident: ref28
  article-title: A combination of improved differential and global RNA-seq reveals pervasive transcription initiation and events in all stages of the life-cycle of functional RNAs in Propionibacterium acnes, a major contributor to wide-spread human disease
  publication-title: BMC Genomics
  doi: 10.1186/1471-2164-14-620
– volume: 13
  start-page: 355
  issue: 6
  year: 2012
  ident: ref61
  article-title: Specialized ribosomes: a new frontier in gene regulation and organismal biology
  publication-title: Nat Rev Mol Cell Biol
  doi: 10.1038/nrm3359
– volume: 41
  start-page: D590
  issue: D1
  year: 2013
  ident: ref91
  article-title: The SILVA ribosomal RNA gene database project: improved data processing and web-based tools
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gks1219
– volume: 11
  start-page: 47
  issue: 1
  year: 2013
  ident: ref19
  article-title: Exploring the "dark matter" of a mammalian proteome by protein structure and function modeling
  publication-title: Proteome Sci
  doi: 10.1186/1477-5956-11-47
– volume: 11
  start-page: 1
  issue: 1
  year: 2016
  ident: ref74
  article-title: The mysterious orphans of Mycoplasmataceae
  publication-title: Biol Direct
  doi: 10.1186/s13062-015-0104-3
– year: 2013
  ident: ref93
  article-title: Entrez Direct: E-utilities on the UNIX Command Line
– volume: 364
  issue: 2
  year: 2017
  ident: ref76
  article-title: Transcriptional and translational regulation by RNA thermometers, riboswitches and the sRNA DsrA in Escherichia coli O157:H7 Sakai under combined cold and osmotic stress adaptation
  publication-title: FEMS Microbiol Lett
  doi: 10.1093/femsle/fnw262
– volume: 134
  start-page: 341
  issue: 2
  year: 2008
  ident: ref71
  article-title: Mistranslation-induced protein misfolding as a dominant constraint on coding-sequence evolution
  publication-title: Cell
  doi: 10.1016/j.cell.2008.05.042
– volume: 16
  start-page: 133
  issue: 2
  year: 2013
  ident: ref62
  article-title: Ribosome heterogeneity: another level of complexity in bacterial translation regulation
  publication-title: Curr Opin Microbiol
  doi: 10.1016/j.mib.2013.01.009
– volume: 6
  year: 2016
  ident: ref78
  article-title: Complementary Post Transcriptional Regulatory Information is Detected by PUNCH-P and Ribosome Profiling
  publication-title: Sci Rep
– volume: 317
  start-page: 1753
  issue: 5845
  year: 2007
  ident: ref43
  article-title: Widespread lateral gene transfer from intracellular bacteria to multicellular eukaryotes
  publication-title: Science
  doi: 10.1126/science.1142490
– volume: 5
  start-page: e1000569
  issue: 7
  year: 2009
  ident: ref22
  article-title: A strand-specific RNA-Seq analysis of the transcriptome of the typhoid bacillus Salmonella typhi
  publication-title: PLoS Genet
  doi: 10.1371/journal.pgen.1000569
– volume: 12
  start-page: 361
  issue: 1
  year: 2011
  ident: ref70
  article-title: Leaderless genes in bacteria: clue to the evolution of translation initiation mechanisms in prokaryotes
  publication-title: BMC Genomics
  doi: 10.1186/1471-2164-12-361
– volume: 7
  start-page: 41425
  year: 2017
  ident: ref21
  article-title: Exploring the dark foldable proteome by considering hydrophobic amino acids topology
  publication-title: Scientific reports
  doi: 10.1038/srep41425
– volume: 7
  start-page: 1858
  issue: 6
  year: 2014
  ident: ref33
  article-title: Translation of small open reading frames within unannotated RNA transcripts in Saccharomyces cerevisiae
  publication-title: Cell Rep
  doi: 10.1016/j.celrep.2014.05.023
– volume: 33
  start-page: 981
  issue: 9
  year: 2014
  ident: ref34
  article-title: Identification of small ORFs in vertebrates using ribosome footprinting and evolutionary conservation
  publication-title: EMBO J
  doi: 10.1002/embj.201488411
– volume: 4
  start-page: e08890
  year: 2015
  ident: ref55
  article-title: Many lncRNAs, 5'UTRs, and pseudogenes are translated and some are likely to express functional proteins
  publication-title: eLife
  doi: 10.7554/eLife.08890
– volume: 8
  start-page: 100
  year: 2007
  ident: ref50
  article-title: Translation initiation region sequence preferences in Escherichia coli
  publication-title: BMC Mol Biol
  doi: 10.1186/1471-2199-8-100
– volume: 35
  start-page: 136
  issue: 2
  year: 2013
  ident: ref5
  article-title: Transcriptional and functional responses of Escherichia coli O157:H7 growing in the lettuce rhizoplane
  publication-title: Food microbiology
  doi: 10.1016/j.fm.2013.03.002
– volume: 20
  start-page: 5
  issue: 1
  year: 2010
  ident: ref2
  article-title: A brief overview of Escherichia coli O157:H7 and its plasmid O157
  publication-title: J Microbiol Biotechnol
  doi: 10.4014/jmb.0908.08007
– volume: 13
  start-page: 165
  issue: 2
  year: 2016
  ident: ref38
  article-title: Detecting actively translated open reading frames in ribosome profiling data
  publication-title: Nat Methods
  doi: 10.1038/nmeth.3688
– volume: 16
  start-page: 276
  issue: 6
  year: 2000
  ident: ref90
  article-title: EMBOSS: the European Molecular Biology Open Software Suite
  publication-title: Trends Genet
  doi: 10.1016/S0168-9525(00)02024-2
– volume: 8
  start-page: 658
  issue: 11
  year: 2012
  ident: ref9
  article-title: Renal and neurological involvement in typical Shiga toxin-associated HUS
  publication-title: Nat Rev Nephrol
  doi: 10.1038/nrneph.2012.196
– volume: 17
  start-page: 133
  year: 2016
  ident: ref11
  article-title: Translatomics combined with transcriptomics and proteomics reveals novel functional, recently evolved orphan genes in Escherichia coli O157:H7 (EHEC)
  publication-title: BMC Genomics
  doi: 10.1186/s12864-016-2456-1
– volume: 3
  start-page: e03528
  year: 2014
  ident: ref54
  article-title: Extensive translation of small Open Reading Frames revealed by Poly-Ribo-Seq
  publication-title: eLife
  doi: 10.7554/eLife.03528
– volume: 8
  start-page: 77
  issue: 1
  year: 2010
  ident: ref87
  article-title: KaKs_Calculator 2.0: a toolkit incorporating gamma-series methods and sliding window strategies
  publication-title: Genomics Proteomics Bioinformatics
  doi: 10.1016/S1672-0229(10)60008-3
– volume: 395
  start-page: 1365
  issue: 12
  year: 2014
  ident: ref18
  article-title: Small membrane proteins–elucidating the function of the needle in the haystack
  publication-title: Biol Chem
  doi: 10.1515/hsz-2014-0213
– volume: 29
  start-page: 1488
  issue: 12
  year: 2013
  ident: ref59
  article-title: rRNA:mRNA pairing alters the length and the symmetry of mRNA-protected fragments in ribosome profiling experiments
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btt184
– volume: 9
  start-page: 88
  year: 2016
  ident: ref83
  article-title: AdapterRemoval v2: rapid adapter trimming, identification, and read merging
  publication-title: BMC Res Notes
  doi: 10.1186/s13104-016-1900-2
– volume: 113
  start-page: 12150
  issue: 43
  year: 2016
  ident: ref64
  article-title: Context-specific inhibition of translation by ribosomal antibiotics targeting the peptidyl transferase center
  publication-title: Proc Natl Acad Sci U S A
  doi: 10.1073/pnas.1613055113
– volume: 26
  start-page: 1669
  issue: 13
  year: 2010
  ident: ref92
  article-title: The Newick utilities: high-throughput phylogenetic tree processing in the UNIX shell
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btq243
– volume: 19
  start-page: 1639
  issue: 9
  year: 2009
  ident: ref99
  article-title: Circos: an information aesthetic for comparative genomics
  publication-title: Genome Res
  doi: 10.1101/gr.092759.109
– volume: 184
  start-page: 5733
  issue: 20
  year: 2002
  ident: ref51
  article-title: Correlations between Shine-Dalgarno sequences and gene features such as predicted expression levels and operon structures
  publication-title: J Bacteriol
  doi: 10.1128/JB.184.20.5733-5745.2002
– volume: 3
  start-page: 1245
  year: 2011
  ident: ref53
  article-title: Putatively noncoding transcripts show extensive association with ribosomes
  publication-title: Genome Biol Evol
  doi: 10.1093/gbe/evr099
– volume: 83
  start-page: 942
  issue: 3
  year: 2015
  ident: ref3
  article-title: Enterohemorrhagic Escherichia coli colonization of human colonic epithelium in vitro and ex vivo
  publication-title: Infect Immun
  doi: 10.1128/IAI.02928-14
– volume: 9
  start-page: 59
  issue: 1
  year: 2013
  ident: ref17
  article-title: Peptidomic discovery of short open reading frame-encoded peptides in human cells
  publication-title: Nat Chem Biol
  doi: 10.1038/nchembio.1120
– year: 1999
  ident: ref52
  article-title: Adaptive Evolution of Genes and Genomes
– volume: 15
  start-page: 3773
  issue: 10
  year: 2016
  ident: ref79
  article-title: Combination of Bottom-up 2D-LC-MS and Semi-top-down GelFree-LC-MS Enhances Coverage of Proteome and Low Molecular Weight Short Open Reading Frame Encoded Peptides of the Archaeon Methanosarcina mazei
  publication-title: J Proteome Res
  doi: 10.1021/acs.jproteome.6b00569
– year: 2015
  ident: ref32
  article-title: Transcriptome-wide measurement of ribosomal occupancy by ribosome profiling
  publication-title: Methods
– volume: 42
  start-page: W350
  year: 2014
  ident: ref97
  article-title: LocTree3 prediction of localization
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gku396
– volume: 34
  start-page: W177
  issue: suppl 2
  year: 2006
  ident: ref96
  article-title: DISULFIND: a disulfide bonding state and cysteine connectivity prediction server
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gkl266
– volume: 16
  start-page: 757
  issue: 6
  year: 2006
  ident: ref98
  article-title: Probing genomic diversity and evolution of Escherichia coli O157 by single nucleotide polymorphisms
  publication-title: Genome Res
  doi: 10.1101/gr.4759706
– volume: 23
  start-page: 673
  issue: 6
  year: 2007
  ident: ref12
  article-title: Identifying bacterial genes and endosymbiont DNA with Glimmer
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btm009
– volume: 28
  start-page: 3211
  issue: 24
  year: 2012
  ident: ref84
  article-title: SortMeRNA: fast and accurate filtering of ribosomal RNAs in metatranscriptomic data
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/bts611
– volume: 42
  start-page: e134
  issue: 17
  year: 2014
  ident: ref63
  article-title: Translation inhibitors cause abnormalities in ribosome profiling experiments
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gku671
– volume: 26
  start-page: 139
  issue: 1
  year: 2009
  ident: ref44
  article-title: edgeR: a Bioconductor package for differential expression analysis of digital gene expression data
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btp616
– volume: 464
  start-page: 250
  issue: 7286
  year: 2010
  ident: ref23
  article-title: The primary transcriptome of the major human pathogen Helicobacter pylori
  publication-title: Nature
  doi: 10.1038/nature08756
– volume: 32
  start-page: W321
  issue: suppl 2
  year: 2004
  ident: ref45
  article-title: The predictprotein server
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gkh377
– volume: 12
  start-page: 4
  year: 2014
  ident: ref68
  article-title: Genome-scale reconstruction of the sigma factor network in Escherichia coli: topology and functional states
  publication-title: BMC Biol
  doi: 10.1186/1741-7007-12-4
– volume: 42
  start-page: W337
  year: 2014
  ident: ref46
  article-title: PredictProtein—an open resource for online prediction of protein structural and functional features
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gku366
– volume: 45
  start-page: 327
  issue: 1
  year: 2017
  ident: ref66
  article-title: A ribosome profiling study of mRNA cleavage by the endonuclease RelE
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gkw944
– volume: 18
  start-page: 216
  issue: 1
  year: 2017
  ident: ref39
  article-title: Differentiation of ncRNAs from small mRNAs in Escherichia coli O157:H7 EDL933 (EHEC) by combined RNAseq and RIBOseq—ryhB encodes the regulatory RNA RyhB and a peptide, RyhP
  publication-title: BMC Genomics
  doi: 10.1186/s12864-017-3586-9
– volume: 196
  start-page: 2030
  issue: 11
  year: 2014
  ident: ref75
  article-title: SearchDOGS bacteria, software that provides automated identification of potentially missed genes in annotated bacterial genomes
  publication-title: J Bacteriol
  doi: 10.1128/JB.01368-13
– volume: 4
  start-page: e2584
  year: 2016
  ident: ref85
  article-title: VSEARCH: a versatile open source tool for metagenomics
  publication-title: PeerJ
  doi: 10.7717/peerj.2584
– volume: 7
  start-page: 11605
  year: 2016
  ident: ref57
  article-title: The dynamic transcriptional and translational landscape of the model antibiotic producer Streptomyces coelicolor A3(2)
  publication-title: Nat Commun
  doi: 10.1038/ncomms11605
– volume: 14
  start-page: 686
  issue: 4
  year: 2016
  ident: ref60
  article-title: Clarifying the Translational Pausing Landscape in Bacteria by Ribosome Profiling
  publication-title: Cell Rep
  doi: 10.1016/j.celrep.2015.12.073
– volume: 3
  start-page: 48
  issue: 7
  year: 2011
  ident: ref86
  article-title: Towards the integration of genomics, epidemiological and clinical data
  publication-title: Genome Med
  doi: 10.1186/gm264
– volume: 70
  start-page: 1487
  issue: 6
  year: 2008
  ident: ref10
  article-title: Small membrane proteins found by comparative genomics and ribosome binding site models
  publication-title: Mol Microbiol
  doi: 10.1111/j.1365-2958.2008.06495.x
– volume: 12
  start-page: 647
  issue: 9
  year: 2014
  ident: ref29
  article-title: Pervasive transcription: illuminating the dark matter of bacterial transcriptomes
  publication-title: Nat Rev Microbiol
  doi: 10.1038/nrmicro3316
– volume: 324
  start-page: 218
  issue: 5924
  year: 2009
  ident: ref31
  article-title: Genome-wide analysis in vivo of translation with nucleotide resolution using ribosome profiling
  publication-title: Science
  doi: 10.1126/science.1168978
– volume: 8
  start-page: 1365
  issue: 5
  year: 2014
  ident: ref35
  article-title: Ribosome profiling reveals pervasive translation outside of annotated protein-coding genes
  publication-title: Cell Rep
  doi: 10.1016/j.celrep.2014.07.045
– volume: 110
  start-page: 11928
  issue: 29
  year: 2013
  ident: ref58
  article-title: High-resolution view of bacteriophage lambda gene expression by ribosome profiling
  publication-title: Proc Natl Acad Sci U S A
  doi: 10.1073/pnas.1309739110
– reference: 24799431 - Nucleic Acids Res. 2014 Jul;42(Web Server issue):W337-43
– reference: 25125650 - Genome Announc. 2014 Aug 14;2(4):null
– reference: 23431001 - Genome Biol Evol. 2013;5(3):578-90
– reference: 27791167 - Proc Natl Acad Sci U S A. 2016 Oct 21;:
– reference: 21867574 - Genome Med. 2011 Jul 27;3(7):48
– reference: 21749696 - BMC Genomics. 2011 Jul 12;12:361
– reference: 27781170 - PeerJ. 2016 Oct 18;4:e2584
– reference: 26898226 - Sci Rep. 2016 Feb 22;6:21635
– reference: 25534942 - Infect Immun. 2015 Mar;83(3):942-9
– reference: 23071270 - Bioinformatics. 2012 Dec 15;28(24):3211-7
– reference: 20164839 - Nature. 2010 Mar 11;464(7286):250-5
– reference: 26776510 - Cell Rep. 2016 Feb 2;14 (4):686-94
– reference: 24848019 - Nucleic Acids Res. 2014 Jul;42(Web Server issue):W350-5
– reference: 27894276 - BMC Microbiol. 2016 Nov 28;16(1):285
– reference: 22986362 - Nat Rev Nephrol. 2012 Nov;8(11):658-69
– reference: 27566436 - Proteins. 2016 Nov;84(11):1706-1716
– reference: 18261238 - BMC Genomics. 2008 Feb 08;9:75
– reference: 25069631 - Nat Rev Microbiol. 2014 Sep;12 (9):647-53
– reference: 23812753 - Proc Natl Acad Sci U S A. 2013 Jul 16;110(29):11928-33
– reference: 20071372 - Bioinformatics. 2010 Mar 1;26(5):676-7
– reference: 25233276 - Elife. 2014 Sep 16;3:e03523
– reference: 20231229 - Bioinformatics. 2010 Apr 15;26(8):996-9
– reference: 27856567 - FEMS Microbiol Lett. 2017 Jan;364(2):
– reference: 28245801 - BMC Genomics. 2017 Feb 28;18(1):216
– reference: 20003500 - BMC Bioinformatics. 2009 Dec 15;10:421
– reference: 15215403 - Nucleic Acids Res. 2004 Jul 1;32(Web Server issue):W321-6
– reference: 20230630 - BMC Bioinformatics. 2010 Mar 15;11:131
– reference: 24034785 - BMC Genomics. 2013 Sep 14;14:620
– reference: 24461193 - BMC Biol. 2014 Jan 24;12:4
– reference: 19213877 - Science. 2009 Apr 10;324(5924):218-23
– reference: 27557128 - J Proteome Res. 2016 Oct 7;15(10 ):3773-3783
– reference: 2231712 - J Mol Biol. 1990 Oct 5;215(3):403-10
– reference: 27924019 - Nucleic Acids Res. 2017 Jan 9;45(1):327-336
– reference: 24885796 - BMC Genomics. 2014 May 09;15:353
– reference: 12595469 - Infect Immun. 2003 Mar;71(3):1505-12
– reference: 27251447 - Nat Commun. 2016 Jun 02;7:11605
– reference: 27120414 - Science. 2016 Apr 8;352(6282):aad9822
– reference: 12270832 - J Bacteriol. 2002 Oct;184(20):5733-45
– reference: 25144939 - Elife. 2014 Aug 21;3:e03528
– reference: 22617470 - Nat Rev Mol Cell Biol. 2012 May 23;13(6):355-69
– reference: 23193283 - Nucleic Acids Res. 2013 Jan;41(Database issue):D590-6
– reference: 24705786 - EMBO J. 2014 May 2;33(9):981-93
– reference: 24659774 - J Bacteriol. 2014 Jun;196(11):2030-42
– reference: 21948395 - Genome Biol Evol. 2011;3:1245-52
– reference: 24321360 - Proteome Sci. 2013 Dec 09;11(1):47
– reference: 24606146 - Annu Rev Biochem. 2014;83:753-77
– reference: 27638886 - Nucleic Acids Res. 2017 Jan 25;45(2):e6
– reference: 27137889 - Nucleic Acids Res. 2016 Jul 8;44(W1):W3-W10
– reference: 24931603 - Cell Rep. 2014 Jun 26;7(6):1858-66
– reference: 17237039 - Bioinformatics. 2007 Mar 15;23(6):673-9
– reference: 18516045 - Nat Methods. 2008 Jul;5(7):621-8
– reference: 11258796 - DNA Res. 2001 Feb 28;8(1):11-22
– reference: 26747447 - Biol Direct. 2016 Jan 08;11(1):2
– reference: 19910308 - Bioinformatics. 2010 Jan 1;26(1):139-40
– reference: 26868221 - BMC Res Notes. 2016 Feb 12;9:88
– reference: 21711558 - BMC Genomics. 2011 Jun 28;12:332
– reference: 22123973 - Proc Natl Acad Sci U S A. 2011 Dec 13;108(50):20172-7
– reference: 19541911 - Genome Res. 2009 Sep;19(9):1639-45
– reference: 23856725 - Rev Bras Parasitol Vet. 2013 Apr-Jun;22(2):318-21
– reference: 25056308 - Nucleic Acids Res. 2014;42(17):e134
– reference: 17761848 - Science. 2007 Sep 21;317(5845):1753-6
– reference: 26102273 - Methods. 2015 Sep 1;85:75-89
– reference: 10827456 - Trends Genet. 2000 Jun;16(6):276-7
– reference: 27090285 - RNA. 2016 Jun;22(6):867-82
– reference: 21826238 - PLoS One. 2011;6(8):e23191
– reference: 19121005 - Mol Microbiol. 2008 Dec;70(6):1487-501
– reference: 21518296 - Microb Biotechnol. 2011 May;4(3):308-13
– reference: 16844986 - Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W177-81
– reference: 17709338 - Bioinformatics. 2007 Sep 15;23(18):2376-84
– reference: 28134276 - Sci Rep. 2017 Jan 30;7:41425
– reference: 28122954 - G3 (Bethesda). 2017 Mar 10;7(3):983-989
– reference: 16554712 - J Microbiol. 2006 Feb;44(1):11-22
– reference: 23603333 - Bioinformatics. 2013 Jun 15;29(12):1488-91
– reference: 7568019 - Proc Natl Acad Sci U S A. 1995 Sep 12;92(19):8793-7
– reference: 26836309 - Elife. 2016 Feb 02;5:e09977
– reference: 16606700 - Genome Res. 2006 Jun;16(6):757-67
– reference: 20134227 - J Microbiol Biotechnol. 2010 Jan;20(1):5-14
– reference: 15035009 - Nat Rev Microbiol. 2004 Jan;2(1):57-65
– reference: 19609351 - PLoS Genet. 2009 Jul;5(7):e1000569
– reference: 23415603 - Curr Opin Microbiol. 2013 Apr;16(2):133-9
– reference: 25159147 - Cell Rep. 2014 Sep 11;8(5):1365-79
– reference: 27791002 - Proc Natl Acad Sci U S A. 2016 Oct 25;113(43):12150-12155
– reference: 20472542 - Bioinformatics. 2010 Jul 1;26(13):1669-70
– reference: 25153378 - Biol Chem. 2014 Dec;395(12):1365-77
– reference: 8066087 - Proteins. 1994 May;19(1):55-72
– reference: 23664265 - Food Microbiol. 2013 Sep;35(2):136-42
– reference: 23160002 - Nat Chem Biol. 2013 Jan;9(1):59-64
– reference: 11120685 - Bioinformatics. 2000 Oct;16(10):944-5
– reference: 17973990 - BMC Mol Biol. 2007 Oct 31;8:100
– reference: 26657557 - Nat Methods. 2016 Feb;13(2):165-70
– reference: 19541617 - Proc Natl Acad Sci U S A. 2009 Jul 7;106(27):11079-84
– reference: 20451164 - Genomics Proteomics Bioinformatics. 2010 Mar;8(1):77-80
– reference: 26687005 - Elife. 2015 Dec 19;4:e08890
– reference: 20689751 - MBio. 2010 May 18;1(1):null
– reference: 18662548 - Cell. 2008 Jul 25;134(2):341-52
– reference: 26578815 - Proc Natl Acad Sci U S A. 2015 Dec 29;112(52):15898-903
– reference: 26911138 - BMC Genomics. 2016 Feb 24;17 :133
SSID ssj0053866
Score 2.3698976
Snippet In the past, short protein-coding genes were often disregarded by genome annotation pipelines. Transcriptome sequencing (RNAseq) signals outside of annotated...
SourceID plos
doaj
unpaywall
pubmedcentral
proquest
pubmed
crossref
SourceType Open Website
Open Access Repository
Aggregation Database
Index Database
Enrichment Source
StartPage e0184119
SubjectTerms Algorithms
Amino acids
Bacteria
Bioinformatics
Biological evolution
Biology and Life Sciences
Conserved Sequence
Deoxyribonucleic acid
DNA
DNA, Bacterial - genetics
DNA, Intergenic - genetics
E coli
Ecology
Escherichia coli
Escherichia coli O157 - genetics
Food contamination & poisoning
Frames
Gene expression
Gene sequencing
Genes
Genes, Bacterial
Genetic Association Studies
Genome, Bacterial
Genomes
Genomics
Growth conditions
High-Throughput Nucleotide Sequencing
Homology
Informatics
Learning algorithms
Low temperature
Machine learning
Neural networks
Non-coding RNA
Open reading frames
Open Reading Frames - genetics
Osmosis
Osmotic pressure
Pipelines
Protein structure
Proteins
Research and Analysis Methods
RNA, Bacterial - genetics
Secondary structure
Transcription
Transcriptome
SummonAdditionalLinks – databaseName: DOAJ Directory of Open Access Journals
  dbid: DOA
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1Lb9QwELZQL3BBlFcDBRmJAxyyTez4xY1HqxUScKBIvUW2Y0NE6l2Rrar--84k2dWuqFQOXDNOdjPzTfJNbH9DyOtoQhNisLkXyudVZX1ui-DyhklWBm619_hp4MtXOf9RfT4TZ1utvnBN2CgPPDruqAm60UGW2jtTNcybIEIlvSobHnVsBp3PQpt1MTU-gyGLpZw2ynFVHk1xmS0XKcwKKGpKVNbZehENev2ob9ot-pu45t9LJu9epKW9urRdt_U-OnlA7k9Ekr4fb2Cf3AnpIdmfUrWnbyY96bePSPrU9h5Xal7RRaQJN1xCuU8THOpofw7Xpj_xiUfbRIEOUlSQwC2ZrafYtgFgieeh5bjHELe4PJoCgFr6rRTq3VzR7_a3bfEqi_PwmJyeHJ9-nOdTmwWID5MrKB6dtyEaGZ1vgG8IZ6XnoQjCuBiZdBxYWRTgchlUDIWTXkRmotHWyiryJ2QvgV8PCNXRceOjFYVnlbNg5wVzXHqjhAFulRG-dnntJwly7ITR1cO8moJSZHRejYGqp0BlJN-ctRwlOG4Z_wGjuRmLAtrDAYBVPcGqvg1WGTlALKx_oK-B6GrBVMlYRg7X-LjZ_GpjhlTF-RebAsR1GKOFBrRm5OkIp82fHOZ7tdQZUTtA27mLXUtqfw1y4FgTQgpkZLaB5D_56dn_8NNzco8h0cGmGvyQ7K3-XIQXQNNW7uWQkdeZPUIZ
  priority: 102
  providerName: Directory of Open Access Journals
– databaseName: ProQuest Central
  dbid: BENPR
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3fb9MwELZGJwQviA3YCgMZiQd4yJbYseMgIcSgU4VEQTCkvUW2Y4-ILilLK9T_nrvUKauYgNecHce-75w7__iOkGc-d6XzTkdWZDZKU20jHTsTlUyyxHGtrMWlgQ8TOf6avj8TZ1tk0t-FwWOV_ZzYTdRlY3GN_AgcDSVYljD2evYjwqxRuLvap9DQIbVC-aqjGLtBthkyYw3I9vFo8ulzPzeDdUsZLtDxLDkK-jqcNbU7jCHYSZBx58oPquPxR97TadNe54P-eZTy1qKe6eVPPZ1e-U-d3CV3goNJ36wQsUO2XL1Lbo46curlLtkJxtzS54Fx-sU9Ur-rWotnOZe08bTGK5nNoqU1PJrS9gJaoec4J9KqpuAwUuSYwEublaWY2AGAi_VQMmoRBBUeoKbQZEU_JiJ7Oc7oF_1dV_iW5sLdJ6cno9O34ygkYgANMjmH8NJY7XwuvbEleCTCaGm5i53IjfdMGlBN4oWF2Mtl3sVGWuFZ7nOltUw9f0AGNYzwPqHKG55br0VsWWo0yHnMDJc2z0QO3teQ8H7wCxtIyjFXxrTodt4yCFZWw1igyoqgsiGJ1rVmK5KOf5Q_Rr2uyyLFdveguTwvgsUWpVOlcjJR1uRpyWzuhEsl9LHkXvmSD8k-oqJvoC1-I3RIDnqkXC9-uhaDMeMOja4d6LUro4SKFXzi3gpY64_sdoSVVEOSbUBuoxebkrr61hGGY9TIYqh5uAbnf43Tw7_34xG5zdDJwYQa_IAM5pcL9xhctLl5EuzuFzHRQZg
  priority: 102
  providerName: ProQuest
– databaseName: Unpaywall
  dbid: UNPAY
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1Lb9QwELbK9gAXoDzahYKMxAEOSRM7dhxuBVqtkChItFI5oMh2bIi6za7IrlA58NuZSZwVC0XAgVuU8Tj2eOzMeDyfCXnsC1c573RkRW6jLNM20okzUcUkSx3XylrcGnh9JCcn2atTcbpBPgy5MEGC4CNOZ20XyceHWeP2giT3EK-oj57GKc_TgSOeQ6E4AYclhTnYIQ7hztgCE5CukE0pwFQfkc2To7f77_tIM4skS3hIp_tdTWu_qw7VH1FQoUmXWaS_Hqy8umzm-uKLnk5_-Gsd3iDfhv72h1XO4uXCxPbrT1CQ_00gN8n1YO_S_b6WLbLhmltkK6woLX0SYK-f3ibNy7q1eKD0gs48bTAvdLZsaQOvprQ9h87Rj7gw07qhYLVSBLrAzNHaUrxdAmYP8iHloEVNrPEUNwU9r-mbVOTPJjl9p890jbXMzt0dcnx4cPxiEoXbIECNmFyAj2usdr6Q3tgKzCJhtLTcJU4UxnsmDQfj0QsLDqDLvUuMtMKzwhdKa5l5fpeMGpDIDqHKG15Yr0ViWWY00HnCDJe2yEUBJuCY8GHMSxuQ0vHCjmnZhf9y8Jh64ZUo4jKIeEyiFde8Rwr5Q_nnqE6rsojz3b2AMS7D2JaVU5VyMlXWFFnFbOGEyyT0seJe-YqPyQ6qw_CBtgS9VoLlKWNjsjso6OXkRysyrCgYJtKNg3HtyiihEgVN3O71edXILiytpBqTfE3T13qxTmnqTx1qObquLAHOeDUn_kpO9_6V4T65xtD2wns--C4ZLT4v3QOwHBfmYZj_3wGRinfk
  priority: 102
  providerName: Unpaywall
Title Discovery of numerous novel small genes in the intergenic regions of the Escherichia coli O157:H7 Sakai genome
URI https://www.ncbi.nlm.nih.gov/pubmed/28902868
https://www.proquest.com/docview/1938527122
https://www.proquest.com/docview/1938858089
https://pubmed.ncbi.nlm.nih.gov/PMC5597208
https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0184119&type=printable
https://doaj.org/article/de8d8e618cb94d2c9e5e46c71d3f8fd3
http://dx.doi.org/10.1371/journal.pone.0184119
UnpaywallVersion publishedVersion
Volume 12
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVFSB
  databaseName: Free Full-Text Journals in Chemistry
  customDbUrl:
  eissn: 1932-6203
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0053866
  issn: 1932-6203
  databaseCode: HH5
  dateStart: 20060101
  isFulltext: true
  titleUrlDefault: http://abc-chemistry.org/
  providerName: ABC ChemistRy
– providerCode: PRVAFT
  databaseName: Open Access Digital Library
  customDbUrl:
  eissn: 1932-6203
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0053866
  issn: 1932-6203
  databaseCode: KQ8
  dateStart: 20060101
  isFulltext: true
  titleUrlDefault: http://grweb.coalliance.org/oadl/oadl.html
  providerName: Colorado Alliance of Research Libraries
– providerCode: PRVAFT
  databaseName: Open Access Digital Library
  customDbUrl:
  eissn: 1932-6203
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0053866
  issn: 1932-6203
  databaseCode: KQ8
  dateStart: 20061001
  isFulltext: true
  titleUrlDefault: http://grweb.coalliance.org/oadl/oadl.html
  providerName: Colorado Alliance of Research Libraries
– providerCode: PRVAON
  databaseName: DOAJ Directory of Open Access Journals
  customDbUrl:
  eissn: 1932-6203
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0053866
  issn: 1932-6203
  databaseCode: DOA
  dateStart: 20060101
  isFulltext: true
  titleUrlDefault: https://www.doaj.org/
  providerName: Directory of Open Access Journals
– providerCode: PRVEBS
  databaseName: EBSCOhost Academic Search Ultimate
  customDbUrl: https://search.ebscohost.com/login.aspx?authtype=ip,shib&custid=s3936755&profile=ehost&defaultdb=asn
  eissn: 1932-6203
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0053866
  issn: 1932-6203
  databaseCode: ABDBF
  dateStart: 20080101
  isFulltext: true
  titleUrlDefault: https://search.ebscohost.com/direct.asp?db=asn
  providerName: EBSCOhost
– providerCode: PRVEBS
  databaseName: EBSCOhost Food Science Source
  customDbUrl:
  eissn: 1932-6203
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0053866
  issn: 1932-6203
  databaseCode: A8Z
  dateStart: 20080101
  isFulltext: true
  titleUrlDefault: https://search.ebscohost.com/login.aspx?authtype=ip,uid&profile=ehost&defaultdb=fsr
  providerName: EBSCOhost
– providerCode: PRVBFR
  databaseName: Free Medical Journals
  customDbUrl:
  eissn: 1932-6203
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0053866
  issn: 1932-6203
  databaseCode: DIK
  dateStart: 20060101
  isFulltext: true
  titleUrlDefault: http://www.freemedicaljournals.com
  providerName: Flying Publisher
– providerCode: PRVFQY
  databaseName: GFMER Free Medical Journals
  customDbUrl:
  eissn: 1932-6203
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0053866
  issn: 1932-6203
  databaseCode: GX1
  dateStart: 20060101
  isFulltext: true
  titleUrlDefault: http://www.gfmer.ch/Medical_journals/Free_medical.php
  providerName: Geneva Foundation for Medical Education and Research
– providerCode: PRVHPJ
  databaseName: ROAD: Directory of Open Access Scholarly Resources
  customDbUrl:
  eissn: 1932-6203
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0053866
  issn: 1932-6203
  databaseCode: M~E
  dateStart: 20060101
  isFulltext: true
  titleUrlDefault: https://road.issn.org
  providerName: ISSN International Centre
– providerCode: PRVAQN
  databaseName: PubMed Central
  customDbUrl:
  eissn: 1932-6203
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0053866
  issn: 1932-6203
  databaseCode: RPM
  dateStart: 20060101
  isFulltext: true
  titleUrlDefault: https://www.ncbi.nlm.nih.gov/pmc/
  providerName: National Library of Medicine
– providerCode: PRVPQU
  databaseName: ProQuest - Health and Medical
  customDbUrl:
  eissn: 1932-6203
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0053866
  issn: 1932-6203
  databaseCode: 7X7
  dateStart: 20061201
  isFulltext: true
  titleUrlDefault: https://search.proquest.com/healthcomplete
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: ProQuest Central
  customDbUrl: http://www.proquest.com/pqcentral?accountid=15518
  eissn: 1932-6203
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0053866
  issn: 1932-6203
  databaseCode: BENPR
  dateStart: 20061201
  isFulltext: true
  titleUrlDefault: https://www.proquest.com/central
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: ProQuest Technology Collection
  customDbUrl:
  eissn: 1932-6203
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0053866
  issn: 1932-6203
  databaseCode: 8FG
  dateStart: 20061201
  isFulltext: true
  titleUrlDefault: https://search.proquest.com/technologycollection1
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: Public Health Database
  customDbUrl:
  eissn: 1932-6203
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0053866
  issn: 1932-6203
  databaseCode: 8C1
  dateStart: 20061201
  isFulltext: true
  titleUrlDefault: https://search.proquest.com/publichealth
  providerName: ProQuest
– providerCode: PRVFZP
  databaseName: Scholars Portal Journals: Open Access
  customDbUrl:
  eissn: 1932-6203
  dateEnd: 20250930
  omitProxy: true
  ssIdentifier: ssj0053866
  issn: 1932-6203
  databaseCode: M48
  dateStart: 20061201
  isFulltext: true
  titleUrlDefault: http://journals.scholarsportal.info
  providerName: Scholars Portal
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3db9MwELfGJgQviA3YCqMyEg_w0CpxYsdBQmgbLRXSygSrVJ4i27FHRJaUphX0v-cuTSMqNsFLHny-Or6P-i62f0fISxfb1DqreoZHpheGyvSUZ3UvZYL5NlDSGPw0cD4Wo0n4ccqnO2RTs7URYHVjaof1pCbzvP_rx-odOPzbumpD5G-Y-rOysH0PUhYfcUD3YK2KsZjDedjuK4B3C9FcoLuNc2uBqnH8Efc0L6ubYtC_j1LeWxYztfqp8vyPdWr4kDxoAkx6sraIfbJjiwNyd1CDU68OyH7jzBV91SBOv35EivdZZfAs54qWjhZ4JbNcVrSAppxW1zAKvcL_RJoVFAJGihgTeGkzMxQLO4DhIh9SBhUaQYYHqCkMmdFPPo_ejCL6RX1XGf5KeW0fk8vh4PJs1GsKMYAGmVhAeqmNsi4WTpsUIhKulTCB9SyPtXNM6ADiNscN5F42ctbTwnDHYhdLpUTogidktwAJHxEqnQ5i4xT3DAu1AnrgMR0IE0c8huirQ4KN8BPTgJRjrYw8qXfeIkhW1mJMUGVJo7IO6bVcszVIxz_6n6Je274IsV03lPOrpPHYJLUylVb40ug4TJmJLbehgDmmgZMuDTrkCK1iM0CVQCgsOYt8xjrkeGMpN5NftGRwZtyhUYUFvdZ9JJeehFc8XBtW-5L1jrAUskOiLZPbmsU2pci-1YDhmDUyDzj7rXH-l5ye3j7FZ-Q-wwAHi2kEx2R3MV_a5xCeLXSX3ImmETzlmY_P4Ycu2TsdjC8-d-sPHt3aI6FtMr44-fobQcpF9A
linkProvider Scholars Portal
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1Lb9QwELZKESoXRMujCwWMBBIcsk2c2HGQEAK61ZY-OLBIe4tsxy4R22RpdlXtj-I_MpNNQldUwKVXj9_zeTzjxwwhL1xiM-us8gyPjRdFynjKt9rLmGCBDZU0Bo8Gjk_E8Gv0aczHa-Rn-xcGn1W2MrEW1Flp8Ix8FxQNyVkcMPZu-sPDqFF4u9qG0FjC4tAuLsBkq94e7AF_XzK2Pxh9HHpNVAHoDhMzsJW0UdYlwmmTwfbKtRImtL7liXaOCQ3tBI4bMCRs7KyvheGOJS6RSonIhVDtDXIzCkGUwPKJx519B6JDiOZ3XhgHuw0Y-tOysH0fLKkA3flc2v3qIAHoVHVSVlcpuH--09yYF1O1uFCTyaVNcP8uudNor_T9Em6bZM0WW-TWoPZ8vdgim42kqOirxp3163uk2Msrgw9FF7R0tMD_nuW8ogUkTWh1Bq3QUxS4NC8oaKMUHVjgj9DcUIwaAasCyyFlUCHCcnydTaHJnH4OePxmGNMv6rvKsZbyzN4no-vgxwOyXsAMbxMqnQ4T4xT3DYu0AnroMx0Kk8Q8AdWuR8J28lPTeEDHQByTtL7Wi8ESWk5jiixLG5b1iNeVmi49gPwj_wfka5cX_XfXCeX5adqIgzSzMpNWBNLoJMqYSSy3kYAxZqGTLgt7ZBtR0TZQpb_h3yM7LVKuJj_vyCAp8PpHFRb4WueRXPoSuvhwCayuk_V1sxSyR-IVyK2MYpVS5N9qb-RokjIfSvY7cP7XPD36-ziekY3h6PgoPTo4OXxMbjPUpjByR7hD1mfnc_sEdMGZflqvQErSa17xvwAY5nmZ
linkToPdf http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtR1db9Mw0BpDfLwgNj5WGGAkkOAhbWInjoOEENBWHYOBxCb1LbIde0R0SVlaTf1p_Dvu0qSsYgJe9urz9334zj7fEfLMJTazzirPRLHxwlAZT_lWexkTLLBcSWPwauDTgRgdhR_G0XiD_Gz_wqBbZSsTa0GdlQbvyHugaMiIxQFjPde4RXzpD99Mf3iYQQpfWtt0GksS2beLMzDfqtd7fcD1c8aGg8P3I6_JMABTY2IGdpM2yrpEOG0yOGojrYTh1rdRop1jQsOYgYsMGBU2dtbXwkSOJS6RSonQcej2Crkac56gN2E8Xtl6IEaEaH7q8TjoNYTRnZaF7fpgVQUY2ufcSVgnDMAAq5OyukjZ_dNn88a8mKrFmZpMzh2Iw9vkVqPJ0rdL0tsiG7bYJtcGdRTsxTbZaqRGRV80oa1f3iFFP68MOo0uaOlogX8_y3lFCyia0OoERqHHKHxpXlDQTCkGs8DfobmhmEECOATbIWRQIbXl6KlNYcicfg6i-NUopl_Vd5VjL-WJvUsOLwMf98hmATu8Q6h0mifGqcg3LNQK4NxnmguTxFECal6H8HbzU9NEQ8ekHJO0fuKLwSpabmOKKEsblHWIt2o1XUYD-Uf9d4jXVV2M5V0XlKfHaSMa0szKTFoRSKOTMGMmsZENBawx4066jHfIDlJFO0CV_maFDtltKeVi8NMVGKQGPgWpwgJe6zoykr6EKd5fEtZqkvXTsxSyQ-I1kltbxTqkyL_VkcnRPGU-tOyuiPO_9unB39fxhFwHXk8_7h3sPyQ3GSpWmMSD75LN2encPgK1cKYf1wxISXrJDP8LlaF93A
linkToUnpaywall http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1Lb9QwELbK9gAXoDzahYKMxAEOSRM7dhxuBVqtkChItFI5oMh2bIi6za7IrlA58NuZSZwVC0XAgVuU8Tj2eOzMeDyfCXnsC1c573RkRW6jLNM20okzUcUkSx3XylrcGnh9JCcn2atTcbpBPgy5MEGC4CNOZ20XyceHWeP2giT3EK-oj57GKc_TgSOeQ6E4AYclhTnYIQ7hztgCE5CukE0pwFQfkc2To7f77_tIM4skS3hIp_tdTWu_qw7VH1FQoUmXWaS_Hqy8umzm-uKLnk5_-Gsd3iDfhv72h1XO4uXCxPbrT1CQ_00gN8n1YO_S_b6WLbLhmltkK6woLX0SYK-f3ibNy7q1eKD0gs48bTAvdLZsaQOvprQ9h87Rj7gw07qhYLVSBLrAzNHaUrxdAmYP8iHloEVNrPEUNwU9r-mbVOTPJjl9p890jbXMzt0dcnx4cPxiEoXbIECNmFyAj2usdr6Q3tgKzCJhtLTcJU4UxnsmDQfj0QsLDqDLvUuMtMKzwhdKa5l5fpeMGpDIDqHKG15Yr0ViWWY00HnCDJe2yEUBJuCY8GHMSxuQ0vHCjmnZhf9y8Jh64ZUo4jKIeEyiFde8Rwr5Q_nnqE6rsojz3b2AMS7D2JaVU5VyMlXWFFnFbOGEyyT0seJe-YqPyQ6qw_CBtgS9VoLlKWNjsjso6OXkRysyrCgYJtKNg3HtyiihEgVN3O71edXILiytpBqTfE3T13qxTmnqTx1qObquLAHOeDUn_kpO9_6V4T65xtD2wns--C4ZLT4v3QOwHBfmYZj_3wGRinfk
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Discovery+of+numerous+novel+small+genes+in+the+intergenic+regions+of+the+Escherichia+coli+O157%3AH7+Sakai+genome&rft.jtitle=PloS+one&rft.au=H%C3%BCcker%2C+Sarah&rft.au=Ardern%2C+Zachary&rft.au=Goldberg%2C+Tatyana&rft.au=Schafferhans%2C+Andrea&rft.date=2017-09-13&rft.pub=Public+Library+of+Science&rft.eissn=1932-6203&rft.volume=12&rft.issue=9&rft_id=info:doi/10.1371%2Fjournal.pone.0184119&rft.externalDocID=1938527122
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1932-6203&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1932-6203&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1932-6203&client=summon