High Resolution Discrimination of Clinical Mycobacterium tuberculosis Complex Strains Based on Single Nucleotide Polymorphisms
Recently, the diversity of the Mycobacterium tuberculosis complex (MTBC) population structure has been described in detail. Based on geographical separation and specific host pathogen co-evolution shaping MTBC virulence traits, at least 20 major lineages/genotypes have evolved finally leading to a c...
Saved in:
| Published in | PloS one Vol. 7; no. 7; p. e39855 |
|---|---|
| Main Authors | , , , , , , |
| Format | Journal Article |
| Language | English |
| Published |
United States
Public Library of Science
02.07.2012
Public Library of Science (PLoS) |
| Subjects | |
| Online Access | Get full text |
| ISSN | 1932-6203 1932-6203 |
| DOI | 10.1371/journal.pone.0039855 |
Cover
| Summary: | Recently, the diversity of the Mycobacterium tuberculosis complex (MTBC) population structure has been described in detail. Based on geographical separation and specific host pathogen co-evolution shaping MTBC virulence traits, at least 20 major lineages/genotypes have evolved finally leading to a clear influence of strain genetic background on transmissibility, clinical presentation/outcome, and resistance development. Therefore, high resolution genotyping for characterization of strains in larger studies is mandatory for understanding mechanisms of host-pathogen-interaction and to improve tuberculosis (TB) control. Single nucleotide polymorphisms (SNPs) represent the most reliable markers for lineage classification of clinical isolates due to the low levels of homoplasy, however their use is hampered either by low discriminatory power or by the need to analyze a large number of genes to achieve higher resolution. Therefore, we carried out de novo sequencing of 26 genes (approx. 20000 bp per strain) in a reference collection of MTBC strains including all major genotypes to define a highly discriminatory gene set. Overall, 161 polymorphisms were detected of which 59 are genotype-specific, while 13 define deeper branches such as the Euro-American lineage. Unbiased investigation of the most variable set of 11 genes in a population based strain collection (one year, city of Hamburg, Germany) confirmed the validity of SNP analysis as all strains were classified with high accuracy. Taken together, we defined a diagnostic algorithm which allows the identification of 17 MTBC phylogenetic lineages with high confidence for the first time by sequencing analysis of just five genes. In conclusion, the diagnostic algorithm developed in our study is likely to open the door for a low cost high resolution sequence/SNP based differentiation of the MTBC with a very high specificity. High throughput assays can be established which will be needed for large association studies that are mandatory for detailed investigation of host-pathogen-interaction during TB infection. |
|---|---|
| Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23 ObjectType-Article-2 ObjectType-Feature-1 Conceived and designed the experiments: SH MP SF TU SN. Performed the experiments: SH MP TU. Analyzed the data: SH MP SF TU RD UN SN. Contributed reagents/materials/analysis tools: RD UN. Wrote the paper: SH MP SF TU RD UN SN. |
| ISSN: | 1932-6203 1932-6203 |
| DOI: | 10.1371/journal.pone.0039855 |