Guillaume Sapriel, Roland Brosch.
Abstract
Tuberculosis remains one of the deadliest infectious diseases of humanity. To better understand the evolutionary history of host-adaptation of tubercle bacilli (MTB), we searched for mycobacterial species that are more closely related to MTB than the previously used comparator species Mycobacterium marinum and Mycobacterium kansasii. Our phylogenomic approach revealed that some recently sequenced opportunistic mycobacterial pathogens, Mycobacterium decipiens, Mycobacterium lacus, Mycobacterium riyadhense, and Mycobacterium shinjukuense, constitute a common clade with MTB, hereafter called the MTB-associated phylotype (MTBAP), from which MTB emerged. Multivariate and clustering analyses of genomic functional content revealed that the MTBAP lineage forms a clearly distinct cluster of species that share common genomic characteristics, such as loss of core genes, a shift in dN/dS ratios, and massive expansion of toxin-antitoxin systems. Consistently, analysis of predicted horizontal gene transfer regions suggests that putative functions acquired by MTBAP members were markedly associated with changes in microbial ecology, for example adaptation to intracellular stress. Our study thus considerably deepens our view of MTB evolutionary history, unveiling a decisive shift that promoted conversion to host-adaptation among ancestral founders of the MTBAP lineage long before Mycobacterium tuberculosis adapted to the human host.
Keywords: evolutionary transition; host adaption; mycobacteria; pathogen evolution; pathogenomic; tuberculosis
Year: 2019 PMID: 31368488 PMCID: PMC6736058 DOI: 10.1093/gbe/evz162
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Pairwise Genomic Distances of MTB and Closely Related SGM Species
| ANI [% Aligned] | Mtub | Mcan | Mdec | Mshi | Mlac | Mriy | Mkan | Mmar | Mszu | Mgor |
|---|---|---|---|---|---|---|---|---|---|---|
| Mtub | — |  |  |  |  |  | 79.63 [65.42] | 78.57 [64.75] | 79.28 [64.01] | 77.70 [62.28] |
| Mcan |  | — |  |  |  |  | 79.69 [64.68] | 78.57 [63.34] | 79.28 [63.35] | 77.57 [60.75] |
| Mdec |  |  | — |  |  |  | 79.45 [66.00] | 78.57 [66.67] | 79.32 [65.69] | 77.55 [62.72] |
| Mshi |  |  |  | — |  |  | 80.68 [67.36] | 79.37 [65.38] | 80.35 [66.50] | 78.79 [64.74] |
| Mlac |  |  |  |  | — |  | 80.46 [67.77] | 79.06 [65.30] | 80.49 [67.68] | 78.59 [64.42] |
| Mriy |  |  |  |  |  | — | 79.13 [58.92] | 77.90 [56.69] | 80.93 [64.48] | 77.56 [58.40] |
| Mkan | 78.62 [47.78] | 78.63 [48.36] | 79.07 [54.05] | 79.65 [49.10] | 79.70 [53.48] | 79.13 [56.38] | — | 79.07 [59.76] | 78.92 [58.72] | 77.54 [57.73] |
| Mmar | 77.31 [49.41] | 77.35 [49.69] | 77.98 [57.08] | 78.31 [48.95] | 78.04 [53.76] | 77.59 [56.56] | 78.82 [62.67] | — | 77.37 [59.18] | 76.53 [58.75] |
| Mszu | 77.82 [47.14] | 77.84 [47.64] | 78.53 [55.11] | 78.99 [48.53] | 79.25 [54.46] | 80.39 [62.95] | 78.35 [60.67] | 77.24 [57.97] | — | 77.79 [61.73] |
| Mgor | 76.20 [40.69] | 76.23 [40.77] | 76.76 [45.98] | 77.29 [41.74] | 77.22 [45.49] | 76.96 [50.24] | 76.99 [51.96] | 76.16 [50.79] | 77.60 [54.55] | — |
Notes.—ANI values were calculated from genome-to-genome BLAST-based comparisons. Within brackets: aligned genome percentage.
Bold: higher ANI values with MTBC, as compared with previously studied reference outgroups M. kansasii and M. marinum.
Mtub, M. tuberculosis H37Rv; Mcan, M. canettii STB-K; Mdec, M. decipiens; Mshi, M. shinjukuense; Mlac, M. lacus; Mriy, M. riyadhense; Mkan, M. kansasii; Mmar, M. marinum E11; Mszu, M. szulgai; Mgor, M. gordonae.
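Each filled cell of the table above packs two numbers: the ANI and, in brackets, the aligned genome percentage. The following is a minimal sketch of how such cells could be parsed for downstream comparison. The helper name `parse_ani_cell` is hypothetical; the two example cells are copied from the Mkan and Mmar rows, and they show that the two comparison directions of a species pair need not give identical values.

```python
import re

# Pattern for a cell such as "79.63 [65.42]": ANI, then aligned percentage in brackets.
_CELL = re.compile(r"\s*(\d+(?:\.\d+)?)\s*\[\s*(\d+(?:\.\d+)?)\s*\]\s*")


def parse_ani_cell(cell):
    """Split a table cell into (ani_percent, aligned_percent) as floats.

    Hypothetical helper for illustration; raises ValueError for cells that
    carry no value, such as the diagonal marker.
    """
    match = _CELL.fullmatch(cell)
    if match is None:
        raise ValueError(f"unrecognized ANI cell: {cell!r}")
    return float(match.group(1)), float(match.group(2))


# Values copied from the table above.
mkan_vs_mmar = parse_ani_cell("79.07 [59.76]")  # row Mkan, column Mmar
mmar_vs_mkan = parse_ani_cell("78.82 [62.67]")  # row Mmar, column Mkan

# Absolute ANI difference between the two directions of the same pair.
ani_gap = abs(mkan_vs_mmar[0] - mmar_vs_mkan[0])
```

Because the matrix is not symmetric, any comparison across species should state which row and column were read.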
Fig. 1.—Phylogenetic organization of MTB and closely related SGM species. (A) SGM phylogenetic tree. For tree construction, concatenated conserved protein sequences from 107 universally conserved bacterial genes, as defined by the bcgTree software, were extracted from 28 mycobacterial species. For data analysis, a similarity matrix was calculated using the JTT_DCmut+I+G model. The phylogenetic tree was constructed using ML estimations. Bootstrap values were calculated from 500 replicates. Red branches: members of the MTBAP lineage. (B) Genetic distance of MTBAP lineage and MGS–MKM outgroup members relative to M. tuberculosis H37Rv. dS distribution of 923 core-genome genes compared with those of M. tuberculosis H37Rv. Synonymous mutation rates (dS) for each gene were determined using the PAML algorithm. y-Axis: logarithmic scale. Mcan, M. canettii STB-K; Mdec, M. decipiens; Mshi, M. shinjukuense; Mlac, M. lacus; Mriy, M. riyadhense; Mkan, M. kansasii; Mmar, M. marinum E11; Mszu, M. szulgai; Mgor, M. gordonae. Bold bar: median. Box edges: 25th and 75th percentiles. Whiskers: extreme values.
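The box-plot elements named in panel B (median, 25th and 75th percentiles, extreme values) can be computed as sketched below. This is an illustration under stated assumptions: the dS values are invented toy numbers, not data from the study, the function name `boxplot_summary` is hypothetical, and the "inclusive" percentile convention is one of several possible choices.

```python
import math
from statistics import quantiles


def boxplot_summary(values):
    """Return the five statistics drawn in a box plot whose whiskers reach the extremes."""
    ordered = sorted(values)
    # Quartile cut points; the interpolation method is an assumption.
    q1, median, q3 = quantiles(ordered, n=4, method="inclusive")
    return {"min": ordered[0], "q1": q1, "median": median, "q3": q3, "max": ordered[-1]}


# Invented per-gene dS values, for illustration only (not taken from the paper).
toy_ds = [0.5, 1.0, 2.0, 4.0, 8.0]
summary = boxplot_summary(toy_ds)

# Position of each statistic on a base-10 logarithmic y-axis.
log_positions = {name: math.log10(value) for name, value in summary.items()}
```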
EGGNOG Functional Category Counts of Annotated Genes in Genomes of MTB and Closely Related Mycobacterial Species
| Function | Mtub | Mcan | Mdec | Mshi | Mlac | Mriy | Mkan | Mmar | Mszu | Mgor |
|---|---|---|---|---|---|---|---|---|---|---|
| Energy production and conversion | 212 | 214 | 293 | 230 | 282 | 322 | 347 | 322 | 366 | 379 |
| Cell cycle control, cell division, chromosome partitioning | 45 | 45 | 35 | 44 | 41 | 49 | 38 | 32 | 42 | 57 |
| Amino acid transport and metabolism | 197 | 196 | 243 | 187 | 210 | 224 | 236 | 265 | 242 | 242 |
| Nucleotide transport and metabolism | 74 | 77 | 77 | 72 | 80 | 82 | 80 | 79 | 74 | 75 |
| Carbohydrate transport and metabolism | 147 | 141 | 175 | 166 | 170 | 197 | 202 | 193 | 232 | 240 |
| Coenzyme transport and metabolism | 136 | 140 | 148 | 115 | 147 | 173 | 158 | 162 | 171 | 157 |
| Lipid transport and metabolism | 213 | 210 | 273 | 216 | 265 | 289 | 311 | 318 | 313 | 386 |
| Translation, ribosomal structure, and biogenesis | 149 | 147 | 150 | 139 | 140 | 150 | 141 | 154 | 146 | 151 |
| Transcription | 227 | 235 | 281 | 221 | 273 | 356 | 306 | 325 | 356 | 429 |
| Replication, recombination, and repair | 233 | 257 | 176 | 176 | 188 | 262 | 248 | 192 | 152 | 328 |
| Cell wall/membrane/envelope biogenesis | 124 | 124 | 152 | 127 | 123 | 152 | 168 | 149 | 152 | 175 |
| Cell motility | 35 | 37 | 36 | 40 | 43 | 45 | 47 | 42 | 52 | 50 |
| Post-translational modification, protein turnover, chaperones | 103 | 108 | 106 | 115 | 115 | 126 | 142 | 136 | 132 | 167 |
| Inorganic ion transport and metabolism | 138 | 139 | 164 | 139 | 151 | 170 | 197 | 202 | 204 | 235 |
| Secondary metabolites biosynthesis, transport, and catabolism | 155 | 150 | 292 | 145 | 217 | 273 | 263 | 303 | 351 | 342 |
| Signal transduction mechanisms | 110 | 105 | 131 | 109 | 111 | 147 | 182 | 146 | 186 | 240 |
| Intracellular trafficking, secretion, and vesicular transport | 22 | 21 | 22 | 22 | 24 | 28 | 22 | 20 | 23 | 28 |
| Defense mechanisms | 59 | 59 | 62 | 61 | 64 | 67 | 77 | 59 | 62 | 87 |
Note.—Number of genes in each EGGNOG category.
Values lower than the MGS–MKM outgroup by more than two standard deviations.
Values lower than the MGS–MKM outgroup by more than one standard deviation.
Mtub, M. tuberculosis H37Rv; Mcan, M. canettii STB-K; Mdec, M. decipiens; Mshi, M. shinjukuense; Mlac, M. lacus; Mriy, M. riyadhense; Mkan, M. kansasii; Mmar, M. marinum E11; Mszu, M. szulgai; Mgor, M. gordonae.
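The standard-deviation comparison described in the notes can be sketched as follows, using the "Energy production and conversion" row of the table. Several points are assumptions rather than the authors' stated procedure: the outgroup is taken to be Mkan, Mmar, Mszu, and Mgor, the reference is the outgroup mean, and the sample standard deviation is used. The function name `flag_depleted` is hypothetical.

```python
from statistics import mean, stdev

# Assumed composition of the MGS-MKM outgroup.
OUTGROUP = ("Mkan", "Mmar", "Mszu", "Mgor")

# Counts copied from the "Energy production and conversion" row of the table.
energy = {
    "Mtub": 212, "Mcan": 214, "Mdec": 293, "Mshi": 230, "Mlac": 282,
    "Mriy": 322, "Mkan": 347, "Mmar": 322, "Mszu": 366, "Mgor": 379,
}


def flag_depleted(counts, outgroup=OUTGROUP):
    """Label each non-outgroup species by how far its count falls below the outgroup mean."""
    reference = [counts[species] for species in outgroup]
    mu = mean(reference)
    sd = stdev(reference)  # sample standard deviation: an assumption
    flags = {}
    for species, n in counts.items():
        if species in outgroup:
            continue
        if n < mu - 2 * sd:
            flags[species] = "below 2 SD"
        elif n < mu - sd:
            flags[species] = "below 1 SD"
        else:
            flags[species] = "within 1 SD"
    return flags


flags = flag_depleted(energy)
```

With only four outgroup genomes the standard deviation is a rough estimate, so flags near a threshold should be read with caution.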
Fig. 2.—Genomic evolution in MTBAP lineage. (A) Color-coded table representing a heatmap for which the rows and columns were sorted by hierarchical clustering approaches. The row tree shows the clustering of MICFAM protein families based on species profile similarity calculated by the Ward agglomerative hierarchical clustering algorithm. The column tree represents the clustering of species based on MICFAM profile. Red: under-represented gene families. Green: over-represented gene families. Color-key for Row Z-scores is shown together with a histogram indicating the number of MICFAM protein families associated with each of the Z-scores. (B) Estimated gene gain and loss in the MTBAP lineage and the MGS–MKM outgroup. Variable genome parts of the MTBAP lineage and the MGS–MKM outgroup were computed using the MICFAM tool with a 50% amino-acid identity threshold and 80% alignment coverage. A table representing presence or absence of gene families for each species was then analyzed by Gain Loss Mapping Engine. Red: branches showing more gene loss than gene gain. (C) dN/dS distribution of core-genome orthologs as compared with the M. gordonae outgroup. Bold bars indicate the median dN/dS values for each species. Notch estimates correspond to 95% confidence intervals for median values. Box edges represent 25th and 75th percentiles. Whiskers represent estimated extreme values. Mcan, M. canettii STB-K; Mdec, M. decipiens; Mshi, M. shinjukuense; Mlac, M. lacus; Mriy, M. riyadhense; Mkan, M. kansasii; Mmar, M. marinum E11; Mszu, M. szulgai; Mgor, M. gordonae.
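The row Z-scores that color the heatmap in panel A can be illustrated with a short sketch. It assumes that each protein family (row) is standardized across species with the sample standard deviation, which is a common heatmap convention but is not confirmed by the legend. The function name and the toy counts are hypothetical.

```python
from statistics import mean, stdev


def row_z_scores(row):
    """Standardize one family's counts across species: (x - row mean) / row SD."""
    if len(row) < 2:
        return [0.0] * len(row)
    mu = mean(row)
    sd = stdev(row)  # sample standard deviation: an assumption
    if sd == 0:
        # A family with identical counts everywhere carries no signal.
        return [0.0] * len(row)
    return [(x - mu) / sd for x in row]


# Invented counts of one protein family in three genomes, for illustration only.
toy_family = [1, 2, 3]
z = row_z_scores(toy_family)
# Negative scores would be drawn as under-represented, positive as over-represented.
```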
Fig. 3.—Multivariate and clustering analyses of PFAM domain contributions to the MTBAP lineage. (A) PCA of species from the MTBAP lineage and the MGS–MKM outgroup, based on occurrence of PFAM domains in each genome (only PFAM domains present in more than two species were retained). Ellipse: 95% confidence value ellipse of MTBAP lineage members. (B) PFAM domain contribution to MTBAP lineage. x-Axis: PFAM domain scores determined by BCA (MTBAP lineage vs. MGS–MKM outgroup). y-Axis: PFAM domain differences in average occurrence for each class (MTBAP lineage vs. MGS–MKM outgroup). Red: toxin-associated domains. Green: antitoxin-associated domains.
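Two steps in this legend lend themselves to a small sketch: keeping only domains present in more than two species, and computing the difference in average occurrence between the two classes (the y-axis of panel B). The code below is an illustration under assumptions: the function names, domain labels, species labels, and counts are all invented, and the PCA and BCA steps themselves are not reproduced.

```python
def filter_domains(occurrence, min_species=3):
    """Keep domains found (count > 0) in at least `min_species` genomes,
    that is, in more than two species when min_species is 3."""
    kept = {}
    for domain, counts in occurrence.items():
        present_in = sum(1 for c in counts.values() if c > 0)
        if present_in >= min_species:
            kept[domain] = counts
    return kept


def mean_difference(counts, group_a, group_b):
    """Average occurrence in group_a minus average occurrence in group_b."""
    avg_a = sum(counts.get(s, 0) for s in group_a) / len(group_a)
    avg_b = sum(counts.get(s, 0) for s in group_b) / len(group_b)
    return avg_a - avg_b


# Invented domain counts per genome, for illustration only.
toy_occurrence = {
    "domain_A": {"sp1": 40, "sp2": 30, "sp3": 2, "sp4": 4},
    "domain_B": {"sp1": 1, "sp2": 0, "sp3": 0, "sp4": 1},
}
lineage, outgroup = ("sp1", "sp2"), ("sp3", "sp4")

kept = filter_domains(toy_occurrence)
differences = {d: mean_difference(c, lineage, outgroup) for d, c in kept.items()}
```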
Fig. 4.—Toxin–antitoxin content of mycobacterial species in the MTBAP lineage and other SGM. (A) Number of putative toxin genes bearing a PIN or MazF domain in SGM. Toxins from toxin–antitoxin systems were identified from whole proteome data sets, using HMMER (PF02452.16—PemK_toxin; PF01850.20—PIN domain) with an E-value threshold of 0.01. Orange: VapC. Blue: MazF. **: Values more than two standard deviations above the SGM average (outside the MTBAP lineage and MGS–MKM outgroup). (B) Putative orthologs of M. tuberculosis VapC and MazF toxins. Yellow: BBH with 50% translated sequence identity and 80% query cover. Red: BBH and synteny. Mcan, M. canettii STB-K; Mdec, M. decipiens; Mshi, M. shinjukuense; Mlac, M. lacus; Mriy, M. riyadhense; Mkan, M. kansasii; Mmar, M. marinum E11; Mszu, M. szulgai; Mgor, M. gordonae.
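The bidirectional best hit (BBH) criterion in panel B can be sketched as follows. This is a generic illustration of the technique, not the authors' pipeline: it assumes that hits are ranked by bit score and that the 50% identity and 80% query cover thresholds are applied in both search directions. The function names, the tuple layout, and the sequence identifiers are hypothetical.

```python
def best_hits(hits, min_identity=50.0, min_query_cover=80.0):
    """Return the best-scoring subject per query among hits passing both thresholds.

    `hits` is an iterable of (query, subject, identity, query_cover, bitscore).
    """
    best = {}
    for query, subject, identity, query_cover, bitscore in hits:
        if identity < min_identity or query_cover < min_query_cover:
            continue
        if query not in best or bitscore > best[query][1]:
            best[query] = (subject, bitscore)
    return {query: subject for query, (subject, _) in best.items()}


def bidirectional_best_hits(forward, reverse, **thresholds):
    """Keep pairs where each sequence is the other's best hit."""
    fwd = best_hits(forward, **thresholds)
    rev = best_hits(reverse, **thresholds)
    return {q: s for q, s in fwd.items() if rev.get(s) == q}


# Invented hit tables, for illustration only.
forward = [
    ("tox1", "orthA", 72.0, 95.0, 210.0),
    ("tox1", "orthB", 55.0, 90.0, 120.0),
    ("tox2", "orthC", 45.0, 99.0, 80.0),  # fails the identity threshold
]
reverse = [
    ("orthA", "tox1", 72.0, 93.0, 208.0),
    ("orthB", "tox1", 55.0, 88.0, 118.0),
]
bbh = bidirectional_best_hits(forward, reverse)
```

Synteny confirmation, the stricter category in the figure, would need gene-order information and is outside this sketch.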
Mycobacterium tuberculosis Virulence Genes Probably Acquired before MTB Speciation, and within MTBAP Lineage Evolution
| Label | Gene | Product | Function | Phenotype | Intracellular Survival | Animal Infection Model | Mcan | Mdec | Mshi | Mlac | Mriy | SGM |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Rv0064 | — | Membrane protein | — | — | DC | — | ++ | ++ | − | − | + | − |
| Rv0071 | — | — | — | Deleted in L2/Beijing lineage | M | — | − | + | − | − | − | − |
| Rv0240 |  | Toxin–antitoxin | Regulation | Induced under lysosomal stress conditions | — | Macaque | − | − | − | + | − | − |
| Rv0348 |  | Transcriptional regulator | Regulation | Hypoxia responsive regulator | — | Mouse | − | − | ++ | − | − | − |
| Rv0590 |  | Transporter | Transport (lipids) | Belongs to mce2 operon, involved in sulfolipid accumulation | — | Mouse | ++ | ++ | − | − | − | − |
| Rv0890c | — | Transcriptional regulator | Regulation | — | DC | — | ++ | ++ | − | − | − | − |
| Rv0893c | — | SAM methyltransferase | Secondary metabolism | — | M | — | ++ | ++ | − | − | − | − |
| Rv0895 | — | DAG-O-acyltransferase | Secondary metabolism (lipids) | — | M | — | ++ | ++ | − | − | + | − |
| Rv0977 |  | PE_PPE | Secreted and surface proteins | — | DC | — | ++ | − | ++ | − | ++ | − |
| Rv1288 | — | Putative mycolyltransferase II | Secondary metabolism (lipids) | — | DC | — | ++ | ++ | − | − | + | − |
| Rv1359 | — | Transcriptional regulator | Regulation | — | DC | — | ++ | + | − | − | − | − |
| Rv1442 |  | Biotin sulfoxide reductase | Electron transfer activity | Oxidative stress resistance (putative) | DC | — | ++ | ++ | ++ (p) | + | ++ (p) | − |
| Rv1552 |  | Fumarate reductase subunit | Electron transfer activity | Hypoxia adaptation | DC | — | ++ | ++ | ++ | ++ | ++ | − |
| Rv1739c |  | Transporter | Transport | Sulfate uptake | DC | — | ++ | − | ++ | + | − | − |
| Rv1981c |  | DNA-methylase | Regulation | Deleted in | — | Mouse | ++ | ++ | − | − | + | − |
| Rv2275 | — | Cyclopeptide synthase | Secondary metabolism (mycocyclosin) | — | — | Mouse | ++ | + | − | ++ | − | − |
| Rv2328 |  | PE_PPE | Secreted and surface proteins | — | M | — | ++ | + | − | − | − | − |
| Rv2547 |  | Toxin–antitoxin | Regulation | Induced within granulomas | M | — | ++ | − | ++ | − | ++ | − |
| Rv2548 |  | Toxin–antitoxin | Regulation | Induced within granulomas | M | — | ++ | − | ++ | − | ++ | − |
| Rv2549c |  | Toxin–antitoxin | Regulation | — | M | — | ++ | − | ++ | − | ++ | − |
| Rv2550c |  | Toxin–antitoxin | Regulation | — | M | — | ++ | − | ++ | − | ++ | − |
| Rv2735c | — | — | — | — | DC | — | − | − | ++ | − | + | − |
| Rv2954c | — | Fucosyltransferase | Secondary metabolism (phenolglycolipid) | — | M | — | ++ | ++ | − | − | − | − |
| Rv2955c | — | Fucosyltransferase | Secondary metabolism (phenolglycolipid) | — | M | — | ++ | ++ | − | − | − | − |
| Rv3082c |  | Transcriptional regulator | Regulation | Activation of Myma operon upon acidic pH and within phagosome | M | Guinea pig | ++ | ++ | − | − | − | − |
| Rv3087 | — | DAG-O-acyltransferase | Secondary metabolism (lipids) | Belongs to Myma operon, involved in mycolic acid content | M | Mouse | ++ | ++ | − | − | − | − |
| Rv3179 | — | — | — | — | DC | — | ++ | − | − | + | + (p) | − |
| Rv3320c |  | Toxin–antitoxin | Regulation | Induced in nutrient starvation conditions | — | Mouse deltaMHC | ++ | − | − | + | + | − |
| Rv3343c |  | PE_PPE | Secreted and surface proteins | Oxidative stress resistance | DC | — | ++ | − | + | − | ++ | − |
| Rv3345c |  | PE_PPE | Secreted and surface proteins | — | DC | — | − | − | + | − | ++ | − |
| Rv3376 | — | Phosphatase | Secondary metabolism (diterpene 1TbAd) | Phagolysosome maturation arrest | M | — | ++ | + | − | + | − | − |
| Rv3377c | — | Diterpene synthase | Secondary metabolism (diterpene 1TbAd) | Phagolysosome maturation arrest | DC | — | ++ | ++ | − | ++ | − | − |
| Rv3476c |  | Transporter | Transport | — | DC | — | ++ | ++ | − | − | − | − |
| Rv3487c |  | Lipase-esterase | Secondary metabolism (lipids) | Upregulated under acidic pH conditions | — | Mouse | ++ | ++ | − | − | − | − |
| Rv3826 |  | Acyl-CoA synthetase | Secondary metabolism (sulfolipids) | Lower binding affinity for macrophages | DC | — | ++ | + | − | − | − | − |
Notes.—List of experimentally confirmed M. tuberculosis virulence genes that have at least one ortholog among other species of the MTBAP lineage (M. canettii is not included in the analysis) and no ortholog in any SGM depicted in the representative SGM database.
+, Presence of a putative ortholog by BBH analysis.
++, Presence of a putative ortholog by BBH analysis and synteny confirmation; −, Absence of putative ortholog.
(p), Putative pseudogene.
Mtub, M. tuberculosis H37Rv; Mcan, M. canettii STB-K; Mdec, M. decipiens; Mshi, M. shinjukuense; Mlac, M. lacus; Mriy, M. riyadhense.
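The inclusion rule stated in the notes (at least one ortholog among MTBAP members other than M. canettii, and none in the representative SGM set) can be expressed as a small filter. The sketch below is illustrative: the function names are hypothetical, the first two rows are copied from the table, the third row is invented to show a rejected case, and counting a putative pseudogene "(p)" as present is an assumption.

```python
# MTBAP members considered by the rule; M. canettii is excluded per the notes.
MTBAP_NON_CANETTII = ("Mdec", "Mshi", "Mlac", "Mriy")


def has_ortholog(call):
    """'+' and '++' (optionally followed by ' (p)') count as present; '−' as absent."""
    return call.strip().startswith("+")


def acquired_within_mtbap(row):
    """Apply the inclusion rule from the table notes to one gene."""
    in_lineage = any(has_ortholog(row[species]) for species in MTBAP_NON_CANETTII)
    return in_lineage and not has_ortholog(row["SGM"])


rows = {
    # Copied from the table above.
    "Rv0071": {"Mcan": "−", "Mdec": "+", "Mshi": "−", "Mlac": "−", "Mriy": "−", "SGM": "−"},
    "Rv1442": {"Mcan": "++", "Mdec": "++", "Mshi": "++ (p)", "Mlac": "+", "Mriy": "++ (p)", "SGM": "−"},
    # Invented counter-example: an ortholog in SGM rules the gene out.
    "hypothetical_gene": {"Mcan": "++", "Mdec": "++", "Mshi": "−", "Mlac": "−", "Mriy": "−", "SGM": "+"},
}

selected = [gene for gene, row in rows.items() if acquired_within_mtbap(row)]
```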
Fig. 5.—Representation of selected genomic islands. The genomic regions depict the fumarate reductase locus (Rv1552-1555), the Myma locus (Rv3082c-3087), or the sulfolipid synthesis locus (Rv3820c-3826), and the surrounding genomic regions in M. tuberculosis, M. decipiens, and M. kansasii. Pink links: homologous genomic regions. Filled arrows: genes in genomic islands.
Fig. 6.—Hypothetical evolutionary scenario of the members of the MTBAP lineage. In this schematic representation, the likely evolutionary breakpoint is marked, beyond which the members concerned display shared host-adapted traits.