| Literature DB >> 34665012 |
Eric C Keen1,2, JooHee Choi1, Meghan A Wallace2, Michelle Azar3, Carlos R Mejia-Chew4, Shail B Mehta4, Thomas C Bailey4, Lindsay J Caverly3, Carey-Ann D Burnham2,4,5,6, Gautam Dantas1,2,6,7.
Abstract
Nontuberculous mycobacteria, including those in the Mycobacterium avium complex (MAC), constitute an increasingly urgent threat to global public health. Ubiquitous in soil and water worldwide, MAC members cause a diverse array of infections in humans and animals that are often multidrug resistant, intractable, and deadly. MAC lung disease is of particular concern and is now more prevalent than tuberculosis in many countries, including the United States. Although the clinical importance of these microorganisms continues to expand, our understanding of their genomic diversity is limited, hampering basic and translational studies alike. Here, we leveraged a unique collection of genomes to characterize MAC population structure, gene content, and within-host strain dynamics in unprecedented detail. We found that different MAC species encode distinct suites of biomedically relevant genes, including antibiotic resistance genes and virulence factors, which may influence their distinct clinical manifestations. We observed that M. avium isolates from different sources-human pulmonary infections, human disseminated infections, animals, and natural environments-are readily distinguished by their core and accessory genomes, by their patterns of horizontal gene transfer, and by numerous specific genes, including virulence factors. We identified highly similar MAC strains from distinct patients within and across two geographically distinct clinical cohorts, providing important insights into the reservoirs which seed community acquisition. We also discovered a novel MAC genomospecies in one of these cohorts. Collectively, our results provide key genomic context for these emerging pathogens and will facilitate future exploration of MAC ecology, evolution, and pathogenesis. IMPORTANCE Members of the Mycobacterium avium complex (MAC), a group of mycobacteria encompassing M. avium and its closest relatives, are omnipresent in natural environments and emerging pathogens of humans and animals. MAC infections are difficult to treat, sometimes fatal, and increasingly common. Here, we used comparative genomics to illuminate key aspects of MAC biology. We found that different MAC species and M. avium isolates from different sources encode distinct suites of clinically relevant genes, including those for virulence and antibiotic resistance. We identified highly similar MAC strains in patients from different states and decades, suggesting community acquisition from dispersed and stable reservoirs, and we discovered a novel MAC species. Our work provides valuable insight into the genomic features underlying these versatile pathogens.Entities:
Keywords: Mycobacterium; Mycobacterium avium complex; comparative genomics; genomic epidemiology; nontuberculous mycobacteria; whole-genome sequencing
Year: 2021 PMID: 34665012 PMCID: PMC8525567 DOI: 10.1128/mSystems.01194-21
Source DB: PubMed Journal: mSystems ISSN: 2379-5077 Impact factor: 6.496
FIG 1M. avium core genomes cluster by source of isolation. (A) Pairwise average nucleotide identity (ANI) matrix of all 170 MAC isolates in this study. MA, M. avium; MAR, M. arosiense; MCO, M. colombiense; MI, M. intracellulare; ML, M. lepraemurium; MM, M. marseillense; MV, M. vulneris; TS, novel isolates from WUMAC and FLAC cohorts described in this study. (B) ANI matrix of all 96 available mycobacterial representative/reference strains and 2 WUMAC isolates with <95% ANI to any other isolate in panel A. MAC, M. avium complex; NTM, all other nontuberculous mycobacteria; MTB, M. tuberculosis; ML, M. leprae, NOV, WUMAC isolates which represent a putative novel MAC genomospecies. An asterisk denotes M. mantenii. (C) Core genome phylogeny of 109 M. avium isolates as determined by Roary and RAxML. The scale bar represents the number of substitutions per site. (D) Core genome phylogenetic network of 109 M. avium isolates as determined by SplitsTree4. Branch lengths represent uncorrected P values. Numbers represent lineages as shown in panel C. Overlapping nodes were not annotated with the isolate source.
FIG 4Similar MAC strains in distinct patients and cohorts. (A) Core genome phylogeny of all WUMAC and FLAC M. intracellulare isolates (n = 16) and all other M. intracellulare isolates (n = 28). Unpaired isolates are from patients for whom only a single isolate was available. The scale bar represents the number of substitutions per site. (B) Core genome phylogeny of all WUMAC and FLAC M. avium isolates (n = 25) and all other M. avium human pulmonary isolates (n = 31). Unpaired isolates are from patients for whom only a single isolate was available. The scale bar represents the number of substitutions per site. (C) Pairwise single-nucleotide polymorphism (SNP) distances for all intrapatient isolate pairs from WUMAC/FLAC M. avium (n = 5) and M. intracellulare (n = 6) patients. The sampling interval represents the number of days between isolate collections. (D) All intrapatient and interpatient pairwise core genome SNP distances that were <1,000 for all possible WUMAC/FLAC M. avium and M. intracellulare isolate pairs. Pairwise SNP distances of >1,000 are not shown. The sampling interval represents the number of days between isolate collections.
FIG 2M. avium pangenomes cluster by source of isolation. (A) Pangenome ordination of 109 M. avium isolates by t-distributed stochastic neighbor embedding (t-SNE). (B) Between- and within-group Jaccard dissimilarity of M. avium isolate sources, comprising 109 total isolates. ****, P < 0.0001; **, P < 0.01 (Kruskal-Wallis test with the Benjamini-Hochberg correction). (C) Genes differentially abundant in pulmonary (n = 56; green) and nonpulmonary (n = 53; red) M. avium isolates as identified by Scoary. All genes with substantial (log2 fold change > 3) and significant (P < 0.05; Fisher’s exact test with the Benjamini-Hochberg correction) enrichment are shown. (D) COG profiles of enriched genes in panel C. Categories of general or unidentified hits or those with no hits are not shown. The reference proteome represents the COG distribution of proteins in the representative M. avium strain OCU464. ***, P < 0.001 (Fisher’s exact test with the Benjamini-Hochberg correction).
FIG 3Foreign DNA signatures vary by MAC species and M. avium isolate source. (A) Total numbers of kilobases of putative foreign DNA in MAC genomes as identified by IslandPath-DIMOB. MA-HP, human pulmonary M. avium isolates; MA-HD, human disseminated M. avium isolates; MA-E, environmental M. avium isolates; MA-A, animal M. avium isolates; MI, M. intracellulare; MCO, M. colombiense. ****, P < 0.0001; **, P < 0.01; *, P < 0.05 (one-way ANOVA with Tukey’s post hoc correction). (B) Total numbers of copies of mobility elements within foreign DNA in M. avium, binned by isolate source. ns, not significant (one-way ANOVA with Tukey’s post hoc correction for each mobility element). (C) Top 10 most abundant genes within each mobility element in foreign DNA in M. avium. Mobility and hypothetical proteins are not shown. (D) Top 10 most abundant genes within foreign DNA for each M. avium isolate source. Mobility and hypothetical proteins are not shown.