| Literature DB >> 33941780 |
Tao Luo1,2, Peng Xu3,4, Yangyi Zhang5, Jessica L Porter6,7, Marwan Ghanem8, Qingyun Liu3, Yuan Jiang5, Jing Li5, Qing Miao9, Bijie Hu9, Benjamin P Howden6,7,10, Janet A M Fyfe11, Maria Globan11, Wencong He12, Ping He12, Yiting Wang12, Houming Liu13, Howard E Takiff14,15,16, Yanlin Zhao17, Xinchun Chen18, Qichao Pan19, Marcel A Behr20, Timothy P Stinear21,22, Qian Gao23.
Abstract
Mycobacterium kansasii can cause serious pulmonary disease. It belongs to a group of closely-related species of non-tuberculous mycobacteria known as the M. kansasii complex (MKC). Here, we report a population genomics analysis of 358 MKC isolates from worldwide water and clinical sources. We find that recombination, likely mediated by distributive conjugative transfer, has contributed to speciation and on-going diversification of the MKC. Our analyses support municipal water as a main source of MKC infections. Furthermore, nearly 80% of the MKC infections are due to closely-related M. kansasii strains, forming a main cluster that apparently originated in the 1900s and subsequently expanded globally. Bioinformatic analyses indicate that several genes involved in metabolism (e.g., maintenance of the methylcitrate cycle), ESX-I secretion, metal ion homeostasis and cell surface remodelling may have contributed to M. kansasii's success and its ongoing adaptation to the human host.Entities:
Mesh:
Substances:
Year: 2021 PMID: 33941780 PMCID: PMC8093194 DOI: 10.1038/s41467-021-22760-6
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Global diversity of the M. kansasii complex.
a Geographical distribution of the 358 isolates in the study. The gradient blue colors indicate the prevalence of M. kansasii among NTM disease. b Core genome-based maximum-likelihood phylogeny of the 358 isolates. The colors of the terminal nodes correspond to the geographical origin of individual isolates, as denoted by the circles in (a). c Pairwise genomic average nucleotide identity (gANI) within and between the M. kansasii complex species. Source data are provided as a Source Data file.
Fig. 2Genomic recombination and its contribution in speciation and diversification of the M. kansasii complex.
a Phylogenetic network of the M. kansasii complex based on the core genome alignment of 358 isolates. b Population structure and genomic recombinations inferred by fastGEAR. Each line represents the genomic constitution (exhibited as color strips) of individual isolates according to ancestral (upper panel) or recent (lower panel) recombinations. Strip colors represent the different species as in panel (a). White strips (lower section) represent recent recombinations from unknown sources. Source data are provided as a Source Data file. c Length distribution of recombinant fragments in the four major species. Boxes show the median and interquartile range (IQR) while whiskers extend to a maximum of 1.5× IQR. d Maximum likelihood phylogeny of the four major species based on non-recombinant SNVs. Brackets indicate clusters containing isolates with an average pair-wise genomic difference of fewer than 100 SNVs. The colors of terminal nodes indicate the geographical origins of the isolates, corresponding to Fig. 1a. Filled circles indicate a human source; empty circles, an environmental source. MKMC M. kansasii main cluster.
Fig. 3Phylogenomic analyses of the M. kansasii main cluster (MKMC).
a The maximum-likelihood phylogeny of the MKMC based on non-recombinant mutations. The colors of terminal branches indicate the geographical origins of the isolates, as in Fig. 1a. Isolation from non-human or unknown sources is indicated by triangles in the terminal nodes. b Genomic pattern and proportion of recent recombinations for individual isolates. Donor species are colored as in Fig. 2a. Source data are provided as a Source Data file. c Median-joining network for the Australia outbreak strain cluster. Node size indicates the number of isolates; node color indicates the source and year of isolation.
Fig. 4Genomic loci specific to M. kansasii.
a Synteny map for the genomic region flanking the genes of methylcitrate cycle (MCC) in M. kansasii. Full-length and truncated genes are represented by arrows and rectangles respectively. The full MCC genes in M. kansasii and their orthologous sequences in the other MKC species are indicated with different colors. The flanking genes are in gray or white to represent homologous or orphan genes, respectively. b A scheme of the MCC of mycobacteria and its relation to the beta-oxidation, tricarboxylic acid (TCA) and glyoxylate cycles. 2MC 2-methylcitrate, 2MI 2-methylisocitrate, OAA oxaloacetate. c Synteny map of the three ESP (ESX-1 secretory protein) loci specific to M. kansasii. Red arrows represent espE-like genes and the numbers indicate their MKAN_RS identifiers. d Sequence similarity between the EspE of M. tuberculosis and the EspE-like proteins of M. kansasii. Residues are colored to indicate similarities.
Fig. 5Genes under positive selections in the MKMC.
a Circular plot of genes under potential positive selection. Innermost inward bars, number of recombination per gene; innermost outward bars, number of mutations per gene (red bars, n ≥ 4); outer red lines, location of highly polymorphic genes; outer blue line, location of genes with convergent mutations. b Schematic diagrams depicting the distribution and frequency of non-synonymous mutations in zur and tetR1/2, and the recombinations around tetR1/2. Hom. homozygous mutation, Het. heterozygous mutation, Rec. recombination (strip colors represent donor subspecies corresponding to Fig. 2a). Source data are provided as a Source Data file. c Schematic representation of mutations in LOS biosynthesis genes and corresponding morphology of the mutant strains. Genes were colored according to their functions. Green, genes involved in polyketide synthesis; orange, acyltransferase; cyan, glucosyltransferase. fs frameshift. Source data are provided as a Source Data file.