| Literature DB >> 28338924 |
Natalya E Mikheecheva1,2, Marina V Zaychikova1, Alexander V Melerzanov2, Valery N Danilenko1,3.
Abstract
Mycobacterium tuberculosis is divided into several distinct lineages, and various genetic markers such as IS-elements, VNTR, and SNPs are used for lineage identification. We propose an M. tuberculosis classification approach based on functional polymorphisms in virulence genes. An M. tuberculosis virulence genes catalog has been established, including 319 genes from various protein groups, such as proteases, cell wall proteins, fatty acid and lipid metabolism proteins, sigma factors, toxin-antitoxin systems. Another catalog of 1,573 M. tuberculosis isolates of different lineages has been developed. The developed SNP-calling program has identified 3,563 nonsynonymous SNPs. The constructed SNP-based phylogeny reflected the evolutionary relationship between lineages and detected new sublineages. SNP analysis of sublineage F15/LAM4/KZN revealed four lineage-specific mutations in cyp125, mce3B, vapC25, and vapB34. The Ural lineage has been divided into two geographical clusters based on different SNPs in virulence genes. A new sublineage, B0/N-90, was detected inside the Beijing-B0/W-148 by SNPs in irtB, mce3F and vapC46. We have found 27 members of B0/N-90 among the 227 available genomes of the Beijing-B0/W-148 sublineage. Whole-genome sequencing of strain B9741, isolated from an HIV-positive patient, was demonstrated to belong to the new B0/N-90 group. A primer set for PCR detection of B0/N-90 lineage-specific mutations has been developed. The prospective use of mce3 mutant genes as genetically engineered vaccine is discussed.Entities:
Keywords: Mycobacterium tuberculosis; lineage-specific SNP; phylogenetic analysis; virulence genes
Mesh:
Substances:
Year: 2017 PMID: 28338924 PMCID: PMC5381574 DOI: 10.1093/gbe/evx053
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Distribution of 319 Selected Virulence Genes between Functional Groups of Protein Gene Products
| Group | Number of Genes | Main Functions |
|---|---|---|
| Cell wall proteins | 48 | Transport of virulence factors, adhesion, interaction with host immune system |
| Lipoproteins | 7 | Interaction with macrophages, adhesion, signal transduction |
| Lipid and fatty acid metabolism | 17 | Suppression of immune response, interaction with host cells |
| Mycolic acid synthesis | 12 | Interaction with macrophages |
| Cholesterol catabolism | 6 | Using cholesterol as an energy source, synthesis of virulence-associated PDIM lipids |
| Type VII secretion system | 17 | Export of signal molecules and inhibitors of antimicrobial response, induction of bacteria’s exit from macrophages |
| Proteins inhibiting antimicrobial responses of the macrophage | 15 | Survival in macrophages |
| Sigma factors | 8 | Transcriptional regulation of virulence factors |
| Two-component systems | 12 | Signal transduction, participation in adaptation to hostile environment |
| Serine–threonine proteinkinases | 3 | Apoptose inhibition, inhibition of phagosome–lysosome fusion, reduction of phagocytosis |
| TA systems | 142 | Survival under hostile conditions through switching to the latent state |
| Proteases | 7 | Regulation of virulence factors, host cells degradation, protection from immune response |
| Metal transporter proteins | 7 | Survival in hostile conditions, export of toxic ions—antimicrobial factors |
| Other | 18 | – |
Studied Mycobacterium tuberculosis Lineages and Sublineages
| Lineage and Sublineage | Number of Genomes | Main Countries of Isolation |
|---|---|---|
| Beijing | 557 | |
| Beijing-modern | 295 | South Africa, India, Sweden, China |
| Beijing-B0/W-148 | 87(+140) | Belarus, Iran, Sweden, (Russia) |
| Beijing-ancestral | 175 | South Africa, South Korea |
| Cameroon | 6 | Mali |
| Delhi/CAS | 102 | India, Belarus, Uganda |
| EAI | 197 | |
| EAI-Manila | 18 | India, Taiwan |
| Other | 179 | India |
| X | 165 | |
| Ghana | 37 | Mali |
| Haarlem | 49 | South Africa, Romania, Belarus |
| Other | 79 | South Africa, Mali |
| LAM: | 337 | |
| LAM1 | 31 | Panama |
| LAM2 | 9 | Panama |
| F15/LAM4/KZN | 150 | South Africa |
| Other | 147 | South Africa, Belarus |
| S | 109 | South Africa, Uganda |
| SMI-049 | 5 | Sweden |
| T-H37Rv-like | 12 | – |
| Ural | 83 | Moldova, Sweden, Belarus |
| Total | 1,573 |
Note.—Beijing lineage is divided into two sublineages: Beijing-modern and Beijing-ancestral. Sublineage Beijing-modern includes Beijing-B0/W-148 sublineage. EAI lineage includes EAI-Manila sublineage. X lineage includes sublineages Ghana and Haarlem. LAM lineage includes sublineages LAM1, LAM2 and F15/LAM4/KZN.
Additionally analyzed genomes from GMTV database.
FGlobal phylogeny of the main Mycobacterium tuberculosis lineages. The tree has two main branches: EAI (so-called Indo-oceanic lineage 1, one of the most ancient MT strains) and the branch with modern lineages — Delhi/CAS (East-African–Indian lineage 3), Beijing (East-Asian lineage 2), and others (Euro-American lineage 4). The tree was rooted on M. canettii. Bootstrap values >50% are shown.
F(a) The numbers of synonymous (blue) and nonsynonymous (orange) SNPs per gene in TA and non-TA genes are statistically different (t-test, P < 0.001). Standard errors for every group are shown. (b) Distribution of nsSNPs in virulence genes by the number of genomes in which mutation is located. (c) Comparison of nsSNPs in virulence genes (red squares) and in the whole genome (blue circles). Values of nsSNPs were divided by maximum to make them vary from 0 to 1 and mean values for every lineage were calculated. Standard deviations for every group are shown.
A Set of Lineage-Specific Nonsynonymous SNPs in Virulence Genes Sufficient for the Resolution of the 18 Studied Mycobacterium tuberculosis Groups
| Lineage or Sublineage | Gene | Groups | SNP | |
|---|---|---|---|---|
| Beijing | TA systems | G122T G-V | 1 | |
| Beijing-modern | TA systems | A46G T-A | 1 | |
| Beijing-B0/W-148 | Cell wall proteins | T145G S-A | 1 | |
| Cameroon | Mycolic acid synthesis | C691T H-Y | 1 | |
| Delhi/CAS | Metal transporter proteins | C143T A-V | 0.99 | |
| EAI | Metal transporter proteins | G964T V-F | 0.99 | |
| EAI-Manila | Metal transporter proteins | G2137A D-N | 1 | |
| X | Lipid and fatty acid metabolism | A3688G T-A | 0.9 | |
| Ghana | Cell wall proteins | G338C G-A | 1 | |
| Haarlem | Metal transporter proteins | G545A R-H | 1 | |
| LAM | Cell wall proteins | C992G P-R | 1 | |
| LAM1 | Type VII secretion system | C32T T-I | 1 | |
| LAM2 | Mycolic acid synthesis | G166A E-K | 1 | |
| F15/LAM4/KZN | Cell wall proteins | T44C F-S | 1 | |
| S | TA systems | G175A G-S | 1 | |
| SMI-049 | Lipoproteins | C877G P-A | 1 | |
| T-H37Rv-like | Cell wall proteins | T1109T | 1 | |
| Ural | Cell wall proteins | C1000G Q-E | 1 |
Note.—For every SNP, its position, nucleotide and acid change are given. Lineage T-H37Rv-like is defined by the absence of mutation in 1,109 nucleotide in mce1F. P, Pearson correlation coefficient.
FPhylogenetic trees of the Mycobacterium tuberculosis sublineges. (a) Phylogenetic tree of the LAM lineage. The tree is constructed for 337 isolates of the LAM lineage. Three sublineages are separated in different branches with high bootstrap support. (b) Phylogenetic tree of the Beijing-B0/W-148 sublineage. The tree is constructed for 87 isolates of the Beijing-B0/W-148 sublineage. Note two Belarus clusters and B0/N-90 sublineage. (c) Phylogenetic tree of the Ural lineage. The three is constructed for 83 isolates of Ural sublineage. All isolates are divided into two geographical clusters. The trees were rooted on H37Rv reference genome. Bootstrap values >50% are shown.
Lineage-Specific Mutations in Virulence Genes of the F15/LAM4/KZN and B0/N-90 Sublineages
| Gene | Group | Function of Product Protein | SNP | Effect of SNP |
|---|---|---|---|---|
| F15/LAM4/KZN | ||||
| | Cell wall proteins | Adhesion or invasion | T44C F-S | A (0/3) |
| | Cholesterol catabolism | Degradation of cholesterol, participation in synthesis of virulence-associated lipids PDIM | T1076C I-T | A (0/3) |
| | TA systems | Toxin, RNase, participation in developing of dormant state | A221C N-T | N (3/0) |
| | Antitoxin, participation in developing of dormant state | C140A A-E | D (2/1) | |
| B0/N-90 | ||||
| | Cell wall proteins | Adhesion or invasion | A1229C D-A | D (2/1) |
| | TA systems | Toxin, participation in pathogenesis and developing of dormant state | C113G A-G | D (2/1) |
| | Metal transporter proteins | ABC transporter, import of iron | G523A A-T | N (3/0) |
Note.—Effect of SNP was defined using three algorithms from online-tool PredictSNP. Numbers shown in parentheses are algorithms predicting neutral effect of mutations versus algorithms predicting deleterious effect. Note that mutation in mce3B and cyp125 are likely to affect protein function. All provided mutations have Pearson coefficient >0.95. A, affected; D, doubtful; N, neutral.
FDistribution of SNPs between B0/N-90 isolates. Comparative diagram of SNPs in five isolates of the B0/N-90 sublineage. Both common and individual mutations are shown. The distribution of SNPs shows close evolutionary relationships between Russian isolates, whereas the Swedish isolate shows no common mutations except core ones.
Nonsynonymous SNPs in mce1–4 operons
| Operon | Number of nsSNPs | |||
|---|---|---|---|---|
| Total | Lineage-Specific | Specific to Small Groups | ||
| Total | Affected | |||
| 48 | 11 (22%) | 37 | 10 (27%) | |
| 35 | 5 (14%) | 30 | 15 (50%) | |
| 32 | 8 (25%) | 24 | 12 (50%) | |
| 26 | 2 (8%) | 24 | 15 (63%) | |
Note.—A total number of nsSNPs in mce genes was counted. All identified mutations were divided into two groups: specific to an established lineage and located in small groups of at least two isolates. Number of affected mutations was counted using PredictSNP. Note that mce4 operon has the biggest (63%) and mce1—the smallest (27%) percent of small group mutations possibly affecting protein function. Full version of the table with descriptions of SNPs and groups is listed in supplementary material S2, Supplementary Material online.