David Tarazona, Luis Jaramillo, Victor Borda, Kelly Levano, Marco Galarza, Heinner Guio.
Abstract
Mycobacterium tuberculosis (MTB), the causative agent of tuberculosis (TB), has a vast diversity of genotypes, including Beijing, CAS, EAI, Haarlem, LAM, X, Ural, T, AFRI1 and AFRI2. However, genotyping can be expensive and time consuming, and in some cases results vary depending on the methodology used. Here, we propose a new set of 10 SNPs derived from a total of 249 MTB genomes. SNPs were selected first by inclusion/exclusion (IE) criteria based on spoligotyping and phylogenies, and then by retaining the nonsynonymous SNPs present in the most conserved cluster of orthologous groups (COG) of each MTB genotype. Genotype assignment with the new set of 10 SNPs was validated using an additional 34 MTB genomes, and the results showed 100% agreement with their known genotypes. Our set of 10 SNPs has not been previously reported and covers the MTB genotypes that are prevalent worldwide. This set of SNPs could be used for molecular epidemiology together with drug-resistance markers.
Keywords: Genomic signature; Genotyping; Mycobacterium tuberculosis
Year: 2017 PMID: 28943727 PMCID: PMC5602289 DOI: 10.6026/97320630013224
Source DB: PubMed Journal: Bioinformation ISSN: 0973-2063
Figure 1. Maximum parsimony phylogenetic tree of 249 strains of M. tuberculosis.
Figure 2. Diagram of signature SNPs for MTB genotypes (Beijing, CAS, EAI, Haarlem, LAM, T, X, Ural, AFRI1 and AFRI2) after applying the IE criteria.
Figure 3. Relative distribution of Mycobacterium tuberculosis SNPs in proteins belonging to Clusters of Orthologous Groups (COGs): Function unknown (S), General function prediction only (R), Lipid transport and metabolism (I), Secondary metabolites biosynthesis, transport, and catabolism (Q), Amino acid transport and metabolism (E), Energy production and conversion (C), Cell wall/membrane/envelope biogenesis (M), Replication, recombination and repair (L), Cell motility (N), Carbohydrate transport and metabolism (G), Coenzyme transport and metabolism (H), Inorganic ion transport and metabolism (P), Transcription (K), Translation, ribosomal structure and biogenesis (J), Post-translational modification, protein turnover, and chaperones (O), Signal transduction mechanisms (T), Nucleotide transport and metabolism (F), Cell cycle control, cell division, chromosome partitioning (D), Defense mechanisms (V), Intracellular trafficking, secretion, and vesicular transport (U), RNA processing and modification (A).
Stepwise SNP set selection. IE, inclusion/exclusion criteria (differential genotype); *, loci found in strains of other genotypes and loci uncommon between genotypes were eliminated; **, SNP set based on COG group (A→K); ***, SNP set based on the least variable COG group of each genotype.
| Step | Description | Genotype | Beijing | CAS | EAI | Haarlem | LAM | X | Ural | AFRI1 | AFRI2 | T | Total |
| | | Gagneux [ | L2 | L3 | L1 L3 | L4 | L4 | L4 | L4 | L5 | L6 | L4 | |
| | | MTB genome sequences | 52 | 6 | 23 | 19 | 78 | 9 | 1 | 6 | 5 | 50 | 249 |
| 1 | Integration of 7649 SNPs | Total SNPs | 458 | 854 | 674 | 624 | 424 | 412 | 886 | 1735 | 1582 | | 7649 |
| 2 | IE criteria selection | SNPs-IE | 69 | 310 | 220 | 33 | 93 | 56 | 396 | 737 | 621 | | 2535 |
| 2 | | SNPs-IE* | 20 | 276 | 208 | 15 | 37 | 16 | 372 | 580 | 599 | | 2123 |
| 3 | Selection of nonsynonymous SNPs present in the most conserved COG | SNP set based on COG group** | 3 | 29 | 22 | 1 | 3 | 1 | 36 | 68 | 79 | | 243 |
| 3 | | Most conserved COG in MTB | F | U | U | F | D | K | A | A | A | - | |
| 3 | | Proposed SNP set*** | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | | 10 |
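The inclusion/exclusion step in the table above can be illustrated with a minimal sketch. It assumes one plausible reading of the IE criteria: a SNP is kept for a genotype only if every strain of that genotype carries it and no strain of any other genotype does. The function name `ie_filter`, the toy strains, the toy genotype labels and the positions are all hypothetical and are not taken from the study's data.

```python
from typing import Dict, Set


def ie_filter(strain_snps: Dict[str, Set[int]],
              strain_genotype: Dict[str, str]) -> Dict[str, Set[int]]:
    """For each genotype, keep SNP positions shared by all of its strains
    (inclusion) and absent from every strain of other genotypes (exclusion).
    This is an assumed reading of the IE criteria, not the authors' code."""
    signatures: Dict[str, Set[int]] = {}
    for genotype in sorted(set(strain_genotype.values())):
        members = [s for s, g in strain_genotype.items() if g == genotype]
        others = [s for s, g in strain_genotype.items() if g != genotype]
        # Inclusion: positions present in every strain of this genotype.
        shared = set.intersection(*(strain_snps[s] for s in members))
        # Exclusion: positions seen in any strain of another genotype.
        seen_elsewhere = set().union(*(strain_snps[s] for s in others))
        signatures[genotype] = shared - seen_elsewhere
    return signatures


# Hypothetical toy input: four strains, two made-up genotypes.
TOY_SNPS = {
    "s1": {100, 200, 300},
    "s2": {100, 200, 400},
    "s3": {100, 500},
    "s4": {100, 500, 600},
}
TOY_GENOTYPES = {"s1": "toyA", "s2": "toyA", "s3": "toyB", "s4": "toyB"}

signatures = ie_filter(TOY_SNPS, TOY_GENOTYPES)
print(signatures)
```

In this toy input, position 100 is dropped because it occurs in both genotypes, and positions 300, 400 and 600 are dropped because they are not shared by all strains of their genotype.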
Set of 10 SNPs proposed to genotype MTB.
| SNP set | Genotype differential | COG | Gene (locus tag) | SNP proposed |
| Principal SNPs set | Beijing | Nucleotide transport and metabolism | GuaB2 (Rv3411c) | 3830349 (Ala391Thr: GCG->ACG) |
| Principal SNPs set | CAS | Intracellular trafficking, secretion, and vesicular transport | (Rv3921c) | 4409954 (Ala39Gly: GCC->GGC) |
| Principal SNPs set | EAI | Intracellular trafficking, secretion, and vesicular transport | SecE1 (Rv0638) | 734116 (Met127Thr: ATG->ACG) |
| Principal SNPs set | Haarlem | Nucleotide transport and metabolism | Hpt (Rv3625c) | 4063682 (Leu61Met: CTG->ATG) |
| Principal SNPs set | LAM | Cell cycle control, cell division, chromosome partitioning | Smc (Rv2922c) | 3236230 (Arg526Leu: CGT->CTT) |
| Principal SNPs set | X | Transcription | (Rv2618) | 2946570 (Gly63Asp: GGC->GAC) |
| Principal SNPs set | X | | (Rv3625c) | 4063682 (Leu61Met: CTG->ATG) |
| Principal SNPs set | Ural | RNA processing and modification | (Rv1097c) | 1225462 (Asp228Gly: GAC->GGC) |
| Principal SNPs set | AFRI1 | RNA processing and modification | (Rv3439c) | 3858894 (Leu266Phe: CTT->TTT) |
| Principal SNPs set | AFRI2 | RNA processing and modification | (Rv1097) | 1226021 (Gln42Lys: CAG->AAG) |
| Principal SNPs set | AFRI2 | | (Rv3689) | 4130604 (Ser42Asn: AGC->AAC) |
| Alternative SNPs set | Beijing | Post-translational modification, protein turnover, and chaperones | (Rv1463) | 1651308 (Glu198Gly: GAA->GGA) |
| Alternative SNPs set | CAS | Defense mechanisms | IrtA (Rv1348) | 1513189 (Ala48Val: GCT->GTT) |
| Alternative SNPs set | EAI | Defense mechanisms | (Rv1730) | 1956930 (Thr39Pro: ACT->CCT) |
| Alternative SNPs set | Haarlem | Transcription | RpoC (Rv0668) | 765150 (Gly594Glu: GGG->GAG) |
| Alternative SNPs set | LAM | Signal transduction mechanisms | CstA (Rv3063) | 3429202 (Tyr654Asp: TAC->GAC) |
| Alternative SNPs set | X | Carbohydrate transport and metabolism | (Rv2994) | 3352244 (Thr312Ala: ACC->GCC) |
| Alternative SNPs set | Ural | Intracellular trafficking, secretion, and vesicular transport | FtsY (Rv2921) | 3233940 (Ala67Gly: GCC->GGC) |
| Alternative SNPs set | AFRI1 | Intracellular trafficking, secretion, and vesicular transport | (Rv1887) | 2136642 (Leu129Phe: CTT->TTT) |
| Alternative SNPs set | AFRI2 | Defense mechanisms | IrtA (Rv1348) | 1515003 (Ala653Thr: GCC->ACC) |
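Each entry in the table above pairs an amino-acid change with a codon change, so the two can be cross-checked against the standard genetic code. The sketch below does this for the ten distinct principal SNPs. It is only an illustration: the helper name `check_entry` and the normalized entry format are assumptions, the codons are taken as reported (on the coding strand), and no attempt is made to map them back to genome coordinates.

```python
import re
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TO_AA = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

ONE_TO_THREE = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Ter",
}

# Assumed normalized format: "<position> (<Ref><n><Alt>: <codon>-><codon>)".
PATTERN = re.compile(
    r"(\d+) \(([A-Z][a-z]{2})(\d+)([A-Z][a-z]{2}): ([ACGT]{3})->([ACGT]{3})\)"
)


def check_entry(entry: str) -> bool:
    """True if both codons translate to the stated residues and the
    codons differ at exactly one nucleotide."""
    match = PATTERN.fullmatch(entry)
    if match is None:
        raise ValueError(f"unrecognized entry: {entry!r}")
    _pos, ref_aa, _aa_pos, alt_aa, ref_codon, alt_codon = match.groups()
    ref_ok = ONE_TO_THREE[CODON_TO_AA[ref_codon]] == ref_aa
    alt_ok = ONE_TO_THREE[CODON_TO_AA[alt_codon]] == alt_aa
    single = sum(a != b for a, b in zip(ref_codon, alt_codon)) == 1
    return ref_ok and alt_ok and single


# The ten distinct principal SNPs, copied from the table.
PRINCIPAL_SET = [
    "3830349 (Ala391Thr: GCG->ACG)",
    "4409954 (Ala39Gly: GCC->GGC)",
    "734116 (Met127Thr: ATG->ACG)",
    "4063682 (Leu61Met: CTG->ATG)",
    "3236230 (Arg526Leu: CGT->CTT)",
    "2946570 (Gly63Asp: GGC->GAC)",
    "1225462 (Asp228Gly: GAC->GGC)",
    "3858894 (Leu266Phe: CTT->TTT)",
    "1226021 (Gln42Lys: CAG->AAG)",
    "4130604 (Ser42Asn: AGC->AAC)",
]

all_consistent = all(check_entry(e) for e in PRINCIPAL_SET)
print(all_consistent)
```

A check of this kind catches transcription errors in the codon or residue columns; it says nothing about whether a SNP is actually specific to its genotype.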
Genotype assignment of 34 MTB genomes using the newly proposed set of 10 SNPs.
| No. | Strain | Accession Number | Ref. Lineage | Genotype assigned by our new set of 10 SNPs |
| 1 | 13-2459 | LDNL00000000 | Beijing | Beijing |
| 2 | 5351 | JXXH01000000 | Beijing | Beijing |
| 3 | 96075 | CP009426 | Beijing | Beijing |
| 4 | B9741 | LVJJ01000000 | Beijing | Beijing |
| 5 | BEIJING-L 323 | CP010873 | Beijing | Beijing |
| 6 | BeijingDS 6701 | JOKR01000001 | Beijing | Beijing |
| 7 | E186hv | JXAW00000000 | Beijing | Beijing |
| 8 | KT-0133 | JUFG00000000 | Beijing | Beijing |
| 9 | MTBR209 | LATO00000000 | Beijing | Beijing |
| 10 | W06 | LHCK00000000 | Beijing | Beijing |
| 11 | ZT272 | LGTJ00000000 | Beijing | Beijing |
| 12 | TBR-103XDR | JRJT01000001 | Beijing | Beijing |
| 13 | tahitMT11 | CVMX01000001 | Haarlem | Haarlem |
| 14 | TBR-102 | JRJS00000000 | Haarlem | Haarlem |
| 15 | TKK_03_0101 | GCF_000651975.1 | Haarlem | Haarlem |
| 16 | TKK_03_0103 | GCF_000651995.1 | Haarlem | Haarlem |
| 17 | TBR-152 | JRJQ00000000 | LAM | LAM |
| 18 | TKK_04_0029 | GCF_000673435.1 | LAM3 | LAM |
| 19 | TKK_04_0038 | GCF_000673275.1 | LAM4 | LAM |
| 20 | TKK_04_0039 | GCF_000673295.1 | LAM4 | LAM |
| 21 | TKK_04_0043 | GCF_000673075.1 | LAM4 | LAM |
| 22 | TKK_04_0044 | GCF_000673335.1 | LAM3 | LAM |
| 23 | TBR-175 | JRJR00000000.1 | LAM | LAM |
| 24 | TKK_04_0120 | GCF_000654175.1 | EAI | EAI |
| 25 | TKK-01-0028 | GCF_000660665.1 | X | X |
| 26 | TKK_02_0027 | GCF_000672095.1 | X | X |
| 27 | TKK_03_0063 | GCF_000651695.1 | X | X |
| 28 | TKK_03_0099 | GCF_000651935.1 | X | X |
| 29 | TKK_03_0150 | GCF_000652255.1 | X | X |
| 30 | TKK_05SA_0021 | GCF_000653515.1 | X | X |
| 31 | TKK_03_0037 | GCF_000651475.1 | CAS | CAS |
| 32 | TKK_04_0139 | GCF_000656875.1 | CAS | CAS |
| 33 | TKK_04_0148 | GCF_000656935.1 | CAS | CAS |
| 34 | TKK_05SA_0050 | GCF_000653755.1 | CAS | CAS |
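The agreement reported for the validation set can be recomputed directly from the table above. The sketch below is a minimal illustration, assuming that the reference labels LAM3 and LAM4 are sublineages that count as LAM, as the table's own assignments imply; the names `PAIRS`, `SUBLINEAGE_TO_FAMILY` and `family` are hypothetical.

```python
from collections import Counter

# (reference lineage, genotype assigned by the 10-SNP set), in table order.
PAIRS = (
    [("Beijing", "Beijing")] * 12
    + [("Haarlem", "Haarlem")] * 4
    + [("LAM", "LAM"), ("LAM3", "LAM"), ("LAM4", "LAM"), ("LAM4", "LAM"),
       ("LAM4", "LAM"), ("LAM3", "LAM"), ("LAM", "LAM")]
    + [("EAI", "EAI")]
    + [("X", "X")] * 6
    + [("CAS", "CAS")] * 4
)

# Assumption: sublineage labels collapse to their parent family.
SUBLINEAGE_TO_FAMILY = {"LAM3": "LAM", "LAM4": "LAM"}


def family(label: str) -> str:
    """Map a reference label to its genotype family (identity if unmapped)."""
    return SUBLINEAGE_TO_FAMILY.get(label, label)


matches = sum(family(ref) == assigned for ref, assigned in PAIRS)
concordance = matches / len(PAIRS)
per_genotype = Counter(assigned for _ref, assigned in PAIRS)

print(f"{matches}/{len(PAIRS)} concordant ({concordance:.0%})")
print(dict(per_genotype))
```

The per-genotype counts also show which genotypes the validation set covers: Beijing, Haarlem, LAM, EAI, X and CAS are represented, while Ural, AFRI1, AFRI2 and T are not.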