| Literature DB >> 30072734 |
Prasit Palittapongarnpim1,2, Pravech Ajawatanawong3, Wasna Viratyosin4, Nat Smittipat4, Areeya Disratthakit5, Surakameth Mahasirimongkol5, Hideki Yanai6,7, Norio Yamada8, Supalert Nedsuwan9, Worarat Imasanguan9, Pacharee Kantipong9, Boonchai Chaiyasirinroje6, Jiraporn Wongyai6, Licht Toyo-Oka10, Jody Phelan11, Julian Parkhill12, Taane G Clark11, Martin L Hibberd11, Wuthiwat Ruengchai3, Panawun Palittapongarnpim3, Tada Juthayothin4, Sissades Tongsima4, Katsushi Tokunaga10.
Abstract
Tuberculosis presents a global health challenge. Mycobacterium tuberculosis is divided into several lineages, each with a different geographical distribution. M. tuberculosis lineage 1 (L1) is common in the high-burden areas in East Africa and Southeast Asia. Although the founder effect contributes significantly to the phylogeographic profile, co-evolution between the host and M. tuberculosis may also play a role. Here, we reported the genomic analysis of 480 L1 isolates from patients in northern Thailand. The studied bacterial population was genetically diverse, allowing the identification of a total of 18 sublineages distributed into three major clades. The majority of isolates belonged to L1.1 followed by L1.2.1 and L1.2.2. Comparison of the single nucleotide variant (SNV) phylogenetic tree and the clades defined by spoligotyping revealed some monophyletic clades representing EAI2_MNL, EAI2_NTM and EAI6_BGD1 spoligotypes. Our work demonstrates that ambiguity in spoligotype assignment could be partially resolved if the entire DR region is investigated. Using the information to map L1 diversity across Southeast Asia highlighted differences in the dominant strain-types in each individual country, despite extensive interactions between populations over time. This finding supported the hypothesis that there is co-evolution between the bacteria and the host, and have implications for tuberculosis disease control.Entities:
Mesh:
Year: 2018 PMID: 30072734 PMCID: PMC6072702 DOI: 10.1038/s41598-018-29986-3
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The phylogenetic tree of 480 M. tuberculosis L1 isolates from Chiangrai, constructed by the Bayesian inference method. The major subgroups were labelled on the rightmost. L1.1.1.1 were the same group as reported by Coll et al. The sublineages L1.1.1.2–L1.1.1.9, labelled on the right, were named sequentially according to their positions in the tree. Some interesting spoligotypes are shown after the sublineage names. The SIT number followed by the + sign indicates that the shown SIT contributed to the majority of the sublineage. The ++ sign before the SIT number indicates that the spoligotype was only a small fraction of the sublineage. The deletion of DVR57-60 was specific to L1.1.1.6 but was not revealed by spoligotyping because the DVRs were not used for standard spoligotyping.
Figure 2PCA plots revealed the separation of sublineages. (a) shows a plot of representative isolates of L1 with axes representing eigenvector 1 and 2. The red, purple and orange dots indicate isolates belonging to L1.1, L1.2.1 and L1.2.2 respectively. (b) shows a plot of representative isolates of L1.1 with axes representing eigenvector 1 and 2. Members of L1.1.1 are represented in red. The sublineages of L1.1.2 and L1.1.3 are indicated in different shades of greyish green and green, respectively. (c) shows a plot of representative isolates of L1.1.1 with axes representing eigenvector 1 and 3. (d) shows a plot of representative isolates of L1.1.1.1, L1.1.1.3, L1.1.1.4 and L1.1.1.9 with axes representing eigenvector 2 and 3. (e) shows a plot of representative isolates of L1.2.1 with axes representing eigenvector 1 and 2. L1.2.1.1, L1.2.1.2 and L1.2.1.3 are indicated in dark purple, blue and light purple, respectively.
Summary of findings for each sublineage in this study.
| Sublineages | Number of isolates | Number of specific SNVs | Shared DVR deletions | Coll’s classifi-cation | Number of specific SNPs in Coll’s study | Common specific SNPs in both studies | Confirmed barcoding SNPs | Mean distances within sublineages /between sublineages | Probability | Fixation Indices | Depths/heights in the phylogenetic tree |
|---|---|---|---|---|---|---|---|---|---|---|---|
|
| 480 | 550 |
| 473 | 615938 G/A | 0/23 | |||||
| 354 | 23 | 38 | 22 | 4404247 G/A | 570.0/808.4.1 | <2.2 × 10−16 | 0.295 | 2/21 | |||
| 269 | 39 | 57 | 39 | 3021283 G/A | 451.1/749.7 | <2.2 × 10−16 | 0.398 | 3/20 | |||
| 14 | 44 | 138 | 44 | 3216553 G/A | 252.9/468.3 | 7.45 × 10−6 | 0.460 | 8/8 | |||
| 1.1.1.2 | 26 | 42 | 340.5/554.2 | 6.53 × 10−10 | 0.386 | 7/10 | |||||
| 1.1.1.3 | 7 | 5 | 431.8/503.7 | 0.002331 | 0.143* | 7/3 | |||||
| 1.1.1.4 | 5 | 47 | 224.2/477.2 | 0.01167* | 0.530 | 11/4 | |||||
| 1.1.1.5 | 42 | 46 | 18–21 | 323.9/505.5 | 3.12 × 10−15 | 0.359 | 11/11 | ||||
| 1.1.1.6 | 27 | 90 | 57–60 | 135.3/490.6 | 3.02 × 10−10 | 0.724 | 10/11 | ||||
| 1.1.1.7 | 44 | 29 | 269.8/453.4 | 6.79 × 10−16 | 0.405 | 10/12 | |||||
| 1.1.1.8 | 73 | 34 | 53 | 174.4/467.0 | 2.2 × 10−16 | 0.627 | 10/13 | ||||
| 1.1.1.9 | 20 | 23 | 62 (not unique) | 284.7/487.3 | 1.45 × 10−11 | 0.416 | 9/12 | ||||
| Unclassified | 11 | ||||||||||
| 32 | 12 | 154 | 12 | 487.5/750.7 | 6.49 × 10−12 | 0.351 | 3/9 | ||||
| 1.1.2.1 | 23 | 143 | 18 | 322.0/720.5 | 6.60 × 10−09 | 0.553 | 4/8 | ||||
| 1.1.2.2 | 9 | 102 | 3–4 | 2622402 G/A** | 310.0/720.5 | 4.11 × 10−5 | 0.570 | 4/5 | |||
| 53 | 29 | 33, 56 | 66 | 29 | — | 502.1/750.2 | <2.2 × 10−16 | 0.331 | 3/11 | ||
| 1.1.3.1 | 21 | 85 | 6–7, 33, 51, 56–57 | 228.6/654.6 | 3.12 × 10−8 | 0.651 | 5/10 | ||||
| 1.1.3.2 | 17 | 215 | 33, 56 | 131.4/671.9 | 6.99 × 10−7 | 0.805 | 5/9 | ||||
| 1.1.3.3 | 15 | 125 | 33, 56 | 138.4/652.3 | 3.38 × 10−6 | 0.788 | 4/9 | ||||
| 108 | 208 | 4, 10 | 87 | 55 | — | 145.3/803.0 | <2.2 × 10−16 | 0.819 | 1/14 | ||
| 1.2.1.1 (EAI2_MNL) | 12 | 63 | 4,10, 30–31 | 120.3/286.8 | 3.64 × 10−05 | 0.580 | 3/12 | ||||
| 1.2.1.2 (EAI2_NTB) | 94 | 20 | 4,10, 17–35 | 106.4/272.8 | <2.2 × 10−16 | 0.610 | 2/8 | ||||
| 1.2.1.3 | 2 | 53 | 3–4, 10, 30–31 | 76.0/201.9 | 0.2207* | 0.614 | 3/1 | ||||
| 18 | 150 | 62 (not unique) | 95 | 59 | 3470377 C/T | 430.2/851.9 | 3.22 × 10−07 | 0.495 | 2/7 |
The sublineage numbers in bold conforms to the nomenclature of Coll’s et al.[27], with the additional sublineages, named in this study, shown in regular fonts. Specific SNPs appeared in all members of the sublineages. All members of some sublineages shared DVR deletions, but the DVRs may have been deleted sporadically from members of other sublineages. Common specific SNPs in both studies indicates the number of specific SNPs that were the same in both Coll’s and this study. The probability denotes the probability that the mean SNV distances of isolates in a group from the other members in the same group were different from the ones from other isolates that were not in the same group but in the same level of grouping, by the Wilcoxon rank-sum test.
The number of isolates with various experimental spoligotypes identified in each sublineage, listed in ascending order of the octal codes.
| Major sublineage | Sub-lineage | Num-ber | Number of isolates × known spoligotypes (SIT) | Number of isolates × unclassified spoligotypes |
|---|---|---|---|---|
| 1.1.1 | 1.1.1.1 | 14 | 1 × 677777777413771 (342), 2 × 777737777413771 (618), | 1 × 717777777003371 |
| 1.1.1.2 | 26 | 1 × 737777770003771, 1 × 777717777413771 1 × 777777777411771* | ||
| 1.1.1.3 | 7 | 2 × 763777777413771 (792), 1 × 777777767413731 (1404), | 1 × 777777777410071 | |
| 1.1.1.4 | 5 | 1 × 777777767413371 | ||
| 1.1.1.5 | 42 | 15 × 774177777413731 (493), 14 × 77417777741377 (1951), 3 × 774177757413771 orphan | 1 × 574177777413731, 1 × 674167777413731, 3 × 774000017413771, 1 × 774077777410071, 1 × 774177777411771, 2 × 774177777413001, 1 × 774177777413701 | |
| 1.1.1.6 | 27 | None | ||
| 1.1.1.7 | 44 | 1 × 777717777413671 (orphan), 1 × 777577777413771(1372), 2 × 777777637413771 (orphan), 1 × 777777760000000 (786), 1 × 777777760000031 (773), 12 × 777777760000071 (1274), 4 × 777777777013771 (937), 1 × 777777777413071 (934), 3 × 777777777413700 (138), | 4 × 703577777413771 1 × 776377777413771 2 × 777777677413700 | |
| 1.1.1.8 | 73 | 3 × 577777777413671, 1 × 737777777413671, 2 × 777770777413671, 2 × 777777607413671, 1 × 777777777413411, 3 × 777777777413660, 1 × 777777777413661 | ||
| 1.1.1.9 | 20 | 1 × 717777777413731 (1316), 2 × 677777777413731 (529), 1 × 777776777413731 (735), 1 × 777777777413131 (745), | 1 × 677760377413731 | |
| 1.1.2 | 1.1.2.1 | 23 | 15 × 775777777413771 (939) | 1 × 775000007413771, 1 × 775777757413771*, 1 × 775777777403171, 4 × 775777777413471, 1 × 775777777413761 |
| 1.1.2.2 | 9 | 1 × 400037777413771 (8)*, 1 × 474000377413031 (1983), 1 × 475777777413771 (Orphan)*, 1 × 477001777413771 (1875), 1 × 477777777413031 (355), 2 × 477777777413071 (11), 1 × 477777777413731 (1182)* | 1 × 475777777413051* | |
| 1.1.3 (EAI6_BGD1) | 1.1.3.1 | 21 | 13 × 777777757413371 (292), 1 × 777777700003371 (1391), 2 × 777777743413371 (Orphan), 1 × 777777747413371 (1390) | 1 × 777577757413371 1 × 777767700003371* 1 × 777777707413371 1 × 777777750403371 |
| 1.1.3.2 | 17 | 3 × 777777657413771 | ||
| 1.1.3.3 | 15 | 1 × 777701757413771, 1 × 777777703413771, 2 × 777777757412771* | ||
| 1.2.1 (EAI2) | 1.2.1.1 (EAI2_MNL) | 12 | 1 × 677767477413771 (1490), 1 × 677777477413701 (483), 1 × 677777477413751 (287), 9 × 677777477413771 (19) | None |
| 1.2.1.2 (EAI2_NTB) | 94 | 83 × 674000003413771 (89)* (Discrepancy in 6 cases) 1 × 074000003413771 (orphan), 1 × 600000000000000 (orphan), 4 × 674000002000071 (orphan) | 1 × 474000003413771, 2 × 674000003413700, 2 × 674000003413711 | |
| 1.2.1.3 | 2 | 1 × 477777477413771 (413) | 1 × 477777477413731 | |
| 1.2.2 | 1.2.2 | 18 | 2 × 577777777413731 (477), 1 × 777767777413731 (1251)*, 2 × 777777774413731 (514), | 1 × 776177775413731, 1 × 777777774403731*, 1 × 777777774412731, 1 × 777777775413731, 1 × 777777777411631 |
| Un-classified | 11 | 1 × 773637777413771, 1 × 777637777413411 |
The spoligotypes that are found in more than one sublineage are indicated in bold typeface. No unclassified spoligotypes appeared in two sublineages. (SIT denotes the spoligotype international type).
*The spoligotypes, of which some isolates had different predicted spoligotypes.
Demographic and clinical profiles of patients infected by different sublineages in this study.
| L1.1.1 | L1.1.2 | L1.1.3 | L1.2.1 | L1.2.2 | All | |
|---|---|---|---|---|---|---|
| Total | 269 | 32 | 53 | 108 | 18 | 480 |
|
| ||||||
| Male | 198 | 27 | 37 | 78 | 14 | 354 |
| Female | 71 | 5 | 16 | 30 | 4 | 126 |
|
| ||||||
| <19 | 1 | 0 | 2 | 4 | 0 | 7 |
| 20–29 | 24 | 2 | 4 | 12 | 1 | 43 |
| 30–39 | 42 | 6 | 15 | 25 | 4 | 92 |
| 40–49 | 47 | 8 | 12 | 20 | 5 | 92 |
| 50–59 | 57 | 5 | 7 | 18 | 1 | 88 |
| 60–69 | 50 | 9 | 10 | 11 | 3 | 83 |
| >70 | 48 | 2 | 3 | 18 | 4 | 75 |
| Average ages | 53.2 | 50.3 | 45.6 | 48.5 | 51.9 | 50.7 |
| Median ages | 54 | 49 | 43 | 46 | 47 | 50 |
| % with ages >49* | 57.6 | 50.0 | 37.7 | 43.5 | 44.4 | 51.3 |
|
| ||||||
| Thai | 238 | 26 | 45 | 103 | 17 | 429 |
| Hill tribe | 24 | 3 | 6 | 3 | 1 | 37 |
| Foreigner | 7 | 3 | 2 | 2 | 0 | 14 |
|
| ||||||
| Pulmonary | 258 | 32 | 49 | 102 | 17 | 458 |
| Pulmonary and Extrapulmonary | 9 | 0 | 4 | 6 | 1 | 20 |
| Extrapulmonary | 2 | 0 | 0 | 0 | 0 | 2 |
|
| ||||||
| Positive | 49 | 5 | 12 | 29 | 4 | 99 |
| Negative | 215 | 27 | 40 | 76 | 13 | 371 |
| Unknown | 5 | 0 | 1 | 3 | 1 | 10 |
*The proportions of the patients over age 49 were different between various sublineages (Chi-square test, p = 0.0246). The relative risk of infecting by L1.1.1 of patients over age 49, compared to the ones being younger was 1.29 (p = 0.0019, 95% CI: 1.099–1.522).
Figure 3Area map showing the predominant sublineages of L1 in different countries in Southeast Asia, deduced from their spoligotype reports. There were no obvious predominant strains in Cambodia and there was no information from Lao.