| Literature DB >> 31653940 |
Nguyen Thi Le Hang1, Minako Hijikata2, Shinji Maeda3, Pham Huu Thuong4, Jun Ohashi5, Hoang Van Huan4, Nguyen Phuong Hoang6, Akiko Miyabayashi2, Vu Cao Cuong7, Shintaro Seto2, Nguyen Van Hung8, Naoto Keicho9,10.
Abstract
Drug-resistant tuberculosis (TB) is a serious global problem, and pathogen factors involved in the transmission of isoniazid (INH)-resistant TB have not been fully investigated. We performed whole genome sequencing of 332 clinical Mycobacterium tuberculosis (Mtb) isolates collected from patients newly diagnosed with smear-positive pulmonary TB in Hanoi, Vietnam. Using a bacterial genome-wide approach based on linear mixed models, we investigated the associations between 31-bp k-mers and clustered strains harboring katG-S315T, a major INH-resistance mutation in the present cohort and in the second panel previously published in South Africa. Five statistically significant genes, namely, PPE18/19, gid, emrB, Rv1588c, and pncA, were shared by the two panels. We further identified variants of the genes responsible for these k-mers, which are relevant to the spread of INH-resistant strains. Phylogenetic convergence test showed that variants relevant to PPE46/47-like chimeric genes were significantly associated with the same phenotype in Hanoi. The associations were further confirmed after adjustment for the confounders. These findings suggest that genomic variations of the pathogen facilitate the expansion of INH-resistance TB, at least in part, and our study provides a new insight into the mechanisms by which drug-resistant Mtb maintains fitness and spreads in Asia and Africa.Entities:
Mesh:
Year: 2019 PMID: 31653940 PMCID: PMC6814805 DOI: 10.1038/s41598-019-51812-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Frequencies of known mutations conferring resistance to first-line drugs among all isolates, phenotypically resistant isolates, and susceptible isolates in the Hanoi sample set (N = 332).
| Drug resistance-conferring mutations | Study population | Phenotypically resistant isolates | Phenotypically susceptible isolates |
|---|---|---|---|
|
| N = 332 | N = 88 | N = 244 |
| (−) | 233 (70.2) | 8 (9.1) | 225 (92.2) |
| 87 (26.2) | 75 (85.3) | 12 (4.9) | |
| Rv1482c | 7 (2.1) | 3 (3.4) | 4 (1.6) |
| Rv1482c | 1 (0.3) | 1 (1.1) | 0 (0.0) |
| Rv1482c | 1 (0.3) | 1 (1.1) | 0 (0.0) |
| 1 (0.3) | 0 (0.0) | 1 (0.4) | |
| 1 (0.3) | 0 (0.0) | 1 (0.4) | |
| 1 (0.3) | 0 (0.0) | 1 (0.4) | |
|
| N = 332 | N = 86 | N = 246 |
| (−) | 259 (78.0) | 23 (26.6) | 236 (96.0) |
| 44 (13.3) | 42 (48.8) | 2 (0.8) | |
| 14 (4.2) | 11 (12.8) | 3 (1.2) | |
| 6 (1.8) | 6 (7.0) | 0 (0.0) | |
| 1 (0.3) | 1 (1.2) | 0 (0.0) | |
| 1 (0.3) | 1 (1.2) | 0 (0.0) | |
| 1 (0.3) | 1 (1.2) | 0 (0.0) | |
| 2 (0.6) | 1 (1.2) | 1 (0.4) | |
| 1 (0.3) | 0 (0.0) | 1 (0.4) | |
| 1 (0.3) | 0 (0.0) | 1 (0.4) | |
| 2 (0.6) | 0 (0.0) | 2 (0.8) | |
|
| N = 332 | N = 13 | N = 319 |
| (−) | 310 (93.4) | 1 (7.7) | 309 (96.9) |
| 9 (2.7) | 4 (30.8) | 5 (1.6) | |
| 4 (1.2) | 4 (30.8) | 0 (0.0) | |
| 2 (0.6) | 2 (15.4) | 0 (0.0) | |
| 1 (0.3) | 1 (7.7) | 0 (0.0) | |
| 1 (0.3) | 1 (7.7) | 0 (0.0) | |
| 1 (0.3) | 0 (0.0) | 1 (0.3) | |
| 1 (0.3) | 0 (0.0) | 1 (0.3) | |
| 3 (0.9) | 0 (0.0) | 3 (0.9) | |
|
| N = 332 | N = 5 | N = 327 |
| (−) | 303 (91.3) | 0 (0.0) | 303 (92.7) |
| 6 (1.8) | 3 (60.0) | 3 (0.9) | |
| 13 (3.9) | 1 (20.0) | 12 (3.7) | |
| 1 (0.3) | 1 (20.0) | 0 (0.0) | |
| 1 (0.3) | 0 (0.0) | 1 (0.3) | |
| 2 (0.6) | 0 (0.0) | 2 (0.6) | |
| 1 (0.3) | 0 (0.0) | 1 (0.3) | |
| 4 (1.2) | 0 (0.0) | 4 (1.2) | |
| 1 (0.3) | 0 (0.0) | 1 (0.3) | |
|
| N = 332 | N = 9 | N = 323 |
| (−) | 319 (96.1) | 4 (44.4) | 315 (97.5) |
| 3 (0.9) | 2 (22.2) | 1 (0.3) | |
| 1 (0.3) | 1 (11.1) | 0 (0.0) | |
| 1 (0.3) | 1 (11.1) | 0 (0.0) | |
| 3 (0.9) | 1 (11.1) | 2 (0.6) | |
| 1 (0.3) | 0 (0.0) | 1 (0.3) | |
| 1 (0.3) | 0 (0.0) | 1 (0.3) | |
| 1 (0.3) | 0 (0.0) | 1 (0.3) | |
| 1 (0.3) | 0 (0.0) | 1 (0.3) | |
| 1 (0.3) | 0 (0.0) | 1 (0.3) |
Figure 1Distribution of drug resistant-relevant mutations and their presence in correlation with Mtb lineages and clustering. Clustered: defined by pairwise SNV difference <6 SNVs. Different colors indicate different mutations.
Genes harboring k-mers significantly associated with different phenotypes (from (1) to (6)) in Hanoi and KwaZulu-Natal studies.
| Phenotype | Gene name | Number of genes | |||||
|---|---|---|---|---|---|---|---|
| (1) | (2) | (3) | (4) | (5) | (6) | ||
| No | Yes |
| Yes | Yes |
|
|
|
| No | Yes | No | Yes | No | Yes |
| 1 |
| No | Yes | Yes | No | Yes | No |
| 1 |
| No | No |
| Yes | Yes |
|
|
|
| Yes | No | Yes | No | No | No |
| 3 |
| No | No |
| No | No |
|
|
|
| No | No |
| Yes | No |
|
|
|
| No | No | No | Yes | No | Yes |
| 10 |
| No | No | No | Yes | Yes | Yes |
| 11 |
| No | Yes | Yes | No | No | No |
| 29 |
| No | No | No | No | Yes | No |
| 3 |
| Yes | No | No | No | No | No |
| 4 |
| No | Yes | No | No | No | No |
| 16 |
| No | No | No | No | No | Yes |
| 20 |
| No | No | Yes | No | No | No |
| 356 |
Phenotype (1): clustered strains (cluster[+]) (pairwise SNV difference between two isolates is no more than five SNVs) in Hanoi’s study cohort; (2): strains harboring katG-S315 T mutation (S315 T[+]) in Hanoi’s cohort; (3): cluster[+]/S315T[+] in Hanoi’s cohort; (4): cluster[+] in KwaZulu-Natal’s cohort; (5) S315T[+] in KwaZulu-Natal’s cohort; (6): cluster[+]/S315T[+] in KwaZulu-Natal’s cohort; SNV: single nucleotide variant.
The genes in bold are associated with cluster[+]/S315T[+], and shared between Hanoi and KwaZulu-Natal study population.
Figure 2Venn diagram showing the number of genes significantly associated with the given phenotypes and shared by the two study cohorts. (a) results from k-mer-based GWAS; (b) results from phyOverlap; cluster[+]: clustered (pairwise SNV difference between two isolates is no more than five SNVs); S315T[+]: harboring katG-S315T mutation; Hanoi: Hanoi’s study cohort; KwaZulu-Natal: KwaZulu-Natal’s study cohort.
Figure 3Distribution of k-mers derived from variant (or wild) types of five genes (PPE18/19, gid, emrB, Rv1588c, and pncA) that showed positive or negative associations with clustered strains carrying katG-S315T in the phylogenetic trees of Hanoi (a) and KwaZulu-Natal (b) study population.
Genes with k-mers significantly associated with clustered strains harboring katG-S315T mutation (cluster[+]/S315T[+]) in both Hanoi and KwaZulu-Natal study cohorts, further investigated by using the DBGWAS platform and variant/deletion-based search.
| Gene name | Sample panel | K-mer-based GWAS | Strongest P value | DBGWAS* | Strongest P value | Variant-based GWAS/deletion-based GWAS/BLAST search | Strongest P value | Conclusion on variant(s) corresponding to significant k-mer(s) |
|---|---|---|---|---|---|---|---|---|
|
| Hanoi | 36 k-mers, SNV at 1339644 and 1339649 | 1.840E-09 | 1 unitig, SNVs at 1339644 and 1339649 | 5.56188E-07 | SNVs at 1339644 (E99A) and 1339649 (A101T) found by variant-based GWAS | 1.941E-09 | Significant SNVs E99A and A101T |
|
| KwaZulu-Natal | 5 k-mers, SNV at 1340231 | 2.998E-08 | 1 unitig**, SNV at 1532777 | 4.8561E-10 | SNV at 1532777 (Q286R) detected by BLAST search. By variant-based GWAS, SNV was not called properly | NA | SNV Q286R |
|
| Hanoi | 33 k-mers, SNV at 4407686 | 5.213E-08 | 3 unitgs, SNV at 4407686 | 3.676E-05 | SNV at 4407686 (E173*) fount by variant-based GWAS | 2.991E-06 | Significant SNV E173* |
|
| KwaZulu-Natal | 101 k-mers | 7.802E-07 | 7 unitigs, no SNV | 7.943E-09 | Large deletions (from 120 bp to 675 bp) | 2.178E-04 | Large deletions |
|
| Hanoi | 31 k-mers, SNV at 876918 | 8.579E-08 | 2 unitigs, SNV at 876918 | 1.704E-05 | SNV at 876918 (F508S) found by variant-based GWAS | 1.507E-08 | Significant SNV F508S |
|
| KwaZulu-Natal | 31 k-mers, SNV at 877058 | 7.802E-07 | 2 unitigs**, SNV at 877058 | 7.943E-09 | SNV at 877058 (I461I) found by variant-based GWAS | 1.077E-06 | Significant SNV I461I |
|
| Hanoi | 4 k-mers without SNV (neg.***) | 8.579E-08 | 2 unitigs**, no SNV | 1.704E-05 | SNV at 1789735 (P34P) found by variant-based GWAS and BLAST search | 1.511E-08 | Significant SNV P34P |
|
| KwaZulu-Natal | 12 k-mers without SNV (neg.***) | 2.453E-06 | 0 unitig | NA | No SNV inside the corresponding k-mers | NA | None |
|
| Hanoi | 31 k-mers without SNV (neg.***) | 4.437E-07 | 0 unitig ( | NA | SNV at 2288820 (Q141P) found by variant-based GWAS | 4.919E-07 | Significant SNV Q141P |
|
| KwaZulu-Natal | 29 k-mers, SNV at 2288785 and 2288788 | 4.802E-11 | 1 unitig, G insertion at 2288785 | 7.639E-12 | SNV at 2288785 (T153fs) found by variant-based GWAS | 6.569E-11 | Significant SNV T153fs |
cluster[+]: clustered (pairwise SNV difference between two isolates is no more than five SNVs); S315T[+]: harboring katG-S315T mutation; Hanoi: Hanoi’s study cohort; KwaZulu-Natal: KwaZulu-Natal’s study cohort; GWAS: genome wide association study; SNV: single nucleotide variant.
*Only significant unitigs and their SNVs are shown.
**Found by sequence search.
***Showing negative association with Cluster[+]/S315T[+] by logistic regression analyses
Genes with deletions/insertions significantly associated with clustered strains harboring katG-S315T mutation among Hanoi samples obtained by GWAS, and corresponding k-mers.
| Reference gene* | Locus in H37Rv (AL123456) | Deletion/insertion | P value | Principal component | K-mers | Strongest P value |
|---|---|---|---|---|---|---|
| Details | No of significant k-mers/No of k-mers | |||||
| AL123456.3H37Rv|00828| | Rv0790c | 3-bp insertion at 884090 to 884092 | 1.507E-08 | 9 | 42/46 | 1.43E-08 |
| AL123456.3H37Rv|01092| | Rv1043c | 1-bp deletion at 1166499 | 1.941E-09 | 9 | 45/61 | 1.43E-08 |
| AL123456.3H37Rv|02117| | Rv2025c | Deletion of almost all Rv2025c gene, also Rv2024c (789-bp) | 9.053E-08 | 9 | 819/833 | 1.43E-08 |
| AL123456.3H37Rv|02393| | Rv2286c | 52-bp deletion from 2559452 to 2559503 | 1.507E-08 | 9 | 69/123 | 1.43E-08 |
| AL123456.3H37Rv|03394| | Rv3230c | 1-bp deletion at 3607421 | 9.053E-08 | 9 | 59/63 | 1.84E-09 |
| AL123456.3H37Rv|02535| | Rv2416c | 2-bp deletion at 2714527–2714528 | 3.219E-06 | 12 | 0/60 | 1.32E-05 |
| AP018033.1HN024|01030| | Best hit with Rv0974c | 238-bp deletion from 1086195 to 1086432 | 2.991E-06 | 12 | — | — |
| AP018033.1HN024|02797| | Best hit with Rv2690c | 12-bp deletion | 1.720E-05 | 12 | — | — |
| AP018036.1HN506|03217| | Best hit with Rv3077 | 459-bp insertion | 1.720E-05 | 12 | — | — |
PC-9 correlated gene variants were observed in 14 samples (HN-042, HN-065, HN-120, HN-127, HN-152, HN-154, HN-170, HN-179, HN-186, HN-222, HN-253, HN-391, HN-441, HN-456) that belong to ancient Beijing sublineages.
PC-12 correlated gene variants were observed in 10 samples (HN-019, HN-075, HN-109, HN-137, HN-169, HN-324, HN-330, HN-435, HN-462, HN-506) that belong to modern Beijing sublineages.
*Genome sequences of clinical isolates in our Hanoi cohort, AP018033 to AP018036 as well as AL123456 (H37Rv genome) were used for reference.
Genes detected by the phyOverlap method and their significant associations with the Hanoi strains harboring the katG-S315T mutation, with and without clustering, and loci shared with the KwaZulu-Natal study population.
| Locus | Gene name | No of SNV | Parsimony score | P value | FDR (q value) | Remark |
|---|---|---|---|---|---|---|
| Rv3021c |
| 12 | 73 | 0.00000 | 0.00000 | PE/PPE |
| Rv3511 |
| 13 | 18 | 0.00000 | 0.00000 | PE-PGRS family protein |
| Rv0278c-Rv0279c |
| 2 | 37 | 0.00002 | 0.02434 | PE-PGRS family protein |
| Rv1068c |
| 13 | 27 | 0.00002 | 0.02434 | PE-PGRS family protein |
|
|
|
|
|
|
|
|
| Rv0682 |
| 2 | 21 | 0.00000 | 0.00000 | Streptomycin-resistance related gene |
| Rv0758 |
| 8 | 8 | 0.00002 | 0.01217 | Sensor part of a two component regulatory system |
|
|
|
|
|
|
|
|
| Rv3021c |
| 12 | 73 | 0.00000 | 0.00000 | PE/PPE |
| Rv3169 |
| 5 | 5 | 0.00002 | 0.01217 | Unknown |
| Rv3428c-Rv3429 |
| 28 | 247 | 0.00000 | 0.00000 | |
| Rv3647c-Rv3648c |
| 4 | 4 | 0.00006 | 0.03245 | |
| Rv3680-Rv3681c |
| 1 | 78 | 0.00000 | 0.00000 | |
The genes in bold are shared between Hanoi and KwaZulu-Natal study population.