| Literature DB >> 34593898 |
Paula J Gómez-González1, Joao Perdigao2, Pedro Gomes2, Zully M Puyen3, David Santos-Lazaro3, Gary Napier1, Martin L Hibberd1, Miguel Viveiros4, Isabel Portugal2, Susana Campino1, Jody E Phelan1, Taane G Clark5,6,7.
Abstract
Tuberculosis (TB), caused by Mycobacterium tuberculosis, is one of the deadliest infectious diseases worldwide. Multidrug and extensively drug-resistant strains are making disease control difficult, and exhausting treatment options. New anti-TB drugs bedaquiline (BDQ), delamanid (DLM) and pretomanid (PTM) have been approved for the treatment of multi-drug resistant TB, but there is increasing resistance to them. Nine genetic loci strongly linked to resistance have been identified (mmpR5, atpE, and pepQ for BDQ; ddn, fgd1, fbiA, fbiB, fbiC, and fbiD for DLM/PTM). Here we investigated the genetic diversity of these loci across >33,000 M. tuberculosis isolates. In addition, epistatic mutations in mmpL5-mmpS5 as well as variants in ndh, implicated for DLM/PTM resistance in M. smegmatis, were explored. Our analysis revealed 1,227 variants across the nine genes, with the majority (78%) present in isolates collected prior to the roll-out of BDQ and DLM/PTM. We identified phylogenetically-related mutations, which are unlikely to be resistance associated, but also high-impact variants such as frameshifts (e.g. in mmpR5, ddn) with likely functional effects, as well as non-synonymous mutations predominantly in MDR-/XDR-TB strains with predicted protein destabilising effects. Overall, our work provides a comprehensive mutational catalogue for BDQ and DLM/PTM associated genes, which will assist with establishing associations with phenotypic resistance; thereby, improving the understanding of the causative mechanisms of resistance for these drugs, leading to better treatment outcomes.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34593898 PMCID: PMC8484543 DOI: 10.1038/s41598-021-98862-4
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Number of variants per analysed gene across the 33,675 isolates, with the average number of mutations per sample and by resistance profile.
| Gene | Drug | Gene SNPs [Indels,fs*] | Prom. SNPs [Indels] | Total analysed [# known**] | # samples with 1 [> 1] mutations | Lineages | Ave. mut. Susc. samples | Ave. mut. MDR samples | Ave. mut. XDR samples | Ave. mut. DR samples | Diversity × 10−5*** |
|---|---|---|---|---|---|---|---|---|---|---|---|
| BDQ | 15 [1,0] | 5 [5] | 26 [1] | 48 [1] | 1–4, | 0.002 | 0 | 0 | 0.002 | 0.87 | |
| BDQ | 116 [25, 29] | 14 [4] | 163 [38] | 555 [17] | 1–6, | 0.008 | 0.040 | 0.079 | 0.017 | 3.2 | |
| BDQ | 117 [2, 3] | 0 [0] | 120 [0] | 482 [4] | 1–6 | 0.018 | 0.010 | 0.004 | 0.009 | 2.4 | |
| DLM/PTM | 118 [4, 9] | 11 [1] | 139 [4] | 4229 [35] | 1–7, | 0.141 | 0.095 | 0.075 | 0.124 | 23 | |
| DLM/PTM | 86 [16, 27] | 18 [2] | 132 [31] | 743 [21] | 1–5 | 0.025 | 0.019 | 0.015 | 0.023 | 7.6 | |
| DLM/PTM | 113 [2, 3] | 3 [0] | 119 [4] | 991 [0] | 1–5, | 0.037 | 0.016 | 0.007 | 0.019 | 5.5 | |
| DLM/PTM | 135 [1] | 0 [0] | 136 [3] | 851 [3] | 1–6, | 0.025 | 0.022 | 0.012 | 0.033 | 3.6 | |
| DLM/PTM | 280 [9, 17] | 26 [4] | 326 [4] | 2413 [45] | 1–6, | 0.079 | 0.052 | 0.058 | 0.091 | 3.8 | |
| DLM/PTM | 57 [0,0] | 9 [0] | 66 [0] | 223 [2] | 1–4, | 0.008 | 0.004 | 0.004 | 0.007 | 1.8 |
Indels = insertions and deletions; DLM = Delamanid; PTM = Pretomanid; BDQ = Bedaquiline; Prom. = promoter; Susc. = Susceptible; DR = Other drug resistance; fs = frame shifts; bov = M. bovis; * number of indels that lead to frameshifts; ** see S3 Table; *** Nei’s Pi nucleotide diversity per site (only non-synonymous SNPs considered).
Figure 1(A), (B) Frequency of mutations identified across data set. The vertical axis is the number of mutations that are found in 1 to 10 or more isolates (horizontal axis). Colours represent the different genes, each bar showing the distribution of those mutations in the candidate genes for each drug (A = Bedaquiline (BDQ), B = Delamanid (DLM)/Pretomanid (PTM)). (C), (D) Intersection of mutations in the different genes by sample. Bars represent the number of samples that hold mutations in each gene, or combination of them (horizontal bars show total samples with mutations in each gene); C = BDQ, D = DLM/PTM.
Figure 2Phylogenetic tree of high frequency (≥ 10 isolates) mutations in bedaquiline candidate genes. The outer track (c) shows the resistance phenotype; the second track (b) shows the convergent mutations that have arisen in more than one clade; the third track (a) shows the clades formed by isolates harbouring the same phylogenetic-related mutations. Branches are coloured by lineage as per legend.
Mutations in bedaquiline candidate genes occurring in at least 5 samples and more than one independent clade.
| Mutation | Gene | Freq | Sub-lineage (# isolates) | # sub-lineages | Max SNP dist.* | # Independent Occurrences | Susc. % | MDR or XDR % | Pre-2014%** | Functional support*** |
|---|---|---|---|---|---|---|---|---|---|---|
| 124 | 2.2.1(122); 4.3.2.1(1); 1.1.1(1) | 3 | 207 | 3 | 12.1 | 76.6 | 93.1 | - | ||
| 192_193insG (I67fs) | 44 | 4(34); 2.2.1(4); 3(2); 4.9(1); 4.8(1); 4.5(1); 1.1.1(1) | 7 | 60 | 10 | 0 | 86.4 | 100 | - | |
| 38 | 4.3.4.1(37); 2.2.1(1) | 2 | 168 | 2 | 52.6 | 47.4 | 72.2 | S,P | ||
| 23 | 3.1.1(22); 4.1.2(1) | 2 | 24 | 2 | 95.7 | 0 | 100 | - | ||
| M146T | 21 | 4.4.1.1(20); 2.2.2(1) | 2 | 11 | 2 | 0 | 100 | - | S,M | |
| 18 | 2.2.1(17); 4.1.2.1(1) | 2 | 33 | 2 | 94.4 | 0 | 75.0 | - | ||
| 193_193del (I67fs) | 16 | 4.3.4.2(10); 2.2.1(3); 4.7(2); 4.3.3.1(1) | 4 | 17 | 5 | 0 | 100 | 83.3 | - | |
| 141_142insC | 15 | 2.2+(8); 4.1.2+(2); 4.3+(2); 4.4.1.1(1); 3(2) | 8 | - | 11 | 6.7 | 86.7 | 85.7 | - | |
| 10 | 4.1.2.1(8); 4.3.2.1(1); 2.2.1(1) | 3 | 23 | 3 | 90 | 10 | 83.3 | M | ||
| L117R | 9 | 3(5); 4.3.4.2(2); 4.2.2(1); 4.1(1) | 4 | 98 | 5 | 44.4 | 44.4 | 100 | S | |
| 8 | 2.2.1(8) | 1 | 21 | 3 | 0 | 87.5 | 50 | S,M | ||
| G121R | 7 | 2.2.2(5); 3(1); 4.4.1.1(1) | 3 | 4 | 3 | 0 | 100 | 100 | S,P | |
| 7 | 2.2.1(6); 1.1.3(1) | 2 | 130 | 2 | 14.3 | 57.1 | 100 | S,P | ||
| R90C | 7 | 2.2.1(6); 4.1.1.3(1) | 2 | 24 | 4 | 85.7 | 0 | 50 | - | |
| N98D | 5 | 4.1.2.1(2); 4.4.1.1(2); 2.2.1(1) | 3 | 5 | 3 | 0 | 80 | 100 | - |
Sub-lineages: + = more than 1 sub-lineage; # = number; * Maximum SNP distance calculated in clades of ≥ 5 isolates; Drug resistance (%): Susc. = Susceptible; ** % of number of samples pre-2014/total number of samples with available collection date; *** Functional support: S = snap2 score ≥ 50; P = Provean Score ≤ −4; M = mCSM predicted stability change (ΔΔG) below − 2. Mutations associated with increased MIC for BDQ in previous studies in bold; mutations associated with susceptibility to BDQ underlined (see S3 Table).
Mutations in delamanid candidate genes occurring in at least 5 samples and more than one independent clade.
| Mutation | Gene | Freq | Sub-lineage (# isolates) | # sub-lineages | Max SNP distance* | # Independent Occurrences | Susc. % | MDR or XDR % | Pre-2014%** | Functional Support*** |
|---|---|---|---|---|---|---|---|---|---|---|
| 3136 | 4.1.2 + (3135); 2.2.1(1) | 3 | 1329 | 2 | 70.1 | 18.1 | 84.2 | - | ||
| 639 | 5, 6, | 7 | 3264 | 5 | 60.1 | 8.3 | 63.1 | - | ||
| 626 | 4.8(625); 1.1.1(1) | 2 | 330 | 2 | 97.9 | 0.3 | 93.6 | - | ||
| 293 | 3(293) | 1 | 496 | 3 | 57.7 | 30.0 | 51.1 | - | ||
| 267 | 5(264); 2.2.1(3) | 2 | 1402 | 2 | 70.7 | 15.4 | 91.7 | - | ||
| 162 | 6(161); 4.1.2.1(1) | 2 | 933 | 2 | 87.0 | 3.7 | 85.7 | - | ||
| I208V | 122 | 4.1.2(121); 4.1.2.1(1) | 2 | 524 | 2 | 70.5 | 11.5 | 96.9 | - | |
| 96 | 4.3.3(88); 1.1.1(8) | 2 | 87 | 2 | 8.3 | 81.3 | 90.9 | P | ||
| 79 | 2.2.1(79) | 0 | 91 | 2 | 0 | 81.0 | 100 | - | ||
| 75 | 1.1.2(75) | 1 | 345 | 2 | 76.0 | 10.7 | 70.2 | S,P | ||
| 71 | 2.2.1(70); 2.2.2(1) | 1 | 238 | 3 | 54.9 | 9.9 | 100 | - | ||
| 47 | 4.3.2(44); 4.3.4.2(3) | 2 | 147 | 2 | 89.3 | 8.5 | 0.0 | S,P | ||
| 37 | 4.1.2.1(31); 4.1.1.3(3); 6(2); 4.4.2(1) | 4 | 244 | 4 | 56.8 | 16.2 | 100 | - | ||
| 34 | 2.2.1(25); 4.3.4.2.1(9) | 2 | 30 | 2 | 26.5 | 73.5 | 94.7 | - | ||
| 21 | 2.2.2(12); 2.1(9) | 2 | 277 | 2 | 33.3 | 52.4 | 100 | S,P | ||
| L49P | 21 | 2.2.1.1(21) | 1 | 226 | 3 | 57.1 | 9.5 | 94.4 | S,P | |
| 18 | 2.2.1(17); 1.1.2(1) | 2 | 30 | 2 | 94.4 | 0 | 75.0 | P | ||
| 17 | 4.5(11); 5(6) | 2 | 241 | 2 | 100 | 0 | 75.0 | - | ||
| 16 | 4.1.2.1(12); 4.8(3); 4.4.1.1(1) | 3 | 75 | 3 | 18.8 | 31.3 | 37.5 | - | ||
| 14 | 3(14) | 1 | 419 | 2 | 50.0 | 14.3 | 14.3 | - | ||
| 13 | 2.2.1(12); 1.1.2(1) | 2 | 77 | 3 | 30.8 | 0 | 100 | - | ||
| 10 | 1.2.1(8); 4.5(1); 3(1) | 3 | 141 | 4 | 70 | 0 | 66.7 | - | ||
| 9 | 5(4); 4.2.1(3); 2.2.1(2) | 3 | 244 | 3 | 77.8 | 22.2 | 100 | - | ||
| 9 | 4.8(8); 4.3.4.2.1(1) | 2 | 50 | 2 | 88.9 | 0 | 100 | S,P | ||
| 9 | 2.2.1(8); 4.1.2(1) | 2 | 24 | 2 | 33.3 | 0 | 100 | - | ||
| 8 | 3(7); 4.5(1) | 2 | 62 | 2 | 100 | 0 | 50 | P | ||
| 8 | 2.2.2(7); 4.1(1) | 2 | 17 | 2 | 100 | 0 | 100 | - | ||
| R304Q | 8 | 3(8) | 2 | 203 | 2 | 87.5 | 0 | 50 | - | |
| 7 | 2.2.1(5); 4.9(1); 4.1.2.1(1) | 3 | 63 | 3 | 100 | 0 | 83.3 | - | ||
| 6 | 4.3.4.2.1(3); 6(3) | 2 | 320 | 2 | 66.7 | 16.7 | 100 | - | ||
| 6 | 4.9(3); 4.1.2.1(2); 2.2.1(1) | 3 | 3 | 3 | 83.3 | 0 | - | - | ||
| 5 | 4.4.1.1(3); 3(1);1.1.1(1) | 3 | 63 | 3 | 80 | 0 | 100 | S,P | ||
| 5 | 4.6.1.1(2); 4.1.2+(2); 4.8(1) | 4 | - | 4 | 100 | 0 | - | - | ||
| 5 | 3(4); 1.1.1(1) | 2 | 236 | 2 | 80 | 0 | 100 | - |
Sub-lineages: + = more than 1 sub-lineage; # = number; * Maximum SNP distance calculated in clades of ≥ 5 isolates; Drug resistance (%): Susc. = Susceptible; ** % of number of samples pre-2014/total number of samples with available collection date; *** Functional support: S = snap2 score ≥ 50; P = Provean Score ≤ −4; M = mCSM predicted stability change (ΔΔG) below − 2; Mutations associated with increased MIC for DLM/PTM in previous studies in bold; mutations associated with susceptibility to DLM/PTM underlined (see S3 Table).
Figure 3Phylogenetic tree of high frequency mutations (≥ 10 isolates) in delamanid and pretomanid candidate genes (fgd1 K270M and R64S, fbiC −32A > G and T273A, fbiA T302M and fbiB K448R found in > 290 isolates not represented). Clades formed by isolates harbouring the same mutations are differentiated by colour. The outer (c) track shows the resistance phenotype; the second track (b) shows the convergent mutations that have arisen in more than one clade; the third track (a) shows the clades formed by isolates harbouring the same phylogenetic-related mutations. Branches are coloured by lineage as per legend.