| Literature DB >> 31086182 |
Maha R Farhat1,2, Luca Freschi3, Roger Calderon4, Thomas Ioerger5, Matthew Snyder6, Conor J Meehan7, Bouke de Jong7, Leen Rigouts7, Alex Sloutsky8, Devinder Kaur9, Shamil Sunyaev3,10, Dick van Soolingen11, Jay Shendure6,12,13, Jim Sacchettini5, Megan Murray14.
Abstract
Drug resistance diagnostics that rely on the detection of resistance-related mutations could expedite patient care and TB eradication. We perform minimum inhibitory concentration testing for 12 anti-TB drugs together with Illumina whole-genome sequencing on 1452 clinical Mycobacterium tuberculosis (MTB) isolates. We evaluate genome-wide associations between mutations in MTB genes or non-coding regions and resistance, followed by validation in an independent data set of 792 patient isolates. We confirm associations at 13 non-canonical loci, with two involving non-coding regions. Promoter mutations are measured to have smaller average effects on resistance than gene body mutations. We estimate the heritability of the resistance phenotype to 11 anti-TB drugs and identify a lower than expected contribution from known resistance genes. This study highlights the complexity of the genomic mechanisms associated with the MTB resistance phenotype, including the relatively large number of potentially causal loci, and emphasizes the contribution of the non-coding portion of the genome.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31086182 PMCID: PMC6513847 DOI: 10.1038/s41467-019-10110-6
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Genetic and phenotypic data of 1452 isolates. a MIC distributions for four drugs. Dotted red line represents the WHO recommended critical concentration on 7H10 media; blue line represents the lower limit of achievable serum concentration from pharmacodynamic studies (Supplementary Table 2). b Allele frequency distribution relative to H37Rv. The isolate count axis interrupted and scaled to accommodate the data range. c Heatmap displaying genome-level similarity of the isolates. Similarity assessed as isolate–isolate genetic co-variance. Isolates indexed in lineage order L1, L2, L3, L4 (4., 4.1, 4.3), and L5 as shown. Co-variance ruler displayed on the far right with darker red indicating higher similarity/co-variance. Phylogenetic tree of the same isolates is given in Supplementary Fig. 4. Figure 1a–c can be regenerated using code in Supplementary Data 8 that will refer to data items provided in the Source Data file (compressed source data folder)
Non-canonical regions confirmed in the second validation GWAS and four canonical regions for comparison
| Locus/site | Drug | AF | Scaled effect size | Scaled SE | Drug | AF | OR | SE | ||
|---|---|---|---|---|---|---|---|---|---|---|
| EMB, INH, RIF, PZA, KAN | 0.071 | 0.52 | 0.07 | 1E-13 | EMB, INH, RIF, PZA | 0.066 | 1.25 | 0.10 | 3.6E-03 | |
| AMI | 0.050 | 2.72 | 0.63 | 4E-06 | AMI | 0.283 | 1.11 | 0.03 | 4.7E-04 | |
| RIF | 0.069 | 1.86 | 0.47 | 2E-05 | AMI, STR | 0.293 | 1.10 | 0.03 | 8.8E-04 | |
| CAP, AMI, KAN | 0.037 | 0.59 | 0.15 | 3E-05 | CAP, AMI | 0.069 | 1.16 | 0.06 | 2.7E-03 | |
| INH, RIF | 0.042 | 0.77 | 0.20 | 8E-05 | KAN, AMI, RIF, INH | 0.067 | 1.29 | 0.06 | 2.8E-08 | |
| ETA | 0.070 | 0.78 | 0.21 | 1E-04 | CAP, KAN, STR | 0.015 | 1.27 | 0.09 | 8.0E-04 | |
| PZA | 0.112 | 0.54 | 0.14 | 1E-04 | EMB | 0.334 | 1.24 | 0.08 | 4.9E-04 | |
| KAN, AMI | 0.048 | 1.14 | 0.33 | 1E-04 | KAN, AMI | 0.047 | 1.18 | 0.06 | 2.3E-03 | |
| RFB, RIF | 0.053 | 1.21 | 0.34 | 2E-04 | AMI, STR | 0.283 | 1.11 | 0.03 | 4.7E-04 | |
| EMB | 0.067 | 0.64 | 0.17 | 2E-04 | STR,AMI INH,CAP | 0.203 | 1.29 | 0.08 | 2.6E-05 | |
| KAN | 0.052 | 1.64 | 0.47 | 3E-04 | AMI, CAP | 0.274 | 1.11 | 0.03 | 2.7E-04 | |
| AMI | 0.018 | 0.74 | 0.22 | 3E-04 | STR | 0.016 | 1.32 | 0.13 | 3.9E-03 | |
| PZA | 0.137 | 0.80 | 0.24 | 6E-04 | RIF,INH, PZA,EMB STR | 0.235 | 1.60 | 0.16 | 2.0E-06 | |
| INH | 0.03 | 1.10 | 0.26 | 2E-05 | – | |||||
| INH | 0.10 | 0.63 | 0.16 | 6E-05 | – | |||||
| PZA | 0.24 | 1.40 | 0.07 | 4E-89 | – | |||||
| PZA | 0.01 | 0.81 | 0.20 | 7E-5 | – |
All drugs to which they were found to be associated are listed. The first drug listed was the drug found to be most significantly associated and for which the GWAS results are listed in the subsequent columns. Left half of the table represents the test GWAS results (n = 1452) and the rightward half lists results of the validation (n = 792). Four known/canonical resistance loci are also listed along with their respective allele frequency, effect size, and P value for comparison. Full results detailed in Supplementary Data Files 2–3. Drug abbreviations detailed in Supplementary Table 2. AF: allele frequency. OR: odds ratio. SE standard error. Scaled effect size and SE are on the logMIC scale. *transmembrane protein, **integral membrane transport protein, ***probable methyl transferase and membrane protein, ^transcriptional regulator, †structure-specific helicase
PVE for each drug attributable to subsets of genetic variations
| Drug | All PVE | All SE | woDR PVE | woDR SE | woDRwoNC PVE | woDRwoNC SE | DR-related PVE | NC-related PVE |
|---|---|---|---|---|---|---|---|---|
| INH | 0.809 | 0.020 | 0.732 | 0.032 | 0.723 | 0.029 | 0.08 | 0.01 |
| RIF | 0.838 | 0.017 | 0.701 | 0.034 | 0.692 | 0.033 | 0.14 | 0.01 |
| RFB | 0.833 | 0.023 | 0.722 | 0.041 | 0.693 | 0.042 | 0.11 | 0.03 |
| EMB | 0.748 | 0.027 | 0.674 | 0.036 | 0.665 | 0.035 | 0.07 | 0.01 |
| PZA | 0.659 | 0.038 | 0.634 | 0.044 | 0.602 | 0.043 | 0.03 | 0.03 |
| KAN | 0.833 | 0.022 | 0.671 | 0.040 | 0.658 | 0.041 | 0.16 | 0.01 |
| AMI | 0.879 | 0.019 | 0.639 | 0.057 | 0.640 | 0.055 | 0.24 | <0.01 |
| CAP | 0.743 | 0.030 | 0.690 | 0.038 | 0.666 | 0.038 | 0.05 | 0.02 |
| ETA | 0.701 | 0.034 | 0.689 | 0.038 | 0.675 | 0.038 | 0.01 | 0.01 |
| STR | 0.710 | 0.033 | 0.604 | 0.047 | 0.601 | 0.045 | 0.11 | <0.01 |
| MXF | 0.643 | 0.058 | 0.494 | 0.083 | 0.456 | 0.081 | 0.15 | 0.04 |
PVE all measurable genetic variation given along with PVE related to variation excluding known drug resistance regions and non-canonical regions associated in this study (n = 1452 isolates). Drug abbreviations detailed in Supplementary Table 2. PVE: proportion of variance explained. DR: drug resistance regions as detailed in the methods. NC: Non-canonical regions specified in Table 1. wo: without. SE: standard error
Fig. 2Promoter vs gene body mutations and their effect on MIC (y axis in mg per L) for four drugs (n = 1452). Wilcoxon rank sum test comparing each pair was significant at p < 0.01. For reference critical resistance testing concentration is 0.2 mg per L for INH, 5 mg per L for ETA, 100 mg per L for PZA, 5 mg per mL for EMB and indicated by the horizontal dotted line. Figure 2 can be regenerated using code in Supplementary Data 8 that will refer to data items provided in the Source Data file (compressed source data folder)