| Literature DB >> 35485492 |
Siyuan Feng1,2, Zhuoxing Wu3, Wanfei Liang1,2, Xin Zhang3, Xiujuan Cai4, Jiachen Li1,2, Lujie Liang1,2, Daixi Lin1,2, Nicole Stoesser5, Yohei Doi6,7,8, Lan-Lan Zhong1,2, Yan Liu9, Yong Xia10, Min Dai11, Liyan Zhang12, Xiaoshu Chen4, Jian-Rong Yang2,3,13, Guo-Bao Tian1,2,14.
Abstract
The antibiotic resistance crisis continues to threaten human health. Better predictions of the evolution of antibiotic resistance genes could contribute to the design of more sustainable treatment strategies. However, comprehensive prediction of antibiotic resistance gene evolution via laboratory approaches remains challenging. By combining site-specific integration and high-throughput sequencing, we quantified relative growth under the respective selection of cefotaxime or ceftazidime selection in ∼23,000 Escherichia coli MG1655 strains that each carried a unique, single-copy variant of the extended-spectrum β-lactamase gene blaCTX-M-14 at the chromosomal att HK022 site. Significant synergistic pleiotropy was observed within four subgenic regions, suggesting key regions for the evolution of resistance to both antibiotics. Moreover, we propose PEARP and PEARR, two deep-learning models with strong clinical correlations, for the prospective and retrospective prediction of blaCTX-M-14 evolution, respectively. Single to quintuple mutations of blaCTX-M-14 predicted to confer resistance by PEARP were significantly enriched among the clinical isolates harboring blaCTX-M-14 variants, and the PEARR scores matched the minimal inhibitory concentrations obtained for the 31 intermediates in all hypothetical trajectories. Altogether, we conclude that the measurement of local fitness landscape enables prediction of the evolutionary trajectories of antibiotic resistance genes, which could be useful for a broad range of clinical applications, from resistance prediction to designing novel treatment strategies.Entities:
Keywords: antibiotic resistance; evolutionary trajectories; high-throughput sequencing; prediction model; β-lactamase
Mesh:
Substances:
Year: 2022 PMID: 35485492 PMCID: PMC9087888 DOI: 10.1093/molbev/msac086
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 8.800
Fig. 1.Determining the fitness landscape of the blaCTX-M-14 gene. (A) Illustration of the experimental workflow for assessing the fitness landscape of CTX-M-14. The blaCTX-M-14 variant library was generated using “doped” oligonucleotides (3% per-site mutation rate). Then, the variant library was cloned into the integrated pOSIP-KH plasmid to create the pir+ strain pool. The plasmid library was sequenced with the PacBio Sequel System after plasmid extraction in order to determine the correspondence between genotypes and barcodes. The plasmid library was then integrated into the Escherichia coli MG1655 genome. Competition experiments were conducted in LB liquid medium containing cefotaxime or ceftazidime. After barcode amplification, Illumina HiSeq sequencing was used to obtain the frequency of mutant genotype f (Mutant) or wild-type f (WT). The relative growth of each genotype was evaluated as the increase in frequency under antibiotic selection relative to wild-type blaCTX-M-14. (B) Numbers of variants with 1–6 single-nucleotide mutations whose relative growth was determined in our experimental pipeline. (C) Mutual comparison of genotype frequencies between biological replicates at CS1, CS2, and CS3 in the presence of 1 × MIC ceftazidime. At each culture stage, three biological replicates are represented, for example, CS1 repeat 1 (CS1-R1). The color scale represents R (Pearson's correlation coefficient) between samples (see also supplementary fig. S2, Supplementary Material online). (D) SNR of the relative growth, estimated by variation among barcodes of the same genotype (see Materials and Methods). CS1–CS3 (x axis) represent three culture stages, with the corresponding OD600 values listed within parentheses. Error bars represent standard error.
Fig. 2.Fitness effects of all single-nucleotide mutations in CTX-M-14 under antibiotic selection. (A and B) Fitness landscape of blaCTX-M-14 in the presence of 1 × MIC ceftazidime (A) or cefotaxime (B) at CS3. Each tile represents a variant with one single-nucleotide mutation (x axis) at one specific position (y axis), whose relative growth is indicated by the color of the tile scaled according to the corresponding color scale bar on top. In addition, mutants with relative growth >100 are marked with black dots. (C) Functional assay of six mutations in CTX-M-14 essential for ceftazidime resistance. A representative result from three independent experiments is presented. The bars and error bars respectively represent the mean and the standard deviation. (D and E) For the relative growth measured in the presence of 1 × MIC ceftazidime (D) or cefotaxime (E), the maximum value of relative growth associated with the nine single-nucleotide substitutions at an amino acid position is indicated by colors overlaid on the three-dimensional structure of CTX-M-14 (PDB ID: 6D7H). The omega loop of CTX-M-14 required for antibiotic hydrolysis is highlighted with an oval. (F) The average relative growth of all beneficial (defined as relative growth ≥1.2 at CS3) mutants were increased in a time-dependent manner in the presence of ceftazidime and cefotaxime. The error bar represents the standard deviation. (G) Enrichment of mutations found in clinical isolates among CTX-M-14 single-nucleotide mutants with elevated relative growth in 1 × MIC ceftazidime at CS3, relative to all but nonsense mutants. The enrichment (y axis) strengthens as the threshold for relative growth (x axis) increases. The P values of hypergeometric tests are indicated. *P < 0.01, **P < 0.001. The error bar represents the standard error of mean.
Fig. 3.Pleiotropy and epistasis of CTX-M-14. (A) Pleiotropy of CTX-M-14 mutations for relative growth in ceftazidime and cefotaxime (1 × MIC at CS3). Blue dots represent synergistic pleiotropy; red dots represent antagonistic pleiotropy. Number of dots within each quadrant is respectively indicated. The binomial P value against the null expectation of equal chances of antagonistic/synergistic pleiotropy is shown. (B) Pleiotropy of subgenic regions within blaCTX-M-14 on relative growth in ceftazidime and cefotaxime (1 × MIC at CS3). Sliding windows of 10 bp (i.e., position 82 denotes the region between 82 and 91, and 83 denotes between 83 and 92, etc.) were used to analyze the whole gene encoding for the mature β-lactamase. Two-tailed binomial P values corrected for multiple testing (by the Benjamini–Hochberg procedure) are shown for each 10-bp window (y axis). The black dashed line indicates P = 1, and the gray dashed line indicates P = 0.05. (C and D) Comparison of relative growth for all single-nucleotide mutants under various concentrations of ceftazidime at CS3. Several mutations previously reported to confer an “extended-spectrum” phenotype in the CTX-M family among clinical isolates are shown in red with annotations describing the mutations. (E) The competitive indexes of three Escherichia coli MG1655 strains harboring different CTX-M-14 variants. sfGFP-labeled E. coli MG1655 (pOSIP-KH-CTX-M-14WT) was mixed with mCherry-labeled E. coli MG1655 (pOSIP-KH-CTX-M-14mutant) in a 1 : 1 ratio, and the bacteria were co-cultured in presence of the designated concentration of ceftazidime (x axis). The mixed population in the log phase of growth was analyzed by flow cytometry. The competitive index is calculated as (Mt/M0)/(WTt/WT0), whereas Mt and M0 are, respectively, the frequency of mCherry-labeled mutant cells at the beginning and end of the co-culture, and WTt and WT0 are, respectively, the frequency of sfGFP-labeled WT cells at the beginning and end of the co-culture. Error bars represent the standard error of the mean (n = 3). (F) Single-nucleotide mutations that conferred higher resistance (x axis) tended to appear more often in beneficial variants (relative growth ≥2, including variants with single or multiple mutations). N1 mutants that appeared only once were not considered. N1 mutants are variants with one single-nucleotide substitution. ρ is Spearman's rank correlation coefficient, and the related P value and regression line (blue) are shown.
Fig. 4.Prospective binary resistance prediction model (PEARP) and its clinical application. (A) Graphical illustration of the PEAR model: first, a genotype was transformed to a 792 × 4 matrix for input. The result was passed to a convolution layer and max-pooling layer to extract features and reduce the number of parameters, respectively. A subsequent BLSTM layer considers features from different regions, which are output to a fully connected final layer that summarizes the information learned by the network. (B) Schematic illustration of the prospective PEARP binary classification model. The inputs are mutant DNA sequences, and the output is a prediction of whether the sequence confers resistance to a given antimicrobial. (C) Receiver operating characteristic analysis of the test set of phenotypic variants characterized in our experiments. Ten gray lines represent ten different receiver operating characteristic curves from different random splits of the total dataset (80% training set, 10% validation set, and 10% test set). The blue line represents the mean AUC of the 10 gray lines. The light blue area represents the mean ± SD of the AUC. (D) Enrichment of clinically isolated mutant alleles among variants predicted to confer resistance by PEARP, relative to all mutants without nonsense mutations. The enrichment (y axis) increases as the maximum number of mutations (x axis) increases (details in section Materials and Methods). The P values of hypergeometric tests with the null distribution of all mutants are indicated. ***P < 10−6, ****P < 10−9. The error bar represents the standard error. AUC, area under the curve.
Fig. 5.Retrospective prediction of the evolutionary path. (A) Schematic illustration of the PEARR regression model, which is able to model plausible evolutionary trajectories for mutants similar to blaCTX-M-14/14 based on the comparison of the relative growth of input genotypes. The inputs are ancestral and final genotypes, and the output is a prediction of whether there is a plausible evolutionary trajectory under the assumption that the predicted changes in relative growth for each mutational step should be the best among all alternatives and not detrimental. (B) The predicted evolutionary trajectories of a clinical isolate with five single-nucleotide substitutions in the ancestor of blaCTX-M-14. The black numbers and lines represent log2(MIC) values and corresponding evolutionary trajectories (see Materials and Methods), respectively. The red numbers and lines represent the log10(relative growth) from predictions and the corresponding evolutionary trajectories, respectively. Filled colored squares reflect corresponding mutations at the position, as shown in the top right legend.