| Literature DB >> 34342537 |
Nicole Pearcy1, Yue Hu1, Michelle Baker1, Alexandre Maciel-Guerra1,2, Ning Xue1, Wei Wang3, Jasmeet Kaler1, Zixin Peng3, Fengqin Li3, Tania Dottorini1.
Abstract
Antimicrobial resistance (AMR) is becoming one of the largest threats to public health worldwide, with the opportunistic pathogen Escherichia coli playing a major role in the AMR global health crisis. Unravelling the complex interplay between drug resistance and metabolic rewiring is key to understand the ability of bacteria to adapt to new treatments and to the development of new effective solutions to combat resistant infections. We developed a computational pipeline that combines machine learning with genome-scale metabolic models (GSMs) to elucidate the systemic relationships between genetic determinants of resistance and metabolism beyond annotated drug resistance genes. Our approach was used to identify genetic determinants of 12 AMR profiles for the opportunistic pathogenic bacterium E. coli. Then, to interpret the large number of identified genetic determinants, we applied a constraint-based approach using the GSM to predict the effects of genetic changes on growth, metabolite yields, and reaction fluxes. Our computational platform leads to multiple results. First, our approach corroborates 225 known AMR-conferring genes, 35 of which are known for the specific antibiotic. Second, integration with the GSM predicted 20 top-ranked genetic determinants (including accA, metK, fabD, fabG, murG, lptG, mraY, folP, and glmM) essential for growth, while a further 17 top-ranked genetic determinants linked AMR to auxotrophic behavior. Third, clusters of AMR-conferring genes affecting similar metabolic processes are revealed, which strongly suggested that metabolic adaptations in cell wall, energy, iron and nucleotide metabolism are associated with AMR. The computational solution can be used to study other human and animal pathogens. IMPORTANCE Escherichia coli is a major public health concern given its increasing level of antibiotic resistance worldwide and extraordinary capacity to acquire and spread resistance via horizontal gene transfer with surrounding species and via mutations in its existing genome. E. coli also exhibits a large amount of metabolic pathway redundancy, which promotes resistance via metabolic adaptability. In this study, we developed a computational approach that integrates machine learning with metabolic modeling to understand the correlation between AMR and metabolic adaptation mechanisms in this model bacterium. Using our approach, we identified AMR genetic determinants associated with cell wall modifications for increased permeability, virulence factor manipulation of host immunity, reduction of oxidative stress toxicity, and changes to energy metabolism. Unravelling the complex interplay between antibiotic resistance and metabolic rewiring may open new opportunities to understand the ability of E. coli, and potentially of other human and animal pathogens, to adapt to new treatments.Entities:
Keywords: Escherichia coli; antimicrobial resistance; genome-scale metabolic model; machine learning
Year: 2021 PMID: 34342537 PMCID: PMC8409726 DOI: 10.1128/mSystems.00913-20
Source DB: PubMed Journal: mSystems ISSN: 2379-5077 Impact factor: 6.496
FIG 1k-mer-based supervised machine learning prediction of antibiotic resistance signature profiles to 12 antibiotics in the E. coli cohort. Boxplots showing the prediction performance results of the gradient boosting classifier for the 50 iterations. The performance indicators (y axis) are accuracy, precision, recall, and AUC. Predictive models were generated to classify the resistance versus susceptibility profiles of 12 different antibiotics (x axis).
Known AMR genes identified by the k-mer-based AMR classifiers
| Antibiotic | Drug class | Known AMR gene(s) to the antibiotic | Known AMR genes associated with other antibiotics |
|---|---|---|---|
| Ampicillin | Beta-lactam | TEM-1**, CTX-M-15, | |
| Aztreonam | Beta-lactam | CTX-M-55* | AAC(6′)-Ib-cr, |
| Cefepime | Beta-lactam | CTX-M-1**, CTX-M-15, CTX-M-55 | dfrA25*, AAC(6′)-Ib10*, AAC(3)-IId, |
| Cefoxitin | Beta-lactam | CMY-2*, | |
| Ciprofloxacin | Fluoroquinolone | OXA-1*, CTX-M-15*, | |
| Gentamicin | Aminoglycoside | AAC(3)-IId**, AAC(6′)-Ib7**, | |
| Levofloxacin | Fluoroquinolone | ||
| Tetracycline | Tetracycline | APH(6)-Id, | |
| Tobramycin | Aminoglycoside | AAC(3)-IId**, AAC(6′)-Ib-cr**, AAC(3)-IIe, AAC(6′)-Ib7 | |
| Trimethoprim | Diaminopyrimidine | ANT(2′′)-Ia**, |
Genes in the top 10% features, ranked according to their maximum feature important assigned by the GBC classifier, are presented.
Symbols: **, gene was associated with feature in the top 10% features; *, gene was associated with feature in the top 50% features.
FIG 2SNP-based supervised machine learning prediction of antibiotic resistance signature profiles to 12 antibiotics in the E. coli cohort. Boxplots showing the prediction performance results of the gradient boosting classifier of the 50 iterations. The performance indicators (y axis) are accuracy, precision, recall, and AUC. Predictive models were generated to classify the resistance versus susceptibility profiles of 12 different antibiotics (x axis).
Known AMR genes identified by the SNP-based AMR classifiers
| Antibiotic | Known AMR genes to the antibiotic | Known AMR genes associated with other antibiotics |
|---|---|---|
| Ciprofloxacin | ||
| Levofloxacin | ||
| Meropenem |
Genes in the top 10% features, ranked according to their maximum contribution to the classifier, are presented.
Symbols: **, gene was associated with feature in the top 10% features; *, gene was associated with feature in the top 50% features.
FIG 3Number of metabolic genes occurring in the 11 AMR classifiers. (a) Bar chart showing proportions of metabolic genes compared to the entire set of genes found in each AMR model. The blue bars represent gene proportions from the k-mer AMR models, whereas the red bars represent gene proportions from the SNP AMR models (AUC > 95%). (b) Heatmap showing the Jaccard index comparing the gene sets between two antibiotic classes. (c) Pie chart showing the proportions of genes associated with 10 metabolic systems (outer ring presented using the “tab10” color theme in Matplotlib). The inner ring shows the proportion of genes from each antibiotic class associated with each metabolic system and is presented using the “Set3” color theme in Matplotlib. Note that genes contributing to multiple antibiotic classifications will contribute multiple times in the pie chart, and therefore, the total area of the pie chart does not amount to 289. (d) Heatmap showing the normalized number of genes associated with each metabolic system. Note that the number of genes was normalized via column standardization. Hierarchical clustering was applied to both rows (metabolic systems) and columns (antibiotic classes) using the single linkage method and Euclidean distance as the metric. Each panel shows the results for the top 10% of genes identified in each AMR classifier. Panels b, c, and d show the results for the 289 genes found by combining the genes that correspond to the features in the top 10% of the k-mer and SNP classifications.
In silico-predicted gene lethality from the top-ranked discriminant genes in k-mer-based and SNP-based classifiers listed for each antibiotic
| Antibiotic | Essential genes (rich media) | Essential genes (glucose minimal medium only) |
|---|---|---|
| Ampicillin |
| |
| Aztreonam |
| |
| Cefepime |
| |
| Cefoxitin | ||
| Ciprofloxacin | ||
| Levofloxacin |
| |
| Gentamicin |
| |
| Meropenem |
|
|
| Tetracyline |
| |
| Tobramycin |
| |
| Trimethoprim |
|
Symbol: *, genes associated with top 50 ranked features of the antibiotic AMR model. Boldface genes have not been found essential in experimental studies.
FIG 4An overview of the metabolic pathways involving potential gene targets for E. coli. The genes accA, lptG, fabD, fabG, murG, mraY, folP, glmM, and metK were all found to be essential in the GSM of E. coli, whereas knockout of the genes hisA and thiD all resulted in auxotrophic behavior. The genes fucK, fucI, nupG, speB, uxaA, uxaB, dgoD, uidB, and ttdB were all found to be essential to the growth on alternative carbon sources. Note that all genes presented corresponded to the top 50 features of the AMR models. Note that the antibiotic that each of these genes were found important to by the AMR models are provided. Abbreviations: 2-dehydro-3-deoxy-d-galactonate (2-DH3DGAL), 2-dehydro-3-deoxy-d-galactonate 6-phosphate (2-DH3DGAL-6P), fuculose 1-phosphate (fuculose-1P), dihydroxyacetone phosphate (DHAP), glyceraldehyde 3-phosphate (glyceraldehyde-3P), tagaturonate (TAG), altronate (ALTR), 2-dehydro-3-deoxy-d-galactonate 6-phosphate (2-DH3DGLUC-6P), 2-dehydro-3-deoxygluconate (2-DH3DGLUC), 1-O-methyl-beta-d-glucuronic acid (MG), oxalacetate (OXA), citrate (CIT), isocitrate (ICIT), alpha-ketoglutarate (AKG), succinyl-CoA (SUC-CoA), succinate (SUC), fumarate (FUM), malate (MAL), tetrahydrofolate (THF), glucose 6-phosphate (glucose-6P), fructose 6-phosphate (fructose-6P), guanosine-triphosphate (GTP), ribulose 5-phosphate (ribulose-5P), 5-phospho-alpha-d-ribose 1-diphosphate (PRPP), phosphoribosyl-ATP (PRBATP), phosphoribulosyl-formimino-5-aminoimidazole-4-carboxamide ribonucleotide phosphate (PRFAR), 5′- 5-aminoimidazole ribonucleotide (AIR), 4-amino-2-methyl-5-phosphomethylpyrimidine (4AMPM), 2-methyl-4-amino-5-hydroxymethylpyrimidine diphosphate (2MAHMP), thiamine phosphate (thiamine-P), phosphoribosylaminoimidazolecarboxamide formyltransferase(AICAR), d-erythro-imidazole-glycerol-phosphate (IGP), imidazole acetol-phosphate (IMIDAZOLE-ACETOL-P), histidinol-phosphatase (HISTIDINOL-P), glucosamine-6-phosphate (GlcN-6P), UDP N-acetylglucosamine (UDP-GlcNAc), UDP-N-acetylmuramyl-pentapeptide (UDP-MurNac-Pentapeptide), S-adenosyl-L-methionine (SAM), methionine (MET), homocysteine (HCYS).
In silico-predicted gene knockouts that lead to auxotrophy from the top-ranked discriminant genes in k-mer-based and SNP-based classifiers listed for each antibiotic
| Antibiotic | Gene(s) leading to specific auxotrophy |
|---|---|
| Ampicillin | |
| Aztreonam | |
| Cefepime | Pyrimidine compounds ( |
| Cefoxitin | |
| Ciprofloxacin | Phenylalanine ( |
| Levofloxacin | Histidine ( |
| Gentamicin | Cysteine-derived compounds ( |
| Meropenem | Histidine ( |
| Tetracycline | |
| Tobramycin | Thiamine ( |
| Trimethoprim |
Symbol: *, genes associated with the top 50 ranked features of the antibiotic AMR model.
In silico-predicted essential genes on specific carbon sources from the top-ranked discriminant genes in k-mer-based and SNP-based classifiers listed for each antibiotic
| Antibiotic | Lethal genes for growth on specific carbon sources important in AMR model |
|---|---|
| Ampicillin | |
| Aztreonam | |
| Cefepime | |
| Cefoxitin |
|
| Ciprofloxacin | |
| Levofloxacin |
|
| Gentamicin |
|
| Meropenem |
|
| Tetracyline | |
| Tobramycin | |
| Trimethoprim |
|
Symbol: *, genes associated with top 50 ranked features of the antibiotic AMR model.
FIG 5Effects of genetic determinants on metabolite yields. (a) Bipartite network with genes and metabolites as nodes. Labeled nodes represent genes, whereas unlabeled nodes represent metabolites. A gene and metabolite are connected by an edge if the deletion of the gene blocks the metabolite production. Genes and metabolites are highlighted according to the cluster they were assigned to via the Networkx modularity algorithm. The number of clusters in the figure was reduced by considering only those of size greater than 10. (b) Heatmap showing the metabolic systems associated with each of the six clusters. A gene was associated with a metabolic system, if at least one metabolite correlated with the system could no longer be produced after the gene was deleted. (c) Heatmap showing the antibiotics associated with each cluster. Note that genes occurring in multiple antibiotics were accounted for twice. Hierarchical clustering was applied to the rows of each heatmap (metabolic systems or antibiotic class) using the single linkage method and Euclidean distance as the metric. The gene counts have been normalized by the total number of genes in each cluster in each heatmap.
FIG 6Effect of genetic determinants on reaction fluxes. (a) Bipartite network with genes and reactions as nodes. Labeled nodes represent the genes, whereas unlabeled nodes represent reactions. A gene and reaction are connected by an edge if the deletion of the gene reduces the reaction flux by at least 10%. Genes and reactions are highlighted according to the cluster they were assigned to via the Networkx modularity algorithm. Note that to reduce the initial size of the network, we only included clusters of size greater than 10. (b) Heatmap showing the metabolic systems associated with each of the nine clusters. A gene was associated with a metabolic system, if the flux span of at least one reaction correlated with the system was reduced after the gene was deleted. (c) Heatmap showing the antibiotics associated with each cluster. Genes occurring in multiple antibiotics were accounted for twice. Hierarchical clustering was applied to the rows of each heatmap (metabolic systems or antibiotic class) using the single linkage method and Euclidean distance as the metric. The gene counts have also been normalized by the total number of genes in each cluster in each heatmap.