| Literature DB >> 29382143 |
Henrike Zschach1, Mette V Larsen2, Henrik Hasman3, Henrik Westh4,5, Morten Nielsen6,7, Ryszard Międzybrodzki8,9, Ewa Jończyk-Matysiak10, Beata Weber-Dąbrowska11, Andrzej Górski12,13.
Abstract
Staphylococcus aureus is a major agent of nosocomial infections. Especially in methicillin-resistant strains, conventional treatment options are limited and expensive, which has fueled a growing interest in phage therapy approaches. We have tested the susceptibility of 207 clinical S. aureus strains to 12 (nine monovalent) different therapeutic phage preparations and subsequently employed linear regression models to estimate the influence of individual host gene families on resistance to phages. Specifically, we used a two-step regression model setup with a preselection step based on gene family enrichment. We show that our models are robust and capture the data's underlying signal by comparing their performance to that of models build on randomized data. In doing so, we have identified 167 gene families that govern phage resistance in our strain set and performed functional analysis on them. This revealed genes of possible prophage or mobile genetic element origin, along with genes involved in restriction-modification and transcription regulators, though the majority were genes of unknown function. This study is a step in the direction of understanding the intricate host-phage relationship in this important pathogen with the outlook to targeted phage therapy applications.Entities:
Keywords: MRSA; bacterial phage resistance; phage therapy; regression modeling
Year: 2018 PMID: 29382143 PMCID: PMC5872120 DOI: 10.3390/antibiotics7010009
Source DB: PubMed Journal: Antibiotics (Basel) ISSN: 2079-6382
Wet lab results of susceptibility testing. All phage preparations were tested at RTD, see Methods. MS-1, OP_MS-1 and OP_MS-1_TOP are mixtures of P4/6409, A5/80 and 676/Z.
| Phage Preparation | Percent Sensitive | Percent Resistant |
|---|---|---|
| 1N/80 | 31.9% | 68.1% |
| 676/F | 50.7% | 49.3% |
| 676/T | 68.1% | 31.9% |
| 676/Z | 40.6% | 59.4% |
| A3/R | 18.8% | 81.2% |
| A5/L | 47.3% | 52.7% |
| A5/80 | 55.1% | 44.9% |
| P4/6409 | 37.7% | 62.3% |
| phi200/6409 | 44.0% | 56.0% |
| MS-1 | 33.8% | 66.2% |
| OP_MS-1 | 38.6% | 61.4% |
| OP_MS-1 TOP | 39.6% | 60.4% |
Figure 1All-against-all matrix of the genetic distance between the 207 methicillin-resistant Staphylococcus aureus (MRSA) strains used for this study. Distance is calculated as 1-orthoANI and represented as color, where blue corresponds to lower and red corresponds to greater distance. The assignment of strains to partitions is marked on the right margin.
Figure 2Abundance of gene families in the 207 strains. The peak depicted in the histogram is slightly higher than the number of housekeeping genes, 1.777, since the bin is wider than 1.
Figure 3Stacked histogram of p-value distributions across the five folds for the interaction with phage P4/6409. The density is shown instead of counts to account for fold 1 having a 100 times less p-values compared to the other folds, since it does not include partition 1 and therefore did not need to be subsampled. Left: Real data. Right: Permuted data.
Summary of the modelling results for real and permuted data. The “First Model” section reports the results of the first filtering procedure based of association analyses. The “Final Model” section gives the result of the second filtering procedure based on regression model fitting combined with consistency constraints. The area under the curve (AUC) is used as performance measure of the final model. The number of gene families selected given in the left part of the table is calculated as the average ± standard deviation across the five folds. If less than two gene families were selected based on regression weights, a final model could not be trained and the associated AUC is reported as NA (not applicable).
| First Model | Final Model | |||||
|---|---|---|---|---|---|---|
| Real Data | Permuted Data | Real Data | Permuted Data | |||
|
| No. of Gene Families Selected by Enrichment | No. of Gene Families Selected by Enrichment | No. of Gene Families Selected on Regression Weights |
| No. of Gene Families Selected on Regression Weights |
|
| 1N/80 | 10 ± 16 | 0 | 2 | NA | 0 | NA |
| 676/F | 222 ± 144 | 0 | 45 | 0.78 | 0 | NA |
| 676/T | 361 ± 243 | 12 ± 11 | 79 | 0.87 | 3 | 0.63 |
| 676/Z | 112 ± 87 | 11 ± 14 | 31 | 0.72 | 4 | 0.61 |
| A3/R | 13 ± 26 | 0 | 1 | NA | 0 | NA |
| A5/L | 184 ± 124 | 0 | 37 | 0.8 | 0 | NA |
| A5/80 | 265 ± 148 | 0 | 80 | 0.78 | 0 | NA |
| P4/6409 | 200 ± 137 | 2 ± 4 | 61 | 0.79 | 0 | NA |
| phi200/6409 | 160 ± 138 | 0 | 56 | 0.79 | 0 | NA |
| MS-1 | 6 ± 10 | 0 | 0 | NA | 0 | NA |
| OP_MS-1 | 86 ± 78 | 0 | 29 | 0.65 | 0 | NA |
| OP_MS-1_TOP | 54 ± 52 | 1 ± 1 | 13 | 0.67 | 0 | NA |
Figure 4Heat map of the regression weights for the final model of phage P4/6409. Columns are gene families, rows are cross validation folds. The color indicates the value and direction of each weight, with blue being strongly positive and red being strongly negative. Weights with low values are white. Results were comparable for other phages with the exception of 1N/80, A3/R, and mix MS-1 (see Table 2).
Figure 5Functional annotation categories of the eggNOGs matching to the set of significant genes across all nine phages.
Figure 6Histogram depicting the number of phage models where a given gene family was identified significant.
Predicted functions of the gene families found significant in interaction with eight or more phages.
| Gene Family ID | Times Observed | Increases | eggNOG Annotation | eggNOG Category |
|---|---|---|---|---|
| cluster_1791 | 9 | Resistance | - | No Hit |
| cluster_389 | 8 | Resistance | - | Function unknown |
| cluster_3112 | 8 | Resistance | Transcriptional regulator | Transcription |
| cluster_3992 | 8 | Ambiguous * | - | No Hit |
* This gene family always confers phage resistance except in one interaction in which it confers susceptibility.