| Literature DB >> 35324915 |
Erick Denamur1,2, Bénédicte Condamine1, Marina Esposito-Farèse3, Guilhem Royer1,4,5, Olivier Clermont1, Cédric Laouenan1,3, Agnès Lefort1,6, Victoire de Lastours1,6, Marco Galardini7,8.
Abstract
Escherichia coli is an important cause of bloodstream infections (BSI), which is of concern given its high mortality and increasing worldwide prevalence. Finding bacterial genetic variants that might contribute to patient death is of interest to better understand infection progression and implement diagnostic methods that specifically look for those factors. E. coli samples isolated from patients with BSI are an ideal dataset to systematically search for those variants, as long as the influence of host factors such as comorbidities are taken into account. Here we performed a genome-wide association study (GWAS) using data from 912 patients with E. coli BSI from hospitals in Paris, France. We looked for associations between bacterial genetic variants and three patient outcomes (death at 28 days, septic shock and admission to intensive care unit), as well as two portals of entry (urinary and digestive tract), using various clinical variables from each patient to account for host factors. We did not find any association between genetic variants and patient outcomes, potentially confirming the strong influence of host factors in influencing the course of BSI; we however found a strong association between the papGII operon and entrance of E. coli through the urinary tract, which demonstrates the power of bacterial GWAS when applied to actual clinical data. Despite the lack of associations between E. coli genetic variants and patient outcomes, we estimate that increasing the sample size by one order of magnitude could lead to the discovery of some putative causal variants. Given the wide adoption of bacterial genome sequencing of clinical isolates, such sample sizes may be soon available.Entities:
Mesh:
Year: 2022 PMID: 35324915 PMCID: PMC8946752 DOI: 10.1371/journal.pgen.1010112
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Univariate analysis on the combined dataset.
Only clinical variables significantly associated with BSI outcomes are shown. CI, confidence interval.
| Patient outcome | Clinical variable | Odds-ratio [95% CI] | P-value |
|---|---|---|---|
| death | urinary tract | 0.51 [0.38–0.69] | 2E-5 |
| pulmonary tract | 2.88 [1.70–4.87] | 8E-5 | |
| malignant tumor | 1.75 [1.30–2.35] | 2E-4 | |
| digestive tract | 1.51 [1.12–2.04] | 0.006 | |
| chronic alcoholism | 1.74 [1.16–2.60] | 0.007 | |
| immunosuppression | 1.50 [1.11–2.02] | 0.007 | |
| active smoking | 1.56 [1.11–2.19] | 0.01 | |
| septic shock | pulmonary tract | 2.12 [1.27–3.55] | 0.003 |
| admission to ICU | cirrhosis | 1.99 [1.37–2.89] | 3E-4 |
| digestive tract | 1.53 [1.18–1.99] | 0.001 | |
| active smoking | 1.59 [1.17–2.16] | 0.003 |
Multivariate analysis on the combined dataset.
Clinical variables with p-value < 0.01 are reported for each patient outcome; the intercept is excluded. CI, confidence interval.
| Patient outcome | Clinical variable | Odds-ratio [95% CI] | P-value |
|---|---|---|---|
| death | study: septicoli | 0.59 [0.41–0.83] | 0.003 |
| pulmonary tract | 2.40 [1.33–4.20] | 0.003 | |
| urinary tract | 0.64 [0.45–0.89] | 0.008 | |
| septic shock | study: septicoli | 2.54 [1.96–3.33] | 5E-12 |
| pulmonary tract | 2.10 [1.25–3.51] | 0.005 | |
| admission to ICU | digestive tract | 1.57 [1.19–2.08] | 0.002 |
Fig 1Core genome phylogenetic tree of the 912 E. coli isolates used in this study.
Each ring reports the main bacterial and clinical variables of this study. The light color in the rings related to patient outcomes and portals of entry indicates the absence of the trait.
Fig 2Narrow-sense heritability (h) estimation for the target variables on the combined dataset.
a) Heritability estimates in the two studies combined, using a covariance matrix generated from the isolates’ phylogroup (phylogroup), a kinship matrix generated from the unitigs presence/absence matrix (variants), and the same kinship matrix conditioned with the clinical variables (variants + covariates). b) Heritability estimates in the Colibafi and c) Septicoli cohorts alone.
Fig 3Genome-wide association analysis results on the combined dataset.
a) Number of unitigs passing the multiple testing correction p-value threshold for each target phenotype. b) Number of genes with significantly associated unitigs mapped to them for each target phenotype. c) Manhattan plots for tested unitigs mapping to E. coli IAI39 for each target phenotype; red dashed line indicates the p-value threshold used to call significant associations. d) Zoomed-in Manhattan plots for the urinary tract trait. Genes annotated with a gene name in E. coli IAI39 and with associated unitigs are indicated in the third subpanel.
Fig 4Power simulations.
The proportion of causal variants passing the significance threshold is reported for each sample size and heritability for the simulated phenotypes.