| Literature DB >> 36246273 |
Evan S Bradley1,2, Abigail L Zeamer2,3, Vanni Bucci2,3, Lindsey Cincotta1, Marie-Claire Salive1, Protiva Dutta1, Shafik Mutaawe1, Otuwe Anya1, Christopher Tocci4, Ann Moormann5, Doyle V Ward2,3, Beth A McCormick2,3, John P Haran1,2,3.
Abstract
The oropharyngeal microbiome, the collective genomes of the community of microorganisms that colonizes the upper respiratory tract, is thought to influence the clinical course of infection by respiratory viruses, including Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2), the causative agent of Coronavirus Infectious Disease 2019 (COVID-19). In this study, we examined the oropharyngeal microbiome of suspected COVID-19 patients presenting to the Emergency Department and an inpatient COVID-19 unit with symptoms of acute COVID-19. Of 115 initially enrolled patients, 50 had positive molecular testing for COVID-19+ and had symptom duration of 14 days or less. These patients were analyzed further as progression of disease could most likely be attributed to acute COVID-19 and less likely a secondary process. Of these, 38 (76%) went on to require some form of supplemental oxygen support. To identify functional patterns associated with respiratory illness requiring respiratory support, we applied an interpretable random forest classification machine learning pipeline to shotgun metagenomic sequencing data and select clinical covariates. When combined with clinical factors, both species and metabolic pathways abundance-based models were found to be highly predictive of the need for respiratory support (F1-score 0.857 for microbes and 0.821 for functional pathways). To determine biologically meaningful and highly predictive signals in the microbiome, we applied the Stable and Interpretable RUle Set to the output of the models. This analysis revealed that low abundance of two commensal organisms, Prevotella salivae or Veillonella infantium (< 4.2 and 1.7% respectively), and a low abundance of a pathway associated with LPS biosynthesis (< 0.1%) were highly predictive of developing the need for acute respiratory support (82 and 91.4% respectively). These findings suggest that the composition of the oropharyngeal microbiome in COVID-19 patients may play a role in determining who will suffer from severe disease manifestations.Entities:
Keywords: COVID-19; LPS biosynthesis; Prevotella; SARS-CoV-2; commensal organisms; oropharyngeal microbiome; random forest classification
Year: 2022 PMID: 36246273 PMCID: PMC9561819 DOI: 10.3389/fmicb.2022.1009440
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 6.064
Figure 1Study enrollment and data analysis flowcharts. (A) Patients at UMass Medical center were enrolled for our study according to the following flow chart. Fifty patients with acute COVID-19 were ultimately selected for our study cohort and followed for a clinical outcome of whether they needed respiratory support and what level of respiratory support was required, ranging from supplemental oxygen via simple nasal cannula escalating through intubation and mechanical ventilation. The number of patients requiring each level of respiratory support is shown in the final chart on the right. (B) Data from clinical covariates and microbiome sequencing results are combined in a random forest classifier to determine features predicting the need for respiratory support. We then applied the Stable and Interpretable RUle Set (SIRUS) to these results to generate easily interpretable rules predicting which clinical covariates and microbiome features are predictive of the need for respiratory support.
Study population characteristics.
| Respiratory support | ||||
|---|---|---|---|---|
| Characteristic | Overall, | no, | yes, | |
| BMI | 29.12 (7.01) | 23.83 (5.11) | 30.79 (6.74) | 0.003 |
| Age | 68.00 (15.24) | 60.83 (19.52) | 70.26 (13.12) | 0.15 |
| Male (%) | 25/50 (50) | 5/12 (42) | 20/38 (53) | 0.5 |
|
| ||||
| Caucasian (%) | 32/50 (64) | 5/12 (42) | 27/38 (71) | 0.089 |
| Black (%) | 5/50 (10) | 2/12 (17) | 3/38 (7.9) | 0.6 |
| Asian (%) | 2/50 (4.0) | 2/12 (17) | 0/38 (0) | 0.054 |
| Other (%) | 11/50 (22) | 3/12 (25) | 8/38 (21) | >0.9 |
| Hispanic or Latino (%) | 38/50 (76) | 7/12 (58) | 31/38 (82) | 0.13 |
| CCI | 4.50 (2.58) | 3.75 (3.05) | 4.74 (2.41) | 0.2 |
| Hypertension (%) | 33/50 (66) | 7/12 (58) | 26/38 (68) | 0.7 |
| Diabetes (%) | 18/50 (36) | 5/12 (42) | 13/38 (34) | 0.7 |
| Asthma (%) | 8/50 (16) | 1/12 (8.3) | 7/38 (18) | 0.7 |
| COPD (%) | 10/50 (20) | 2/12 (17) | 8/38 (21) | >0.9 |
| OSA (%) | 3/50 (6.0) | 0/12 (0) | 3/38 (7.9) | >0.9 |
| Smoker, current (%) | 1/50 (2.0) | 1/12 (8.3) | 0/38 (0) | 0.2 |
| Smoker, former (%) | 21/50 (42) | 3/12 (25) | 18/38 (47) | 0.2 |
| COVID fatality (%) | 8/50 (16) | 0/12 (0) | 8/38 (21) | 0.2 |
|
| ||||
| Shannon | 2.25 (0.62) | 2.50 (0.35) | 2.17 (0.66) | 0.2 |
| Simpson | 0.80 (0.13) | 0.86 (0.04) | 0.78 (0.15) | 0.3 |
| Inverse Simpson | 7.04 (3.69) | 7.70 (2.39) | 6.83 (4.02) | 0.3 |
Mean (SD);
Wilcoxon Rank Sum test; Fisher’s exact test; Pearson’s Chi-squared test.
CCI, Charlson Comorbidity Index; COPD, chronic obstructive pulmonary disease; OSA, obstructive sleep apnea.
Figure 2Results of random forest classification model. (A) F1-scores of RFC models including clinical covariates (CC), individual microbial abundances, and the combination of bacterial abundances, alpha diversity, and clinical covariates show that all models perform well with models including all multimodal data performing slightly better. (B) Median ranked importance of features from the final model (blue boxplot) trained on all data including microbiome features, alpha diversity, and clinical data (median importance ± median absolute deviation) are shown. The size of the circle represents how often each feature was selected. The relative abundance of Prevotella salivae is the top predictor with the relative abundance of Campylobacter concisus, V. infantium and Actinomycetes sp. S6-Spd3 and the Shannon diversity index also showing significant contributions. Significantly contributing clinical covariates were age and BMI. Features which do not contribute to the predictive model are not shown. (C) The relative abundance of the microbiome features or values of clinical variables determined to be important in predicting the need for respiratory support by our RFC model are displayed along with discriminative rules based on the probability of requiring respiratory support (pres).
Figure 3Random Forest Classification Using Metabolic Pathways. (A) F1-scores of RFC models built on relative abundance of detected metabolic pathways alone or in combination with clinical covariates (CC) show that models combining both data modalities perform slightly better. (B) Median relative importance of variables in predicating the need for respiratory support from the model trained on relative pathway abundances and clinical covariates (median importance ± median absolute deviation) are shown. The size of the circle represents how often each feature was selected. Features which do not contribute to the predictive model are not shown. (C) The relative abundance of top metabolic pathways or values of clinical features determined to be important in predicting need for respiratory support by our RFC model are displayed along with discriminative rules based on the probability of requiring respiratory support (pres). (D) Contributions of detected bacterial genera to pathway abundance of CMP-3-deoxy-D-manno-octusonate biosynthesis and L-threonine biosynthesis in patients who did or did not go on to require respiratory support are shown. The detection of Pseudomonas contributing to the abundance of the CMP-3-deoxy-D-manno-octusonate pathway and the detection of Tropheryma contributing to the abundance of the L-threonine biosynthetic pathway are notable and highlighted.
Figure 4Conceptual Diagram of SARS-nCoV2 Interaction with the Oropharyngeal Microbiome. When an infection with SARS-nCoV2 begins within the oropharynx, it occurs within the environment of the microbiome. If the oropharyngeal microbiome has a high abundance of non-inflammatory or minimally inflammatory species that do not produce strongly inflammatory bacterial products, this will be protective against more severe lung disease and the need for respiratory support. If the oropharyngeal microbiome has a low abundance of non-inflammatory or minimally inflammatory species, then these produce more inflammatory bacterial products, leading to more severe lung disease and the need for respiratory support.