| Literature DB >> 34928464 |
Ivayla Roberts1, Marina Wright Muelas2, Joseph M Taylor3, Andrew S Davison3, Yun Xu1,4, Justine M Grixti1, Nigel Gotts1,4, Anatolii Sorokin1, Royston Goodacre1,4, Douglas B Kell5,6.
Abstract
INTRODUCTION: The diagnosis of COVID-19 is normally based on the qualitative detection of viral nucleic acid sequences. Properties of the host response are not measured but are key in determining outcome. Although metabolic profiles are well suited to capture host state, most metabolomics studies are either underpowered, measure only a restricted subset of metabolites, compare infected individuals against uninfected control cohorts that are not suitably matched, or do not provide a compact predictive model.Entities:
Keywords: COVID-19; LC–MS/MS Orbitrap; Serum; UHPLC-MS/MS; Untargeted metabolomics
Mesh:
Substances:
Year: 2021 PMID: 34928464 PMCID: PMC8686810 DOI: 10.1007/s11306-021-01859-3
Source DB: PubMed Journal: Metabolomics ISSN: 1573-3882 Impact factor: 4.290
Patient demographics by severity and outcome
| Condition | Severity | Outcome | ||||
|---|---|---|---|---|---|---|
| Mild | Severe | P-v | Discharged | Deceased | P-v | |
| N | 71 | 49 | – | 89 | 31 | – |
| 4C score | 8.39 (4.75–12) | 12.87 (10.00–16) | *** | 8.91 ( 5.25–12) | 14.34 (13.00–16) | *** |
| Male | 38 (54%) | 32 (65%) | 48 (54%) | 22 (71%) | ||
| Female | 33 (46%) | 17 (35%) | 41 (46%) | 9 (29%) | ||
| Age | 66.0 (54–77) | 69.2 (56–87) | 63.8 (52.0–77.0) | 77.1 (65.5–88.5) | *** | |
| BMI | 26.4 (21.8–28.6) | 27.0 (23.0–32.0) | 26.7 (22.3–29.0) | 26.4 (21.9–29.6) | ||
| O2 support | 32 (45%) | 49 (100%) | *** | 50 ( 56%) | 31 (100%) | *** |
| Mechanical respiratory support | 1 (1%) | 12 (25%) | *** | 3 (10%) | 10 (11%) | |
| Max FiO2 | 24.7 (21–28) | 71.6 (40–100) | *** | 33.3 (21–35) | 74.2 (36–100) | *** |
| Required O2 on presentation | 17 (24%) | 39 (80%) | *** | 34 (38%) | 22 (71%) | ** |
| FiO2 (%) on presentation | 22.8 (21–21) | 47.0 (28–60) | *** | 28.2 (21.0–28) | 45.5 (22.5–60) | ** |
| NEWS | 2.56 (1–4) | 7.27 (5–10) | *** | 3.47 (1–6.0) | 7.40 (6–10.8) | *** |
| Hypertension | 28 (39%) | 26 (53%) | 35 (39%) | 19 (61%) | * | |
| Cardiac disease | 20 (28%) | 17 (35%) | 25 (28%) | 12 (39%) | ||
| Kidney disease | 24 (34%) | 19 (39%) | 28 (31%) | 15 (48%) | ||
| eGFR | 67.7 (49–90) | 56.5 (30–90) | * | 67.7 (48.0–90.0) | 49.9 (26.5–68.5) | ** |
| Liver disease | 7 (9.9%) | 6 (12.2%) | 9 (10%) | 4 (13%) | ||
| Malignancy | 13 (18%) | 6 (12%) | 13 (15%) | 6 (19%) | ||
| Fever | 34 (48%) | 30 (61%) | 49 (55%) | 15 (48%) | ||
| Cough | 33 (46%) | 29 (59%) | 43 (48%) | 19 (61%) | ||
| SOB | 35 (49%) | 29 (59%) | 48 (54%) | 16 (52%) | ||
| Pulse (BPM) | 87.9 (72.0–102) | 102.8 (91.8–114) | *** | 90.6 (74–106) | 103.8 (92–114) | ** |
| Temp. (°C) | 37.1 (36.6–37.4) | 37.5 (36.7–38.1) | * | 37.2 (36.6–37.5) | 37.3 (36.7–37.9) | |
| BP (mmHg) | 137 (120–151) | 123 (102–138) | ** | 135 (120–149) | 120 (102–130) | ** |
| Respiratory rate | 20.1 (17–22) | 26.6 (20–30) | *** | 21.7 (17.0–22) | 25.7 (19.2–30) | * |
Hb (g/L) Male [133–168] Female [118–148] | 126 (111–141) | 124 (107–146) | 127 (113–140) | 120 ( 96–146) | ||
WBC (× 109/L) [3.5–11.0] | 8.28 (4.8–10.9) | 9.98 (5.9–11.7) | 8.63 (5.1–11.0) | 9.97 (6.5–11.6) | ||
Lymphs (× 109/L) [1.0–3.5] | 1.25 (0.75–1.4) | 1.06 (0.60–1.6) | 1.239 (0.7–1.50) | 0.994 (0.5–1.35) | ||
PLTS (× 109/L) [150–400] | 256 (176–306) | 236 (166–292) | 255 (175–305) | 226 (169–259) | ||
| HCT (%) | 0.372 (0.337–0.417) | 0.371 (0.329–0.436) | 0.375 (0.341–0.416) | 0.361 (0.287–0.443) | ||
ALT (U/L) Male [≤ 41] Female [≤ 33] | 32.8 (13–31.8) | 47.1 (16–39.0) | 36.1 (15–39.0) | 46.6 (13–31.5) | ||
Urea (mmol/L) [2.5–7.8] | 8.29 (4.6–8.75) | 13.05 (5.2–19.00) | ** | 8.53 (4.4–8.8) | 15.12 (8.0–22.2) | *** |
Creatinine (µmol/L) Male [59–104] Female [45–84] | 112 (66–104) | 133 (70–168) | 109 (65.0–105) | 153 (84.5–184) | ||
CRP (mg/L) [< 4] | 47.7 (11–61.2) | 132.9 (49–190.0) | *** | 61.4 (13.5–79.5) | 144.3 (52.5–209.5) | *** |
Cohort demographics are presented as counts and percentages (%) for categorical data and means with interquartile range for continuous data. P-values indicating significant differences between groups follow ‘star’ notation i.e., ‘***’ correspond to p-values < 0.001, ‘**’ < 0.01, ‘*’ < 0.05, ‘.’ < 0.1 and missing when > 0.1. P-values were not calculated for the group counts (N). This study looked at the serum samples from 120 COVID-19 patients. 49 patients developed severe symptoms and 31 patients died as a result of the infection. Severity score metrics based on the 4C Mortality score (Knight et al., 2020) are provided as group means. Some disparity can be observed in gender as women represent 13% of the severe cases and only 29% of the deceased patients. Age groups of severe and deceased patients also tend to be slightly higher. BMI is not significantly different between the groups. O2 support indicates the number of patients that required oxygen support at any time during their hospitalization. Mechanical respiratory support indicates the need of invasive support or continuous positive airway pressure (CPAP). Max FiO2 captures the maximum fraction of inspired oxygen required by the patient during the hospitalization period, where respiratory support captures the patients requiring any support at diagnosis and FiO2 represents the fraction of inspired oxygen required at time of sample acquisition. As expected, oxygen need and inspired fraction, are highly correlated with severity and fatal outcome. National Early Warning Score (NEWS) also showed correlation with both severity and outcome. Cardiac disease refers to multiple cardiovascular conditions, most frequently: ischemic heart disease, atrial fibrillation, and heart failure. Kidney disease is a grouping of stages G2 to G5 of chronic kidney disease as defined by the National Institute for Health and Care Excellence (NICE) (NICE, 2015). Liver disease in most cases refers to cirrhosis and hepatitis. Malignancy cases vary from lung, bladder, prostate, skin cancer to haematological. Those underlying conditions did not show significant differences in severe infections or poor outcome. Despite kidney disease classification not showing correlation, estimated Glomerular Filtration Rate (eGFR) levels were significantly different across severity and outcome classes. Fever (temperature ≥ 38 °C), cough and shortness of breath (SOB) were noted at time of sample acquisition and did not show relation to severity or outcome. Pulse, systolic blood pressure (BP) and respiratory rate also taken at samples acquisition showed correlation with severity and outcome, where higher pulse, higher respiratory rate and lower blood pressure are associated with severe cases and poor outcome. Haemoglobin levels (Hb), white blood cell count (WBC), lymphocyte count (Lymphs), platelets count (PLTs), haematocrit (HCT) and alanine aminotransferase (ALT) measured at sample acquisition did not show significant correlation with severity and outcome. Urea, creatinine, and C-reactive protein (CRP) concentrations are consistently elevated in severe and deceased patients. Hb, WBC, Lymphs, PLTs and HCT were measured on Beckman analyser. ALT, urea, creatinine and CRP were measured on a Roche analyser. Reference ranges are provided in square brackets [] when available
Fig. 1Predictive model based on 20 compounds selected for their identification confidence and known biological role (see Fig. 4). Balanced accuracy and AUC for the specific train / test split are in the 0.70 s and 0.80 s, respectively. ROC 95% confidence intervals were calculated with 2000 stratified bootstrap replicates on the test data and are presented as blue shading around the mean curve. The balanced accuracy calculated using Monte Carlo cross-validation for these models is 0.716 for severity and 0.655 for outcome. Cross-validated AUC was calculated as ~ 0.79 in both conditions
Fig. 4Level of selected compounds for significant change in level between first and last patient sample. Ureidopropionate (A), uracil (B), arginine (C) and tryptophan (D) levels showed significantly different evolution measured by logistic regression OR and 95%CI. Y axes reflect the difference in ion counts or peak areas difference between the first sample and last sample per patient. Boxplot markings follow the same standard as Fig. 4
Fig. 2Compounds retained for severity and outcome predictive model. Box plot shows compound area differences between discharged and deceased patients ordered by fold change. Compound areas are standardized (mean = 0, SD = 1) to facilitate comparison. Boxes represent the quartiles Q1 to Q3 with Q2 (i.e., median) line in the middle. The ‘whiskers’ depict the upper and lower limit i.e., Q1 ± (Q3-Q1). For visualization simplicity the data is clipped between 10 and 90 percentiles. The table on the right side of the figure shows detailed information about the compounds including Fold Change (FC) and q-value (false discovery rate corrected p-value) following ‘star’ notation i.e., ‘***’ correspond to q-values < 0.001, ‘**’ < 0.01, and missing when > 0.1. In the case of ‘180.05336@4.775 ‘, which was initially identified as nicotinuric acid at MSI level 3, further standard based validation refuted this identification. Therefore, the compound remains unknown. Further details can be found in ‘Compound identity validation’ section in Methods
Adjusted logistic regression results by outcome
Positive OR indicate increased levels in patients with poor outcome and are presented with OR (95% CI). Significance is presented in ‘star’ notation i.e., ‘***’ correspond to p-values < 0.001, ‘**’ < 0.01, ‘*’ < 0.05, ‘.’ < 0.1 and missing when > 0.1. Compounds are adjusted for age, gender, BMI, liver conditions, cardiovascular diseases, hypertension, kidney disease, diabetes and all together. Details of specific diseases in each category are available in the Methods section
Fig. 3Predictive model performance based on 17 ESI + compounds previously selected in the discovery study. ROC 95% confidence intervals were calculated with 2000 stratified bootstrap replicates on the test data and are presented as blue shading around the mean curve. A Monte Carlo cross-validation results estimate the model balanced accuracy at 0.63 for severity and outcome