| Literature DB >> 35468151 |
Danika Lipman1, Sandra E Safo2, Thierry Chekouo1,3.
Abstract
COVID-19 is a disease characterized by its seemingly unpredictable clinical outcomes. In order to better understand the molecular signature of the disease, a recent multi-omics study was done which looked at correlations between biomolecules and used a tree- based machine learning approach to predict clinical outcomes. This study specifically looked at patients admitted to the hospital experiencing COVID-19 or COVID-19 like symptoms. In this paper we examine the same multi-omics data, however we take a different approach, and we identify stable molecules of interest for further pathway analysis. We used stability selection, regularized regression models, enrichment analysis, and principal components analysis on proteomics, metabolomics, lipidomics, and RNA sequencing data, and we determined key molecules and biological pathways in disease severity, and disease status. In addition to the individual omics analyses, we perform the integrative method Sparse Multiple Canonical Correlation Analysis to analyse relationships of the different view of data. Our findings suggest that COVID-19 status is associated with the cell cycle and death, as well as the inflammatory response. This relationship is reflected in all four sets of molecules analyzed. We further observe that the metabolic processes, particularly processes to do with vitamin absorption and cholesterol are implicated in COVID-19 status and severity.Entities:
Mesh:
Year: 2022 PMID: 35468151 PMCID: PMC9038205 DOI: 10.1371/journal.pone.0267047
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Enrichment analysis of genes associated with COVID-19 status.
| Top Diseases and Biological Functions | ||
|---|---|---|
|
|
| |
| Connective Tissue Disorders | 4.04E-03–1.18E-05 | |
| Developmental Disorder | 4.04E-03–1.18E-05 | |
| Gastrointestinal Disease | 1.21E-02–1.18E-05 | |
| Organismal Injury and Abnormalities | 1.21E-02–1.18E-05 | |
| Skeletal and Muscular Disorders | 7.57E-03- 1.18E-05 | |
| Cell Cycle | 1.14E-02–5.08E-07 | |
| DNA Replication, Recombination, and Repair | 9.40E-03–1.51E-05 | |
| Cell-To-Cell Signalling and Interaction | 4.04E-03–8.80E-05 | |
| Cell Death and Survival | 1.01E-02–1.24E-04 | |
| Post-Translational Modification | 6.72E-03- 1.25E-04 | |
|
| ||
|
|
|
|
| Cell Cycle: G2/M DNA Damage Checkpoint Regulation | 4.87E-04 | |
| Cell Cycle Control of Chromosomal Replication | 6.36E-04 | |
| Mitotic Roles of Polo-Like Kinase | 8.05E-04 | |
| ATM Signaling | 1.86E-03 | |
| dTMP De Novo Biosynthesis | 3.37E-03 |
|
This table contains the enrichment analysis results for genes associated with COVID-19 status. For the gene set enrichment analysis the IPA output contains the top five canonical pathways and top ten biological functions and disease associations.
Fig 1The Pearson correlations of principal components of pathways associated with disease status.
The Pearson correlations of clinical variables and principal components used to summarize the enriched pathways in COVID-19 as predicted by IPA. These are the pathways predicted to be enriched based on 16 genes determined to be associated with COVID-19 via stability selection. Only the correlations which were significant (p-value<0.05) are reported. Some of the strongest correlations are with ferritin, CRP, and lactate.
Regression models for genes associated with COVID-19 status.
| Pathway | (Intercept) | PC1 | PC2 | Age | Gender (Male) | Charlson Score | |
|---|---|---|---|---|---|---|---|
|
|
| 1.703 (1.791) | 3.221 (0.687) | 1.824 (1.292) | 0.025 (0.036) | 1.742 (0.937) | -0.091 (0.230) |
|
| - |
| 0.370 | 0.349 |
| 0.696 | |
|
| (-1.711, 5.509) | (2.101, 4.870) | (-0.515, 4.595) | (-0.045, 0.098) | (0.030, 3.800) | (-0.529, 0.380) | |
|
| - |
|
| - | - | - | |
|
|
| 5.645 (2.573) | -3.334 (0.861) | -3.781 (1.624) | -0.010 (0.042) | 1.902 (1.012) | -0.179 (0.287) |
|
| - |
|
| 0.393 |
| 0.526 | |
|
| (1.267, 11.687) | (-5.504, -2.015) | (-7.359, -0.694) | (-0.096, 0.073) | (0.080, 4.176) | (-0.789, 0.368) | |
|
| - |
|
| - | - | - | |
Summary of multivariate logistic regression models with COVID-19 status as the outcome and the first two principal components used to summarize the enriched pathways associated with COVID-19 status as the predictors. The models are also adjusted for the clinical covariates: age, sex and Charlson comorbidity score. P-values for significance are determined via the likelihood ratio test (LRT).
Fig 2Overlapping networks associated with COVID-19.
Visual of the overlapping networks enriched in COVID-19 as determined from the RNAseq data. The nodes represent the networks and the edges represent the overlapping genes between the networks. The edge labels give the number of overlapping molecules between the networks.
Enrichment analysis of proteins associated with COVID-19 status.
| Top Diseases and Biological Functions | ||
|---|---|---|
|
|
| |
| Neurological Disease | 4.85E-02–1.25E-05 | |
| Organismal Injury and Abnormalities | 4.99E-02–1.25E-05 | |
| Metabolic Disease | 2.15E-02–5.18E-05 | |
| Psychological Disorders | 3.78E-02–5.18E-05 | |
| Inflammatory Response | 4.50E-02–1.05E-04 | |
| Cellular Movement | 3.75E-02–7.82E-06 | |
| Protein Synthesis | 3.22E-03–3.28E-05 | |
| Cellular Compromise | 1.38E-02–1.82E-04 | |
| Cell Death and Survival | 4.75E-02- 2.62E-04 | |
| Cellular Development | 4.75E-02–2.62E-04 | |
|
| ||
|
|
|
|
| LXR/RXR Activation | 1.48E-04 | |
| FXR/RXR Activation | 1.63E-04 | |
| Acute Phase Response Signalling | 4.67E-04 | |
| Type I Diabetes Mellitus Signaling | 3.81E-03 | |
| Creatine-phosphate Biosynthesis | 4.33E-03 |
|
This table contains the enrichment analysis results for proteins associated with COVID-19 status. For the protein set enrichment analysis the IPA output contains the top five canonical pathways and top ten biological functions and disease associations.
Fig 3The Pearson correlations of principal components of pathways associated with disease status.
The Pearson correlations of clinical variables and the principal components used to summarize the enriched pathways with COVID-19 status as predicted by IPA. These are the pathways predicted to be enriched based on the 22 proteins determined to be associated with COVID-19 status via stability selection. Only the correlations which were significant (p-value<0.05) are reported. The strongest correlations with the LXR/RXR activation and FXR/RXR activation pathways are with SOFA score and white blood cell count. The strongest correlation with the acute phase response signalling pathway is with HFD-45. With the diabetes mellitus signalling pathway, the top correlation is with fibrinogen.
Regression models for proteins associated with COVID-19 status.
| Pathway | (intercept) | PC1 | PC2 | Age | Gender (Male) | Charlson Score | |
|---|---|---|---|---|---|---|---|
|
|
| 36.911 (6.645) | -3.915 (2.181) | 5.363 (2.418) | -0.135 (0.132) | -2.315 (3.225) | -1.561 (0.836) |
|
| - |
|
|
| 0.741 | 0.062 | |
|
| (23.886, 49.936) | (-8.190, 0.361) | (0.623, 10.102) | (-0.393, 0.123) | (-8.636, 4.005) | (-3.199, 0.076) | |
|
| - |
|
| - | - | - | |
|
|
| 38.192 (6.196) | 2.337 (0.849) | 3.028 (0.903) | -0.106 (0.122) | -1.899 (3.041) | -2.446 (0.818) |
|
|
|
|
|
| 0.921 |
| |
|
| (26.048, 50.336) | (0.674, 4.000) | (1.259, 4.797) | (-0.346, 0.134) | (-7.860, 4.062) | (-4.049, -0.843) | |
|
| - |
|
| - | - | - | |
|
|
| 2.358(1.358) | -0.938 (0.198) | 1.179 (0.369) | 0.008 (0.025) | 0.706 (0.654) | -0.163 (0.154) |
|
| - |
|
| 0.976 | 0.274 | 0.291 | |
|
| (-0.162, 5.243) | (-1.373, -0.586) | (0.547, 2.008) | (-0.043, 0.058) | (-0.565, 2.041) | (-0.471, 0.144) | |
|
| - |
|
| - | - | - | |
Summary of multivariate logistic regression models with COVID-19 status as the outcome and the principal components used to summarize the enriched pathways associated with COVID-19 status as the predictors. The models are also adjusted for the clinical covariates age, sex and Charlson comorbidity score. P-values for significance are determined via the likelihood ratio test (LRT).
Fig 4Overlapping networks associated with COVID-19.
Visual of the overlapping networks enriched in COVID-19 as determined from the proteomics data. The nodes represent the networks, and the edges represent the overlapping proteins between the networks. The edge labels give the number of overlapping molecules between the networks.
Enrichment analysis of metabolites associated with COVID-19 status.
| Top Diseases and Biological Functions | ||
|---|---|---|
|
|
| |
| Antimicrobial Response | 1.22E-04–1.22E-04 |
|
| Dermatological Diseases and Conditions | 1.22E-04–1.22E-04 |
|
| Herditary Disorder | 7.32E-04–1.22E-04 |
|
| Infectious Diseases | 9.12E-03–1.22E-04 |
|
| Neurological Disease | 7.91E-03–1.22E-04 | |
| Small Molecule Biochemistry | 4.56E-02–1.22E-04 | |
| Cell Cycle | 2.44E-04–2.44E-04 |
|
| Cell Morphology | 2.44E-04–2.44E-04 |
|
| Cellular Compromise | 1.73E-02–2.44E-04 | |
| Cellular Assembly and Organization | 4.72E-02–3.66E-04 | |
|
| ||
|
|
|
|
| Myo-inositol Biosynthesis | 1.10E-03 |
|
| Sucrose Degradation V | 2.32E-03 |
|
| D-myo inositol(1,4,5)-triphosphate Degradation | 2.68E-03 |
|
| Superpathway of D-myo inositol(1,4,5)-triphosphate Metabolism | 3.90E-03 |
|
| D-myo inositol(1,4,5)-triphosphate Biosynthesis | 4.26E-03 |
|
This table contains the enrichment analysis results for metabolites associated with COVID-19 status. For the metabolite set enrichment analysis the IPA output contains the top five canonical pathways and top ten biological functions and disease associations.
Enrichment analysis of lipids associated with COVID-19 status.
| Enriched Pathways | ||||
|---|---|---|---|---|
| Pathway | p-value | Benjamini correction | Bonferroni correction | Selected Molecules |
|
| 0.006718104 | 0.036949572 | 0.073899145 | |
|
| 0.006718104 | 0.036949572 | 0.073899145 | |
|
| 0.016853933 | 0.061797753 | 0.185393258 | |
|
| 0.023682416 | 0.065126643 | 0.260506571 | |
|
| 0.033605179 | 0.073931393 | 0.369656964 | |
This table contains the enrichment analysis results for lipids associated with COVID-19 status. For the lipidomics set enrichment analysis from LIPEA the top 10 enriched pathways are summarized.
Fig 5The Pearson correlations of principal components of pathways associated with disease status.
The Pearson correlations of clinical covariates and the principal components used to summarize the enriched pathways with COVID-19 status as predicted by LIPEA. These are the pathways predicted to be enriched based on the 9 lipids determined to be associated with COVID-19 status via stability selection. Only the correlations which were significant (p-value<0.05) are reported. The strongest correlation is with the Charlson score.
Regression models for proteins associated with COVID-19 status.
| Pathway | (intercept) | PC1 | PC2 | Age | Gender (Male) | Charlson Score | |
|---|---|---|---|---|---|---|---|
|
|
| -3.200 (1.208) | -0.315 (0.104) | -0.346 (0.116) | 0.023 (0.021) | -0.494 (0.519) | 0.058 (0.142) |
|
| - |
|
| 0.518 | 0.279 | 0.111 | |
|
| (-0.537, -0.124) | (-0.588, -0.129) | (-0.019, 0.066) | (-1.530, 0.526) | (-0.227, 0.331) | ||
|
| - |
|
| - | - | - | |
Summary of multivariate logistic regression models with COVID-19 status as the outcome and the principal components used to summarize the enriched pathways associated with COVID-19 status as the predictors. The models are also adjusted for the clinical covariates age, sex and Charlson comorbidity score. P-values for significance are determined via the likelihood ratio test (LRT).
Enrichment analysis of genes associated with COVID-19 severity (all patients).
| Top Diseases and Biological Functions | ||
|---|---|---|
|
|
| |
| Cancer | 4.98E-02–1.11E-03 | |
| Connective Tissue Disorders | 4.47E-02–1.11E-03 | |
| Dermatological Diseases and Conditions | 1.11E-03–1.11E-03 | |
| Developmental Disorder | 4.23E-02–1.11E-03 | |
| Endocrine System Disorders | 1.10E-02–1.11E-03 | |
| Cellular Movement | 3.16E-02–1.11E-03 | |
| Cellular Development | 4.78E-02–5.33E-03 | |
| Cellular Growth and Proliferation | 4.78E-02–5.33E-03 | |
| Cell Death and Survival | 6.63E-03–6.63E-03 |
|
| Amino Acid Metabolism | 1.54E-02–7.73E-03 | |
|
| ||
|
|
|
|
| Retinoate Biosynthesis II | 4.42E-03 |
|
| The Visual Cycle | 2.08E-02 |
|
| Thyroid Hormone Metabolism II | 3.38E-02 |
|
| Airway Inflammation in Asthma | 3.49E-02 |
|
| Retinoate Biosynthesis I | 3.49E-02 |
|
This table contains the enrichment analysis results for genes associated with disease severity when using all patients. For the gene set enrichment analysis the IPA output contains the top five canonical pathways and top ten biological functions and disease associations.
Enrichment analysis of proteins associated with COVID-19 severity (all patients).
| Top Diseases and Biological Functions | ||
|---|---|---|
|
|
| |
| Inflammatory Response | 2.10E-02–4.69E-13 | |
| Infectious Diseases | 1.51E-02–6.74E-12 | |
| Endocrine System Disorders | 3.47E-03–7.74E-09 | |
| Gastrointestinal Disease | 2.10E-02–7.74E-09 | |
| Metabolic Disease | 9.07E-03–7.74E-09 | |
| Cellular Compromise | 1.21E-02–4.69E-13 | |
| Cellular Movement | 2.10E-02–2.32E-10 | |
| Cellular Function and Maintenance | 1.86E-02–1.47E-09 | |
| Protein Synthesis | 6.06E-03–1.97E-08 | |
| Cell Death and Survival | 1.81E-02–1.62E-06 | |
|
| ||
|
|
|
|
| LXR/RXR Activation | 1.04E-10 | |
| FXR/RXR Activation | 1.39E-10 | |
| Acute Phase Response Signalling | 1.16E-06 | |
| Atherosclerosis Signalling | 1.97E-06 | |
| Neuroprotective Role of THOP1 in Alzheimers Disease | 2.00E-05 | |
This table contains the enrichment analysis results for proteins associated with disease severity when using all patients. For the protein set enrichment analysis the IPA output contains the top five canonical pathways and top ten biological functions and disease associations.
Regression models for proteins associated with COVID-19 severity (all patients).
| Pathway | (Intercept) | PC1 | PC2 | Age | Gender (Male) | Charlson Score | |
|---|---|---|---|---|---|---|---|
|
|
| 33.585 (5.321) | -4.715 (0.737) | 2.468 (0.941) | -0.066 (0.102) | -3.022 (2.544) | -1.088 (0.661) |
|
| - |
|
|
| 0.328 | 0.100 | |
|
| (23.157, 44.013) | (-6.159, -3.271) | (0.623, 4.314) | (-0.265, 0.134) | (-8.008, 1.963) | (-2.383, 0.207) | |
|
| - |
|
| - | - | - | |
|
|
| 33.709 (4.689) | -7.417 (0.883) | 4.620 (0.976) | -0.041 (0.090) | -4.843 (2.249) | -1.258 (0.583) |
|
| - |
|
|
| 0.058 |
| |
|
| (24.519, 42.899) | (-9.147, -5.687) | (2.708, 6.533) | (-0.218, 0.136) | (-9.251, -0.436) | (-2.401, -0.114) | |
|
| - |
|
| - | - | - | |
|
|
| 37.090 (5.250) | -4.756 (0.671) | -5.971 (1.469) | -0.116 (0.103) | -3.809 (2.569) | -1.208 (0.677) |
|
|
|
|
|
| 0.245 | 0.075 | |
|
| (26.801, 47.379) | (-6.072, -3.441) | (-8.850, -3.092) | (-0.319, 0.086) | (-8.844, 1.226) | (-2.535, 0.119) | |
|
| - |
|
| - | - | - | |
|
|
| 42.017 (6.043) | 2.078 (0.795) | -0.012 (0.905) | -0.184 (0.117) | -5.208 (2.981) | -1.049 (0.772) |
|
| - |
| 0.956 |
| 0.103 | 0.174 | |
|
| (30.172, 53.862) | (0.521, 3.636) | (-1.786, 1.762) | (-0.413, 0.045) | (-11.051, 0.635) | (-2.561, 0.464) | |
|
| - |
|
| - | - | - | |
Summary of multivariate linear regression models with COVID-19 severity as the outcome using all patients, and the principal components used to summarize the enriched pathways associated with disease severity status as the predictors. The models are also adjusted for the clinical covariates age, sex and Charlson comorbidity score. P-values for significance are determined via the likelihood ratio test (LRT).
Fig 6The Pearson correlations of principal components of pathways associated with disease severity (all patients).
The Pearson correlations of clinical covariates and the principal components used to summarize the enriched pathways with disease severity as predicted by IPA. These are the pathways predicted to be enriched based on the 67 proteins determined to be associated with disease severity via stability selection. Only the correlations which were significant (p-value<0.05) are reported. These pathways all have multiple significant correlations with clinical covariates.
Fig 7Overlapping networks associated with COVID-19 visual of the overlapping networks enriched in COVID-19 as determined from the proteomics data.
The nodes represent the networks and the edges represent the overlapping proteins between the networks. The edge labels give the number of overlapping molecules between the networks.
Enrichment analysis of metabolites associated with COVID-19 severity (all patients).
| Top Diseases and Biological Functions | ||
|---|---|---|
|
|
| |
| Cardiovascular Disease | 2.23E-02–5.73E-04 | |
| Endocrine System Disorders | 3.99E-02–5.73E-04 | |
| Hemtological Disease | 3.11E-02–5.73E-04 | |
| Metabolic Disease | 3.11E-02–5.73E-04 | |
| Organismal Injury and Abnormalities | 4.64E-02–5.73E-04 | |
| Cell Death and Survival | 3.99E-02–4.50E-04 | |
| Cellular Compromise | 2.67E-02–4.50E-04 | |
| Carbohydrate Metabolism | 3.33E-02–7.87E-04 | |
| Cell-To-Cell Signaling and Interaction | 4.86E-02–2.20E-03 | |
| Cell Morphology | 2.25E-03–2.25E-03 |
|
|
| ||
|
|
|
|
| Myo-inositol Biosynthesis | 8.97E-03 |
|
| D-myo inositol(1,4,5)-triphosphate Degradation | 1.12E-02 |
|
| Urate Biosynthesis/Inosine 5’-phosphate Degradation | 2.01E-02 |
|
| Superpathway of D-myo inositol(1,4,5)-triphosphate Metabolism | 2.01E-02 |
|
| Sucrose Degradation V | 2.23E-02 |
|
This table contains the enrichment analysis results for metabolites associated with disease severity when using all patients. For the metabolite set enrichment analysis the IPA output contains the top five canonical pathways and top ten biological functions and disease associations.
Enrichment analysis of lipids associated with COVID-19 severity (all patients).
| Enriched Pathways | ||||
|---|---|---|---|---|
| Pathway | p-value | Benjamini correction | Bonferroni correction | Selected Molecules |
|
| 6.94137E-06 | 0.000104121 | 0.000104121 | |
|
| 0.018723372 | 0.084269663 | 0.280850581 | |
|
| 0.018723372 | 0.084269663 | 0.280850581 | |
|
| 0.023682416 | 0.084269663 | 0.355236233 | |
|
| 0.028089888 | 0.084269663 | 0.421348315 | |
This table contains the enrichment analysis results for lipids associated with disease severity when using all patients. For the lipidomics set enrichment analysis from LIPEA the top 10 enriched pathways are summarized.
Regression models for lipids associated with COVID-19 severity (all patients).
| Pathway | (Intercept) | PC1 | PC2 | Age | Gender (Male) | Charlson Score | |
|---|---|---|---|---|---|---|---|
|
|
| 38.733 (6.040) | 1.2801 (0.580) | -0.648 (0.758) | -0.144 (0.116) | -2.279 (3.01) | -1.318 (0.768) |
|
| - |
| 0.394 |
| 0.310 | 0.061 | |
|
| (26.772,50.694) | (0.130, 2.430) | (-2.150, 0.853) | (-0.374, 0.087) | (-8.234, 3.676) | (-2.840, 0.203) | |
|
| - |
|
| - | - | - | |
Summary of multivariate linear regression models with COVID-19 severity as the outcome using all patients, and the principal components used to summarize the enriched pathways associated with disease severity as the predictors. The models are also adjusted for the clinical covariates age, sex and Charlson comorbidity score. P-values for significance are determined via the likelihood ratio test (LRT).
Fig 8The Pearson correlations of principal components of pathways associated with disease severity (all patients).
The Pearson correlations of clinical covariates and the principal components used to summarize the enriched pathways with disease severity using all patients as predicted by LIPEA. These are the pathways predicted to be enriched based on the 11 lipids determined to be associated with disease severity via stability selection which were able to be mapped to known pathways. Only the correlations which were significant (p-value<0.05) are reported. The strongest correlations are with fibrinogen and hemoglobin.
Enrichment analysis of genes associated with COVID-19 severity (patients with COVID-19).
| Top Diseases and Biological Functions | ||
|---|---|---|
| P-value range | Selected Molecules | |
| Dermatological Diseases and Conditions | 2.71E-02–2.16E-04 | |
| Organismal Injury and Abnormalities | 4.99E-02–2.16E-04 | |
| Renal and Urological Disease | 2.43E-02–2.54E-04 | |
| Connective Tissue Disorders | 4.71E-02–7.22E-04 | |
| Developmental Disorder | 4.94E-02–7.22E-04 | |
| Cell-To-Cell Signalling and Interaction | 4.11E-02–3.93E-05 | |
| Cell Cycle | 2.29E-02–7.22E-04 | |
| Cell Morphology | 4.87E-02–7.22E-04 | |
| Cellular Assembly and Organization | 4.87E-02–7.22E-04 | |
| Cell Death and Survival | 3.76E-02–1.44E-03 | |
|
| ||
|
|
|
|
| Pathogenesis of Multiple Sclerosis | 6.48E-03 |
|
| Agranulocyte Adhesion and Diapedesis | 6.97E-03 | |
| Thyroid Hormone Metabolism II | 2.22E-02 |
|
| Airway Inflammation in Asthma | 2.22E-02 |
|
| Complement System | 2.57E-02 |
|
This table contains the enrichment analysis results for genes associated with disease severity when using patients with COVID-19 only. For the gene set enrichment analysis the IPA output contains the top five canonical pathways and top ten biological functions and disease associations.
Fig 9The Pearson correlations of principal components of pathways associated with disease severity (COVID-19 patients).
The Pearson correlations of clinical covariates and the principal components used to summarize the enriched pathways with COVID-19 severity as predicted by IPA. These are the pathways predicted to be enriched based on 17 genes determined to be associated with COVID-19 severity via stability selection. Only the correlations which were significant (p-value<0.05) are reported. This pathway has many significant correlations with clinical covariates, especially measures of disease severity.
Regression models for genes associated with COVID-19 severity (all patients).
| Pathway | (Intercept) | PC1 | PC2 | Age | Gender (Male) | Charlson Score | |
|---|---|---|---|---|---|---|---|
|
|
| 40.446 (5.841) | -6.515 (1.407) | -5.438 (1.499) | -0.288 (0.114) | -2.010 (2.830) | -0.108 (0.792) |
|
| - |
|
|
| 0.486 | 0.891 | |
|
| (28.998, 51.895) | (-9.271, -3.758) | (-8.375, -2.500) | (-0.512, -0.064) | (-7.58, 3.537) | (-1.661, 1.444) | |
|
| - |
|
| - | - | - | |
Summary of multivariate linear regression models with COVID-19 severity as the outcome using COVID-19 patients only, and the principal components used to summarize the enriched pathways associated with disease severity status as the predictors. The models are also adjusted for the clinical covariates age, sex and Charlson comorbidity score. P-values for significance are determined via the likelihood ratio test (LRT).
Enrichment analysis of proteins associated with COVID-19 severity (patients with COVID-19).
| Top Diseases and Biological Functions | ||
|---|---|---|
| P-value range | Selected Molecules | |
| Inflammatory Response | 1.69E-02–2.27E-11 | |
| Infectious Diseases | 1.69E-02–2.05E-09 | |
| Metabolic Disease | 1.30E-02–4.96E-07 | |
| Neurological Disease | 1.69E-02–4.96E-07 | |
| Organismal Injury and Abnormalities | 1.97E-02–4.96E-07 | |
| Cellular Compromise | 1.41E-02–2.27E-11 | |
| Cellular Movement | 1.69E-02–6.06E-11 | |
| Cellular Function and Maintenance | 1.01E-02–7.52E-10 | |
| Cell-To-Cell Signalling and Interaction | 1.97E-02–1.30E-06 | |
| Protein Synthesis | 8.50E-03–2.58E-06 | |
|
| ||
|
|
|
|
| LXR/RXR Activation | 5.07E-08 | |
| FXR/RXR Activation | 6.35E-08 | |
| Atherosclerosis Signalling | 1.33E-06 | |
| Acute Phase Response Signalling | 1.16E-05 | |
| Maturity Onset of Young Diabetes Signaling (MODY) | 2.38E-05 | |
This table contains the enrichment analysis results for proteins associated with disease severity when using patients with COVID-19 only. For the protein set enrichment analysis the IPA output contains the top five canonical pathways and top ten biological functions and disease associations.
Fig 10The Pearson correlations of principal components of pathways associated with disease severity (COVID-19 patients).
The Pearson correlations of the principal components used to summarize the enriched pathways with COVID-19 severity as predicted by IPA. These are the pathways predicted to be enriched based on the 62 proteins determined to be significant in COVID-19 severity via stability selection which were able to be mapped to known pathways. Only the correlations which were significant (p-value<0.05) are reported. These pathways have many strong correlations with clinical covariates.
Enrichment analysis of proteins associated with COVID-19 severity (patients with COVID-19).
| Pathway | (Intercept) | PC1 | PC2 | Age | Gender (Male) | Charlson Score | |
|---|---|---|---|---|---|---|---|
|
|
| 36.911 (6.645) | -3.915 (2.181) | 5.363 (2.418) | -0.135 (0.132) | -2.315 (3.225) | -1.561 (0.836) |
|
| - |
|
|
| 0.741 | 0.062 | |
|
| (23.886, 49.936) | (-8.190, 0.361) | (0.623, 10.102) | (-0.393, 0.123) | (-8.636, 4.005) | (-3.199, 0.076) | |
|
| - |
|
| - | - | - | |
|
|
| 37.090 (5.250) | -4.756 (0.671) | -5.971 (1.460) | -0.116 (0.103) | -3.809 (2.569) | -1.208 (0.677) |
|
| - |
|
|
| 0.245 | 0.075 | |
|
| (26.801, 47.379) | (-6.072, -3.441) | (-8.850, -3.092) | (-0.319, 0.086) | (-8.844, 1.226) | (-2.535, 0.119) | |
|
| - |
|
| - | - | - | |
|
|
| 38.192 (6.196) | 2.337 (0.849) | 3.028 (0.903) | -0.106 (0.122) | -1.899 (3.041) | -2.446 (0.818) |
|
|
|
|
|
| 0.921 |
| |
|
| (26.048, 50.336) | (0.674, 4.000) | (1.259, 4.797) | (-0.346, 0.134) | (-7.860, 4.062) | (-4.049, -0.843) | |
|
| - |
|
| - | - | - | |
|
|
| 37.491 (6.121) | -8.948 (1.278) | -1.317 (2.195) | -0.107 (0.113) | -4.442 (2.771) | -1.292 (0.708) |
|
| - |
|
|
| 0.210 | 0.068 | |
|
| (25.494, 49.487) | (-11.453, -6.443) | (-5.619, 2.985) | (-0.330, 0.115) | (-9.872, 0.989) | (-2.679, 0.095) | |
|
| - |
|
| - | - | - | |
Summary of multivariate linear regression models with COVID-19 severity as the outcome using patients with COVID-19 only, and the principal components used to summarize the enriched pathways associated with disease severity status as the predictors. The models are also adjusted for the clinical covariates age, sex and Charlson comorbidity score. P-values for significance are determined via the likelihood ratio test (LRT).
Fig 11Overlapping networks associated with COVID-19.
Visual of the overlapping networks enriched in COVID-19 as determined from the proteomics data. The nodes represent the networks and the edges represent the overlapping genes between the networks. The edge labels provide us with the number of overlapping molecules between the networks.
Enrichment analysis of lipids associated with COVID-19 severity (COVID-19 patients).
| Enriched Pathways | ||||
|---|---|---|---|---|
| Pathway | p-value | Benjamini correction | Bonferroni correction | Selected Molecules |
|
| 6.94137E-06 | 0.000104121 | 0.000104121 | |
|
| 0.018723372 | 0.084269663 | 0.280850581 | |
|
| 0.018723372 | 0.084269663 | 0.280850581 | |
|
| 0.023682416 | 0.084269663 | 0.355236233 | |
|
| 0.028089888 | 0.084269663 | 0.421348315 | |
This table contains the enrichment analysis results for lipids associated with disease severity when using patients with COVID-19 only. For the lipidomics set enrichment analysis from LIPEA the top 10 enriched pathways are summarized.
Fig 12Absolute coefficients for first component of smCCA.
These plots contain the absolute values of the weights for each datasets’ first component in smCCA.
Fig 13Absolute coefficients for second component of smCCA.
These plots contain the absolute values of the weights for each datasets’ second component in smCCA.
Enrichment analysis of genes in smCCA component 1.
| Top Diseases and Biological Functions | ||
|---|---|---|
| P-value range | Number of Molecules | |
| Gastrointestinal Disease | 2.06E-02–7.14E-11 | 80 |
| Organismal Injury and Abnormalities | 2.06E-02–7.14E-11 | 87 |
| Cancer | 2.06E-02–1.18E-09 | 87 |
| Hematological Disease | 2.06E-02–1.18E -09 | 42 |
| Immunological disease | 2.06E-02–1.18E -09 | 52 |
| Cellular Compromise | 2.06E-02–6.90E-08 | 14 |
| Cellular Movement | 2.06E-02–1.36E-07 | 28 |
| Cellular Function and Maintenance | 1.01E-02–1.36E-07 | 8 |
| Cell-To-Cell Signalling and Interaction | 2.06E-02–3.43E-07 | 31 |
| Protein Synthesis | 2.06E-02–4.29E-07 | 22 |
|
| ||
|
|
|
|
| TH1 Pathway | 1.37E-09 | |
| TH1 and TH2 Activation Pathway | 1.86E-09 | |
| TH2 Pathway | 4.28E-09 | |
| Natural Killer Cell Signalling | 11.09E-07 | |
| T-Cell Receptor Signalling | 4.55E-07 | |
This table contains the enrichment analysis results for genes active in component 1 from smCCA For the gene set enrichment analysis the IPA output contains the top five canonical pathways and top ten biological functions and disease associations.
Enrichment analysis of genes in smCCA component 2.
| Top Diseases and Biological Functions | ||
|---|---|---|
| P-value range | Number of Molecules | |
| Cancer | 3.07E-02–6.78E-06 | 109 |
| Organismal Injury and Abnormalities | 3.07E-02–6.78E-06 | 109 |
| Gastrointestinal Disease | 2.67E-02–9.89E-06 | 101 |
| Inflammatory Response | 2.68E-02–5.90E-05 | 16 |
| Dermatological Disease and Conditions | 3.07E-02–6.48E-04 | 84 |
| Cell Death and Survival | 3.07E-02–4.90E-05 | 21 |
| Cell-To-Cell Signalling and Interaction | 2.68E-02–5.90E-05 | 26 |
| Cellular Growth and Proliferation | 2.57E-02–6.81E-05 | 16 |
| Cellular Development | 2.38E-02–5.20E-04 | 18 |
| Lipid metabolism | 3.07E-02–2.33E-03 | 9 |
|
| ||
|
|
|
|
| Cellular Effects of Sildenafil | 1.37E-09 | |
| Netrin Signaling | 1.86E-09 | |
| Chemokine Signaling | 4.28E-09 | |
| Role of NFAT in Cardiac Hypertrophy | 11.09E-07 | |
| Role of NFAT in Cardiac Hypertrophy | 4.55E-07 | |
This table contains the enrichment analysis results for genes active in component 2 from smCCA For the gene set enrichment analysis the IPA output contains the top five canonical pathways and top ten biological functions and disease associations.
Pairwise correlations of smCCA scores.
| Genes (1) | Genes (2) | Prot (1) | Prot (2) | Metab (1) | Metab (2) | Lipid (1) | Lipid (2) | |
|---|---|---|---|---|---|---|---|---|
|
| 1 | 0.95 | 0.75 | 0.54 | 0.62 | 0.49 | 0.79 | 0.6 |
|
| 0.95 | 1 | 0.75 | 0.64 | 0.7 | 0.58 | 0.78 | 0.68 |
|
| 0.75 | 0.75 | 1 | 0.66 | 0.75 | 0.72 | 0.78 | 0.76 |
|
| 0.54 | 0.64 | 0.66 | 1 | 0.83 | 0.8 | 0.69 | 0.88 |
|
| 0.62 | 0.7 | 0.75 | 0.83 | 1 | 0.84 | 0.8 | 0.92 |
|
| 0.49 | 0.58 | 0.72 | 0.8 | 0.84 | 1 | 0.63 | 0.88 |
|
| 0.79 | 0.78 | 0.78 | 0.69 | 0.8 | 0.63 | 1 | 0.82 |
|
| 0.6 | 0.68 | 0.76 | 0.88 | 0.92 | 0.88 | 0.82 | 1 |
Pairwise correlations of scores from smCCA.
Fig 14smCCA scores.
Plot of the pairwise scores from smCCA. Note the red points are patients with COVID-19 and the blue are patients without.
Logistic regression using smCCA components.
| Coef (SE) | p-values *LRT | True COVID-19 | True Non-Covid-19 | |||
|---|---|---|---|---|---|---|
|
| 2.773 (10.626) | - |
| 96 | 12 | |
|
| 0.000 (0.019) | 0.518 |
| 3 | 12 | |
|
| -1.739 (0.672) | 0.279 | ||||
|
| -0.024 (0.173) | 0.212 | ||||
|
| 0.235 (0.201) | 0.179 | ||||
|
| -0.404 (0.219) |
| ||||
|
| 0.424 (0.337) | 0.076 | ||||
|
| 1.035 (0.447) | 0.246 | ||||
|
| -1.710 (0.466) |
| ||||
|
| -0.066 (0.147) | 0.911 | ||||
|
| 0.140 (0.192) | 0.466 | ||||
Results from logistic regression using components from smCCA. Significant p-values at a level of 0.05 are bolded.