| Literature DB >> 34844191 |
Wen Zhong1, Ozlem Altay2, Muhammad Arif1, Fredrik Edfors1, Levent Doganay3, Adil Mardinoglu4, Mathias Uhlen1, Linn Fagerberg5.
Abstract
BACKGROUND: COVID-19 has caused millions of deaths globally, yet the cellular mechanisms underlying the various effects of the disease remain poorly understood. Recently, a new analytical platform for comprehensive analysis of plasma protein profiles using proximity extension assays combined with next generation sequencing has been developed, which allows for multiple proteins to be analyzed simultaneously without sacrifice on accuracy or sensitivity.Entities:
Keywords: COVID-19; Immune response; Plasma proteome; Protein profiling
Mesh:
Substances:
Year: 2021 PMID: 34844191 PMCID: PMC8626206 DOI: 10.1016/j.ebiom.2021.103723
Source DB: PubMed Journal: EBioMedicine ISSN: 2352-3964 Impact factor: 8.143
Fig. 1Overview of the study cohort. (a) The study design with the COVID-19 cohort as well as the wellness cohort from the S3WP program, which were both analyzed using the PEA-NGS method. (b) The age distribution of the COVID-19 cohort (n = 33, male; n = 17, female). (c) The BMI distribution of the COVID-19 cohort. (d) Heatmap showing the symptoms of each individual of the COVID-19 cohort as well as the age, symptom positive days and the measured oxygen saturation (SPO2) levels in percentage (%) (n = 50).
Fig. 2Clustering of COVID-19 samples. (a) UMAP plot showing the distribution of day-0 and day-14 samples, with each individual connected by a dotted line. (b) Dendrogram visualizing the results from hierarchical clustering of all samples. Day-0 samples are shown in red and day-14 samples in blue.
Fig. 3Proteins associated with COVID-19 infection. (a) Results from multifactor ANOVA based on the factors COVID-19 infection (day-0 / day-14), age, sex and BMI, showing the most highly associated proteins with each factor (n = 50). (b) Example of a BMI- (adjust p < 0.001) and sex-associated (adjust p < 0.001) protein cadherin related family member 2 (CDHR2) (n = 50). (c) Example of an age-associated protein ectodysplasin A2 receptor (EDA2R) (adjust p < 0.001, n = 50). (d) Volcano plot with differentially expressed proteins between day-0 and day-14 samples showing the difference in NPX values on the x-axis and -log10(adjusted p-value) on the y-axis (multifactor ANOVA with Benjamini Hochberg correction, n = 50). We show the manual annotation of the top 50 most significant proteins into groups of ‘cytokine’ (red), ‘immune related’ (orange) and ‘other’ (blue). (e) Boxplots of examples of up- and down- regulated proteins in day-0 samples (n = 50).
Top 50 elevated plasma proteins in COVID-19 infected patients.
| Protein | UniProt description | Classification | NPX difference | adjust |
|---|---|---|---|---|
| SCARB2 | scavenger receptor class B member 2 | Immune related | 1.08 | 1.5E-21 |
| SIGLEC1 | sialic acid binding Ig like lectin 1 | Immune related | 1.35 | 1.4E-20 |
| CTSO | cathepsin O | Other | 1.14 | 4.2E-19 |
| CXCL10 | C-X-C motif chemokine ligand 10 | Cytokine | 2.72 | 3.1E-18 |
| GRN | granulin precursor | Cytokine | 1.12 | 3.8E-18 |
| LAG3 | lymphocyte activating 3 | Immune related | 1.08 | 8.3E-18 |
| CCL8 | C-C motif chemokine ligand 8 | Cytokine | 2.09 | 3.0E-17 |
| IFNL1 | interferon lambda 1 | Cytokine | 1.95 | 1.0E-16 |
| LAMP3 | lysosomal associated membrane protein 3 | Immune related | 1.43 | 3.3E-16 |
| CSF1 | colony stimulating factor 1 | Cytokine | 1.00 | 4.3E-15 |
| TCN2 | transcobalamin 2 | Other | 0.99 | 5.4E-15 |
| CLEC6A | C-type lectin domain containing 6A | Immune related | 1.24 | 1.2E-13 |
| ANGPTL1 | angiopoietin like 1 | Other | 0.99 | 4.2E-13 |
| LGALS9 | galectin 9 | Immune related | 0.88 | 4.2E-13 |
| CD300E | CD300e molecule | Immune related | 1.03 | 6.2E-13 |
| TNFSF10 | TNF superfamily member 10 | Cytokine | 0.71 | 6.2E-13 |
| IL15 | interleukin 15 | Cytokine | 0.85 | 8.1E-13 |
| CD14 | CD14 molecule | Immune related | 1.34 | 3.0E-12 |
| EBI3_IL27 | NA | Cytokine | 0.61 | 1.7E-11 |
| CX3CL1 | C-X3-C motif chemokine ligand 1 | Cytokine | 0.87 | 3.0E-11 |
| LGMN | legumain | Other | 0.83 | 5.7E-11 |
| CLEC4C | C-type lectin domain family 4 member C | Immune related | 0.89 | 7.1E-11 |
| TINAGL1 | tubulointerstitial nephritis antigen like 1 | Other | 0.70 | 9.1E-11 |
| CRLF1 | cytokine receptor like factor 1 | Cytokine | 0.74 | 1.0E-10 |
| PTX3 | pentraxin 3 | Immune related | 0.80 | 1.2E-10 |
| C1QA | complement C1q A chain | Immune related | 0.55 | 1.4E-10 |
| LILRA5 | leukocyte immunoglobulin like receptor A5 | Immune related | 0.62 | 2.3E-10 |
| IL18BP | interleukin 18 binding protein | Cytokine | 0.73 | 3.5E-10 |
| TNF | tumor necrosis factor | Cytokine | 0.61 | 5.4E-10 |
| HMOX1 | heme oxygenase 1 | Other | 1.09 | 9.9E-10 |
| IL18R1 | interleukin 18 receptor 1 | Cytokine | 0.57 | 1.3E-09 |
| ENTPD6 | ectonucleoside triphosphate diphosphohydrolase 6 (putative) | Other | 0.45 | 2.5E-09 |
| VWA1 | von Willebrand factor A domain containing 1 | Other | 0.62 | 3.1E-09 |
| ESM1 | endothelial cell specific molecule 1 | Other | 0.73 | 3.2E-09 |
| DLL1 | delta like canonical Notch ligand 1 | Immune related | 0.65 | 3.4E-09 |
| TNFSF13B | TNF superfamily member 13b | Cytokine | 0.72 | 3.6E-09 |
| FOLR2 | folate receptor beta | Other | 0.67 | 4.2E-09 |
| GAS6 | growth arrest specific 6 | Other | 0.58 | 5.8E-09 |
| LILRB4 | leukocyte immunoglobulin like receptor B4 | Immune related | 0.71 | 9.6E-09 |
| SEMA3F | semaphorin 3F | Other | 0.65 | 1.0E-08 |
| SIGLEC5 | sialic acid binding Ig like lectin 5 | Immune related | 1.50 | 1.3E-08 |
| TNFSF13 | TNF superfamily member 13 | Cytokine | 0.57 | 1.8E-08 |
| TPP1 | tripeptidyl peptidase 1 | Other | 0.77 | 2.2E-08 |
| ENTPD5 | ectonucleoside triphosphate diphosphohydrolase 5 | Other | 0.42 | 2.2E-08 |
| SMOC1 | SPARC related modular calcium binding 1 | Other | 0.48 | 2.2E-08 |
| BST2 | bone marrow stromal cell antigen 2 | Immune related | 0.82 | 2.5E-08 |
| FST | follistatin | Other | 0.70 | 3.5E-08 |
| VCAM1 | vascular cell adhesion molecule 1 | Immune related | 0.55 | 3.6E-08 |
| VSIG4 | V-set and immunoglobulin domain containing 4 | Immune related | 0.71 | 4.1E-08 |
| CD74 | CD74 molecule | Immune related | 0.58 | 5.4E-08 |
Fig. 4Analysis of the plasma proteins related to COVID-19 infection. (a) Heatmap showing the expression levels of the 50 most significant proteins in all day-0 and day-14 samples, clustered based on expression in the 50 proteins. (b) Scatterplot showing the difference in expression levels between day-0 and day-14 samples in our study on the x-axis, and the difference between COVID-19 positive and COVID-19 negative samples in the Filbin et al[16] study on the y-axis. All proteins with a significant difference in our study are shown. The color code depicts the -log10 adjusted p-value in the Filbin et al study, where the grey dots represent non-significant change. The statistical analysis is based on ANOVA with sex, age, bmi as covariates. (c) Boxplots of up- and down- regulated proteins in COVID positive and negative patients in both our study (n = 50) and in the Filbin et al study (n = 242, positive; n = 78, negative).
Fig. 5Analysis of the 50 most highly associated proteins to COVID-19 infection. (a) Radar plot showing the average expression levels of the 50 proteins for each of the three groups of samples (day-0 in red, color day-14 in blue color, and wellness healthy cohort in green color). (b) Boxplot showing the expression levels of two differentially expressed proteins ectonucleoside triphosphate diphosphohydrolase 5 (ENTPD5) and tubulointerstitial nephritis antigen like 1 (TINAGL1) in both the COVID-19 and wellness studies (n = 50, COVID-19 study; n = 76, wellness study). (c) UMAP plot showing the distribution of samples both from the COVID-19 cohort and the 76 samples from the wellness cohort based on the top 50 proteins. (d) UMAP plot highlighting the age of the individuals in the two different groups of day-14 samples. (e) Boxplot of the age distribution of the two different groups of day-14 samples and the p-value based on paired t-test (adjust p < 0.001, n = 50). (f) Boxplot showing the distribution of protein levels of a differentially expressed protein sialic acid binding Ig like lectin 1 (SIGLEC1) in day-0 samples, Group 1 (G1) and Group 2 (G2) of day-14 samples and the wellness cohort (n = 50, COVID-19 study; n = 76, wellness study).