| Literature DB >> 33594362 |
Dorien Feyaerts, Julien Hedou, Joshua Gillard, Han Chen, Eileen S Tsai, Laura S Peterson, Kazuo Ando, Monali Manohar, Evan Do, Gopal K R Dhondalay, Jessica Fitzpatrick, Maja Artandi, Iris Chang, Theo T Snow, R Sharon Chinthrajah, Christopher M Warren, Rich Wittman, Justin G Meyerowitz, Edward A Ganio, Ina A Stelzer, Xiaoyuan Han, Franck Verdonk, Dyani K Gaudilliere, Nilanjan Mukherjee, Amy S Tsai, Kristen K Rumer, Sizun J Jiang, Sergio I Valdes-Ferrer, J Daniel Kelly, David Furman, Nima Aghaeepour, Martin S Angst, Scott D Boyd, Benjamin A Pinsky, Garry P Nolan, Kari C Nadeau, Brice Gaudilliere, David R McIlwain.
Abstract
The biological determinants of the wide spectrum of COVID-19 clinical manifestations are not fully understood. Here, over 1400 plasma proteins and 2600 single-cell immune features comprising cell phenotype, basal signaling activity, and signaling responses to inflammatory ligands were assessed in peripheral blood from patients with mild, moderate, and severe COVID-19, at the time of diagnosis. Using an integrated computational approach to analyze the combined plasma and single-cell proteomic data, we identified and independently validated a multivariate model classifying COVID-19 severity (multi-class AUCtraining = 0.799, p-value = 4.2e-6; multi-class AUCvalidation = 0.773, p-value = 7.7e-6). Features of this high-dimensional model recapitulated recent COVID-19 related observations of immune perturbations, and revealed novel biological signatures of severity, including the mobilization of elements of the renin-angiotensin system and primary hemostasis, as well as dysregulation of JAK/STAT, MAPK/mTOR, and NF-κB immune signaling networks. These results provide a set of early determinants of COVID-19 severity that may point to therapeutic targets for the prevention of COVID-19 progression.Entities:
Year: 2021 PMID: 33594362 PMCID: PMC7885914 DOI: 10.1101/2021.02.09.430269
Source DB: PubMed Journal: bioRxiv
Figure 1 -Combined plasma and single-cell proteomic profiling of patients with mild, moderate, and severe COVID-19.
A Patients with mild (50), moderate (21), and severe (26) COVID-19 were examined together with 40 healthy controls. B Schematic representation of the experimental workflow. Plasma proteins were measured using the Olink Explore 1536 assay, while PBMC were stimulated with either LPS+CL097, IFNα+IL-2+IL-4+IL-6, PMA+ionomycin (PI), or left unstimulated (Unstim) before barcoding, antibody staining, and analysis by single-cell mass cytometry. C–D Correlation networks of single-cell mass cytometry and proteome dataset. Each node represents a feature, with edges representing the correlation between features (cor > 0.9). Node size reflects −log of p-value of the correlation with severity (Spearman) and node color represents the different data layers. E Bivariate scatterplot of patients with COVID-19 and healthy controls plotted along factors 3 and 10 identified by multi-omic factor analysis (MOFA; see Figure S2).
Patient characteristics.
Data are shown as a number and a percentage. Age is reported in median years (minimum to maximum).
| Total (N =137) | Control (N=40) | COVID+ (N=97) | ||||
|---|---|---|---|---|---|---|
|
| 45 (19–78) | 48 (23–73) | 44 (19–78) | |||
|
|
|
|
|
|
|
|
| M | 59 | 43.07 | 16 | 40 | 43 | 44.33 |
| F | 78 | 56.93 | 24 | 60 | 54 | 55.67 |
|
|
|
|
|
|
|
|
| Asian | 34 | 24.82 | 11 | 27.5 | 23 | 23.71 |
| Hispanic/Latino | 33 | 24.09 | 0 | 0 | 33 | 34.02 |
| Caucasian | 49 | 35.77 | 15 | 37.5 | 34 | 35.05 |
| Black | 6 | 4.38 | 1 | 2.5 | 5 | 5.15 |
| Multiracial | 5 | 3.65 | 3 | 7.5 | 2 | 2.06 |
| Middle Eastern | 1 | 0.73 | 1 | 2.5 | 0 | 0 |
| Not reported | 9 | 6.57 | 9 | 22.5 | 0 | 0 |
|
|
|
|
|
|
|
|
| Healthy | 40 | 29.2 | 40 | 100 | 0 | 0 |
| Asymptomatic | 2 | 1.46 | 0 | 0 | 2 | 2.06 |
| Mild | 48 | 35.04 | 0 | 0 | 48 | 49.48 |
| Moderate | 21 | 15.33 | 0 | 0 | 21 | 21.65 |
| Severe | 26 | 18.98 | 0 | 0 | 26 | 26.8 |
|
| ||||||
|
|
|
|
|
|
|
|
| Yes | 13 | 9.49 | 0 | 0 | 13 | 13.4 |
| Prediabetic | 5 | 3.65 | 1 | 2.5 | 4 | 4.12 |
| No | 96 | 70.07 | 32 | 80 | 64 | 65.98 |
| Not reported | 23 | 16.79 | 7 | 17.5 | 16 | 16.5 |
|
|
|
|
|
|
|
|
| Yes | 24 | 17.52 | 4 | 10 | 20 | 20.62 |
| No | 85 | 62.04 | 29 | 72.5 | 56 | 57.73 |
| Not reported | 28 | 20.44 | 7 | 17.5 | 21 | 21.65 |
|
|
|
|
|
|
|
|
| Yes | 15 | 10.95 | 5 | 12.5 | 10 | 10.31 |
| No | 115 | 83.94 | 28 | 70 | 87 | 89.69 |
| Not reported | 7 | 5.11 | 7 | 17.5 | 0 | 0 |
|
|
|
|
|
|
|
|
| Yes | 18 | 13.14 | 0 | 0 | 18 | 18.6 |
| No | 89 | 64.96 | 33 | 82.5 | 56 | 57.7 |
| Not reported | 30 | 21.9 | 7 | 17.5 | 23 | 23.7 |
Figure 2 -Integrated modeling of plasma and single-cell proteomic events categorizes COVID-19 severity
A Integration of all six data layers (proteome, frequency, baseline signaling, LPS/CL097 signaling response, IFNα/IL-2/IL-4/IL-6 signaling response, and PI signaling response) using a stacked generalization (SG) method. B–C Outcome of predicted vs true disease severity derived from SG model for the B training (R = 0.61, p-value = 4.2e–6, n = 74) and C validation cohort (R = 0.69, p-value = 7.7e−6, n = 73). D–E Multi-class area under the curve receiver operating characteristic (ROC) analysis of D the training (AUC = 0.799, n = 74) and E validation (AUC = 0.773, n = 73) severity model (see Table S2 for individual AUCs). 1 = control, 2 = mild, 3 = moderate, and 4 = severe.
Figure 3 -An iterative bootstrapping method identifies robust informative features for the differentiation of mild, moderate, and severe COVID-19.
A Workflow of the iterative bootstrap method used to identify informative features in the six data layers of the severity model. A LASSO regression model was run 1,000 times on random sub-samples with replacement for each data layer, Xi, then the number of times an individual feature was selected in one of the bootstrap iterations was counted and the features were ranked according to the frequency of selection in the bootstrap models. B–C Correlation network depicting single-cell (B) or plasma (C) proteomic features. Edges represent the correlation between features (Spearman cor > 0.9). Blue/orange nodes highlight positive/negative correlation with disease severity. Node size reflects −log of p-value (Spearman). Communities containing the bootstrap-selected informative single-cell (B) or plasma (C) proteomic features are highlighted and annotated.
Figure 4 -Severity model features reveal biological signatures that demarcate patients with mild, moderate, and severe COVID-19.
Boxplots, classified by disease severity, showing features informative to the model, including the correlation of the feature with disease severity (Spearman coefficient). A Frequency features are shown as a percentage of mononuclear cells. Granulocytes are shown as a percentage of singlets. pDC and granulocyte frequency are plotted on a log-scale. B Immune cell signaling at baseline (arcsinh transformed values; see methods). C Immune cell signaling response to PI stimulation is reported as the arcsinh transformed ratio over the baseline signaling response (see methods). D Immune cell signaling responses to IFNα/IL-2/IL-4/IL-6 (IFN/IL) stimulation are reported as the arcsinh transformed ratio over the baseline signaling response (see methods). E Plasma protein levels are reported as the normalized protein expression, an arbitrary unit provided by the Olink assay. Tmem = memory T cell; MERTK = tyrosine-protein kinase Mer.