Literature DB >> 33945677

Longitudinal proteomic profiling provides insights into host response and proteome dynamics in COVID-19 progression.

Jee-Soo Lee1, Dohyun Han2, So Yeon Kim3, Ki Ho Hong4, Myoung-Jin Jang5, Man Jin Kim1, Young-Gon Kim1, Jae Hyeon Park1, Sung Im Cho1, Wan Beom Park6, Kyung Bok Lee7, Ho Seob Shin1, Hyeon Sae Oh1, Taek Soo Kim1, Sung Sup Park1, Moon-Woo Seong1.   

Abstract

In managing patients with coronavirus disease 2019 (COVID-19), early identification of those at high risk and real-time monitoring of disease progression to severe COVID-19 is a major challenge. We aimed to identify potential early prognostic protein markers and to expand understanding of proteome dynamics during clinical progression of the disease. We performed in-depth proteome profiling on 137 sera, longitudinally collected from 25 patients with COVID-19 (non-severe patients, n = 13; patients who progressed to severe COVID-19, n = 12). We identified 11 potential biomarkers, including the novel markers IGLV3-19 and BNC2, as early potential prognostic indicators of severe COVID-19. These potential biomarkers are mainly involved in biological processes associated with humoral immune response, interferon signalling, acute phase response, lipid metabolism, and platelet degranulation. We further revealed that the longitudinal changes of 40 proteins persistently increased or decreased as the disease progressed to severe COVID-19. These 40 potential biomarkers could effectively reflect the clinical progression of the disease. Our findings provide some new insights into host response to SARS-CoV-2 infection, which are valuable for understanding of COVID-19 disease progression. This study also identified potential biomarkers that could be further validated, which may support better predicting and monitoring progression to severe COVID-19.
© 2021 Wiley-VCH GmbH.

Entities:  

Keywords:  COVID-19; Keywords; prognosis; proteomics; serum; severity

Mesh:

Substances:

Year:  2021        PMID: 33945677      PMCID: PMC8206655          DOI: 10.1002/pmic.202000278

Source DB:  PubMed          Journal:  Proteomics        ISSN: 1615-9853            Impact factor:   3.984


coronavirus disease 2019 lactate dehydrogenase C‐reactive protein interferon tumour necrosis factor mass spectrometry severe acute respiratory syndrome coronavirus 2 differentially expressed protein trifluoroacetic acid styrene divinylbenzene reversed‐phase sulfonate Liquid chromatography‐tandem mass spectrometry data‐independent acquisition data‐dependent acquisition false discovery rate gene ontology confidence interval best cut‐off value interquartile range Pearson's correlation coefficients angiotensin I‐converting enzyme 2 lipopolysaccharides toll‐like receptor 4 myeloid differentiation factor 2

INTRODUCTION

A wide range of disease severity, ranging from asymptomatic to life‐threatening illness, has been observed in patients with coronavirus disease 2019 (COVID‐19) [1]. While the current estimate is that 81% of COVID‐19 patients have a mild or moderate disease course, some patients who initially present with mild symptoms subsequently experience acute clinical deterioration within 1–2 weeks after symptom onset [1, 2]. Therefore, precise prognostication and early identification of disease progression are needed to effectively manage patients who are at high risk of clinical progression to severe COVID‐19. Several previous studies have reported clinical laboratory findings including lactate dehydrogenase (LDH), C‐reactive protein (CRP), albumin, D‐dimer, lymphocytes, neutrophils, and platelets at admission as parameters for progressive risk estimation [3, 4, 5, 6]. Plasma levels of proteins associated with cytokine storms and immune response including interleukin (IL)‐2, IL‐6, IL‐7, G‐CSF, interferon (IFN)‐γ inducible protein 10, monocyte chemoattractant protein‐1, macrophage inflammatory protein‐1α, and tumour necrosis factor (TNF)‐α have been reported to be elevated in severe COVID‐19 patients [2, 7, 8]. In addition, mass spectrometry (MS)‐based quantitative proteome profiling has been performed in several recent studies, which reveal characteristic molecular changes and biological processes in patients infected with severe acute respiratory syndrome coronavirus 2 (SARS‐CoV‐2) and at high risk of clinical progression [9, 10]. These studies have reported that severe COVID‐19 patients have altered abundance of proteins involved in immune/inflammatory regulation (e.g., IL‐6 and type 1 IFN), coagulation (e.g., fibrinogen and SERPINA10), lipid metabolism, and acute phase response (e.g., CRP) [11, 12]. Further proteome profiling may enable us to expand the understanding of prognostic molecular signatures altered at initial assessment or during the disease course in patients with a high risk of disease progression. In this study, we addressed two questions about prognostication in COVID‐19: (1) Which protein markers are differentially expressed preceding clinical deterioration and are able to significantly predict the high risk in patients who progressed to severe COVID‐19 compared to patients who had a non‐severe disease course. (2) Which differentially expressed protein (DEP) markers exhibit longitudinal changes that effectively reflect clinical progression of the disease. We performed longitudinal proteome profiling in serum samples of COVID‐19 patients at admission, and serially collected samples during their hospital stay. We then applied pathway analysis to gain insight into the relevant biological processes of altered protein expression. The overall strategy in this study is shown in Figure 1.
FIGURE 1

Study workflow. In‐depth proteome profiling was performed on 137 serum samples collected from 25 patients with COVID‐19 (non‐severe, n = 13; severe, n = 12). First, differentially expressed proteins (DEPs) between the two groups were identified. Then, the abundance of each candidate protein at admission and the risk probability of progression to severe COVID‐19 were evaluated. Also, dynamics of DEPs in patients with disease progression to severe COVID‐19 was assessed.

Significance Statement

The coronavirus disease 2019 (COVID‐19) pandemic highlights the need for biomarkers to identify the patients at high risk of severe COVID‐19 early and to monitor the disease progression. In this study, we performed longitudinal proteomic profiling on sera serially collected from patients with COVID‐19. Findings from this study identified 11 potential early prognostic indicators, including the novel protein markers IGLV3‐19 and BNC2. IGLV3‐19 is a variable domain of immunoglobulin light chains, which may provide better understanding of the host immune response in severe COVID‐19. BNC2 might play a role in the upregulation of interferon‐stimulated genes in the interferon signalling pathway. We further identified that the longitudinal changes of 40 proteins persistently increased or decreased as the disease progressed to severe COVID‐19, which effectively reflect the clinical progression of the disease. This study identified potential biomarkers that could be further validated, which may support better predicting and monitoring progression to severe COVID‐19. Study workflow. In‐depth proteome profiling was performed on 137 serum samples collected from 25 patients with COVID‐19 (non‐severe, n = 13; severe, n = 12). First, differentially expressed proteins (DEPs) between the two groups were identified. Then, the abundance of each candidate protein at admission and the risk probability of progression to severe COVID‐19 were evaluated. Also, dynamics of DEPs in patients with disease progression to severe COVID‐19 was assessed.

MATERIALS AND METHODS

Patients and samples

A total of 137 serum samples (sampled at admission, n = 25; longitudinally sampled during hospitalization, n = 112) were collected from 25 patients with laboratory‐confirmed SARS‐CoV‐2 infection (Figure S1). Each sampling time point is described in Table S1. Patients were from two tertiary hospitals in South Korea (National Medical Center and Seoul Medical Center). The patients’ clinical status was assessed daily during hospitalization, and they were classified into two groups, namely non‐severe patients and patients who progressed to severe COVID‐19, according to the 8‐point ordinal scale (defined in Table S2) as described by Messner et al. [12, 13]. This study was approved by the institutional review board (IRB No. H‐2004‐153‐1118).

Sample preparation

The protein digestion process was optimized to 2 μL of serum as previously described, with some modifications [14]. Briefly, 23 μL of digestion buffer [8 M urea, 10 mM Tris (2‐carbocyethyl) phosphine, and 50 mM chloroacetamide in 50 mM ammonium bicarbonate] was added to 2 μL of serum. The mixture was incubated for 25 min at 60°C for simultaneous denaturation, reduction, and alkylation of proteins. After cooling to room temperature, protein digestion was performed at 37°C overnight using a trypsin/LysC mixture at a 100:1 protein‐to‐protease ratio. The second digestion was performed at 37°C for 2 h using trypsin (enzyme‐to‐substrate ratio [w/w] of 1:1000). All resulting peptides were acidified with 10% trifluoroacetic acid (TFA). The acidified peptides were loaded onto a custom styrenedivinylbenzene reversed‐phase sulfonate (SDB‐RPS)‐StageTips, as described previously [14]. Following washing with 0.2% TFA, the peptides were eluted with 80% acetonitrile containing 1% ammonia. The eluate was dehydrated in a vacuum‐centrifuge and stored at ‐80°C.

Liquid chromatography‐tandem mass spectrometry (LC‐MS/MS) and data analysis

All LC‐MS/MS analyses were performed using Quadrupole Orbitrap mass spectrometers, Q‐Exactive Plus (Thermo Fisher Scientific, Waltham, MA, USA) coupled to an Ultimate 3000 RSLC system (Dionex, Sunnyvale, CA, USA) via a nano electrospray source, as described with some modifications [15]. Peptide samples were separated on a 2‐column setup, with a trap column (75 μm I.D. × 2 cm, C18 3 μm, 100 Å) and an analytical column (50 μm I.D. × 15 cm, C18 1.9 μm, 100 Å). Prior to sample injection, the dried peptide samples were redissolved in solvent A (2% acetonitrile and 0.1% formic acid). After the samples were loaded onto the nano LC, a 90‐min gradient from 8% to 26% solvent B (100% acetonitrile and 0.1% formic acid) was applied to all samples. The spray voltage was 2.0 kV in the positive ion mode, and the temperature of the heated capillary was set to 320°C. The data‐independent acquisition (DIA) method consisted of a survey scan at 35,000 resolution from 400 to 1220 m/z (AGC target of 3 × 106 or 60 ms injection time). Then, 19 DIA windows were acquired at 35,000 resolutions with an automatic gain control target of 3 × 106 and auto for injection time [16]. The stepped collision energy was 10% at 27%.

Spectral library generation

To generate the spectral libraries for DIA, 24 data‐dependent acquisition (DDA) measurements of the immunodepleted plasma samples were performed. DDA spectra were searched using the Maxquant against Uniprot Human Database (December 2014, 88,657 entries) and the iRT standard peptide sequence. A spectral library for DIA‐MS was generated using spectral library generation in Spectronaut PulsarX (Biognosys, Schlieren, Switzerland). In addition, we used Spectronaut Pulsar's protein identification algorithm for spectral library generation. For the generation of Spectronaout Pulsar spectral libraries, DIA raw data were directly loaded into Spectronaut Pulsar, and spectral library generation was performed using the default settings

Data processing for DIA‐MS

The DIA data of individual samples were analysed with Spectronaut 13 (Biognosys, Schlieren, Switzerland). First, the DIA raw files were converted into an htrm format using the GTRMS Converter provided by Spectronaut. The false discovery rate (FDR) was estimated with the mProphet [17] approach and set to 1% at the peptide precursor level and 1% at the protein level. The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE [18] partner repository with the dataset identifier PXD023686. The proteins were inferred by the software, and the quantification information was acquired at the protein level by using the q‐value < 0.01 criteria, which was used for subsequent analyses. The relative abundance of all proteins in all longitudinal samples were presented in Table S3.

Quantification and statistical analysis

Differential expression analysis

Serum protein levels were estimated using the protein group quantity calculated using Spectronaut software. Proteins with > 50% missing ratios in a particular patient group were removed. Following log2 transformation, missing values were imputed based on a normal distribution (width  =  0.3, downshift =  1.8) to simulate signals of low‐abundance proteins. Fold change was calculated using the mean of a particular patient group. DEPs between the two groups were identified using the Mann‐Whitney U test. The level of statistical significance was set as adjusted p value < 0.05, and the FDR was corrected using Benjamini and Hochberg for multiple testing. Protein abundances were subjected to z‐normalization followed by hierarchical clustering with Pearson's correlation distance.

Pathway analysis

Canonical pathways, disease and functions, and protein networks were evaluated by Ingenuity Pathway Analysis (IPA, QIAGEN, Hilden, Germany) based on the DEPs in the serum. The analytical algorithms embedded in IPA uses lists of DEPs to predict biological processes and pathways. Additionally, a tree map was constructed, in which the major boxes represented categories of related biological functions or diseases. The statistical significance of both the gene ontology (GO) classification and enrichment analysis was determined by Fisher's exact test. All statistical tests were two‐sided, and p value < 0.05 was considered statistically significant. Wiki pathway and GO biological processes were identified using the Enrichr online tool (http://amp.pharm.mssm.edu/Enrichr/). The top ten enriched terms (and their p values) were identified.

Statistical analysis

We evaluated the association between the level of each candidate protein marker (i.e., DEPs) at admission and estimated the risk of disease progression using the Cox proportional hazard model. Hazard ratios with 95% confidence intervals (CIs) were calculated using Cox proportional hazard regression. The best cut‐off value (BCV) of each candidate protein was determined using maximally selected log‐rank statistics, as previously described [19]. The time to clinical progression to severe COVID‐19 was plotted using Kaplan–Meier curves, and compared with a log‐rank test; p value < 0.05 was considered statistically significant. We further explored the effect of longitudinal changes in each candidate protein level on the risk of clinical progression using a linear mixed‐effects model; the level of statistical significance was set as adjusted p value < 0.05, FDR corrected [20]. Analysis was performed using commercially available software (SPSS 19.0, Stata SE12.0, and SAS version 9.4).

RESULTS

Clinical characteristics of patients

The demographics and clinical characteristics of the study patients are summarized in Table 1 and Figure S2. The median age was 70 years (interquartile range [IQR], 65 to 76 years), and 48.0% of the patients were male. The median time from symptom onset to admission was 5 days (IQR, 4–9 days). All patients were followed up for >10 days after admission. Among the total population, 13 (52.0%) patients who maintained an ordinal scale ≤ 4 (category 3, n = 4; category 4, n = 9) were classified into the non‐severe group, and 12 (48.0%) patients with clinical deterioration to ordinal scale >4 (category 5, n = 9; category 6, n = 2; category 8, n = 1) were classified into the severe group. In the non‐severe group, 9 (69.2%) patients required supplemental oxygen using either nasal prongs or mask; all the patients in the non‐severe group improved and were discharged. In the severe group, the median time from admission to clinical deterioration requiring high‐flow oxygen was 5 days (IQR, 3.5‐ 7.5 days). Invasive mechanical ventilation and renal replacement therapy were administered in 25.0% (n = 3) and 8.3% (n = 1) of patients in the severe group, respectively. One patient (8.3%) died, and 11 (91.7%) patients were discharged with clinical improvement.
TABLE 1

Demographics and clinical characteristics of COVID‐19 patients

Total (N = 25)Non‐severe (n = 13)Severe (n = 12)
Age – year (median, IQR)70 (65–76)71 (64–77)69.5 (67.3–72)
Male sex – No. (%)12 (48)4 (30.8)8 (66.7)
Time from symptom onset to admission5 (4–9)5 (4–14)4 (4–8)
Eight‐point score on ordinal scale – No (%)
3: hospitalized, not requiring supplemental oxygen4 (16.0)4 (30.8)0 (0)
4: hospitalized, requiring low‐flow oxygen by nasal prongs or facial mask9 (36.0)9 (69.2)0 (0)
5: hospitalized, use of high‐flow oxygen devices or non‐invasive ventilation9 (36.0)0 (0)9 (75.0)
6: hospitalized, intubation and invasive mechanical ventilation2 (8.0)0 (0)2 (16.7)
7: hospitalized, invasive mechanical ventilation, and ECMO, RRT or both0 (0)0 (0)0 (0)
8: death1 (4.0)0 (0)1 (8.3)
Time from admission to severe disease5 (3.5–7.5)
Symptoms
Fever10 (40.0)5 (38.5)5 (41.7)
Chill4 (16.0)1 (7.7)3 (25.0)
Cough8 (32.0)2 (15.4)6 (50.0)
Sputum6 (24.0)2 (15.4)4 (33.3)
Dyspnoea3 (12.0)1 (7.7)2 (16.7)
Myalgia6 (24.0)2 (15.4)4 (33.3)
Fatigue1 (4.0)1 (7.7)0 (0)
Headache2 (8.0)1 (7.7)1 (8.3)
Rhinorrhoea1 (4.0)1 (7.7)0 (0)
Sore throat4 (16.0)2 (15.4)2 (16.7)
Epigastric pain1 (4.0)0 (0)1 (8.3)
Diarrheal2 (8.0)1 (7.7)1 (8.3)
Indigestion1 (4.0)0 (0)1 (8.3)
Olfactory and gustatory sensory dysfunction2 (8.0)1 (7.7)1 (8.3)
Comorbidity
HTN10 (40.0)6 (46.2)4 (33.3)
DM6 (24.0)4 (30.8)2 (16.7)
Dyslipidaemia4 (16.0)2 (15.4)2 (16.7)
Dementia4 (16.0)2 (15.4)2 (16.7)
Parkinson's disease1 (4.0)0 (0)1 (8.3)
Treatment
Lopinavir‐Ritonavir8 (32.0)4 (30.8)4 (33.3)
Antibiotic agent2 (8.0)0 (0)2 (16.7)
Glucocorticoid therapy2 (8.0)0 (0)2 (16.7)
Oxygen by nasal prongs or mask21 (84.0)9 (69.2)12 (100)
High flow oxygen or non‐invasive mechanical ventilation12 (48.0)0 (0)12 (100)
Invasive mechanical ventilation3 (12.0)0 (0)3 (25.0)
Renal replacement therapy1 (4.0)0 (0)1 (8.3)
ECMO0 (0)0 (0)0 (0)
Outcomes at discharge
Improved24 (96.0)13 (100)11 (91.7)
Died1 (4.0)0 (0)1 (8.3)

Abbreviations: IQR, interquartile range; ECMO, extracorporeal membrane oxygenation; RRT, renal replacement therapy; HTN, hypertension; DM, diabetes mellitus.

Demographics and clinical characteristics of COVID‐19 patients Abbreviations: IQR, interquartile range; ECMO, extracorporeal membrane oxygenation; RRT, renal replacement therapy; HTN, hypertension; DM, diabetes mellitus.

DEPs in severe COVID‐19 patient serum

Overall, 608 proteins were identified and quantified in the cross‐sectional samples. An average of 484 proteins were quantified in the individual samples (Figure S3A). To identify the differences within and between groups, protein profiles were generated by drawing multi‐scatter plots and calculating Pearson's correlation coefficients (PCCs; Figure S3B). Non‐severe and severe groups showed mean PCCs of 0.88 and 0.89, respectively. The average PCC between the non‐severe and severe groups was 0.87. Based on serum protein expression levels of patients in the non‐severe and severe groups, two groups were separated by principal component analysis (Figure S3C). After proteins were determined to be quantified by at least 50% in either the non‐severe or severe group subjects, 513 proteins were subjected to statistical analysis. We found that 46 proteins were differentially expressed in the serum, sampled immediately after patients had progressed to severe COVID‐19, in severe patients compared with that in non‐severe patients (Table 2, Figure 2A). Pathway analysis and GO enrichment analysis revealed that the DEPs were mainly involved in several biological processes associated with humoral immune response (IGLV3‐19, IGLC2, and IGHA2), IFN signalling (BNC2), acute phase response (CRP, LBP, SERPINA3, SAA1, SAA2, ITIH4, RBP4, TTR, albumin, and transferrin), inflammatory response (angiotensinogen, HSP90AA1, transketolase), lipid metabolism (APOA1, APOA2, APOA4, APOC1, APOM, and paraoxonase‐1), platelet degranulation (LEFTY2, AHSG, SEPP1, A2M, KNG1, PF4, HRG, and SERPINA4), coagulation cascade (FIX, FX, SERPINA1, and SERPING1), and cellular metabolic process (ALDOA, PIK3C2β, MAN1A, MAN1C1, Gc‐globulin, ITIH2, IGFALS, and PI16) (Figure 2B, Figure S4).
TABLE 2

Differentially expressed proteins between non‐severe and severe COVID‐19 patient groups

ProteinGeneFC p valueadjusted p value
A0A075B6J8IGLV3‐191.31670.001460.02345
A0A075B6K9IGLC21.01790.000450.01361
A0A087 × 1L7LEFTY2−0.61190.000130.01004
B1APH0BNC2−0.59720.001760.02512
D6RAR4HGFAC−0.64650.005090.04836
D6REX5SEPP1−0.46080.003610.03784
D6RF35GC−0.66780.001460.02345
K7ERI9APOC1−1.25370.001210.023
O95445APOM−0.60690.003030.0331
P00740F90.67160.005090.04836
P00742F100.38910.001760.02512
P01009SERPINA10.90940.0010.01971
P01011SERPINA31.2860.000240.01255
P01019AGT0.55870.002540.031
P01023A2M−0.74020.003610.03784
P01042KNG1−0.57410.000370.01255
P01877IGHA20.99390.002540.031
P02647APOA1−0.74960.000160.01004
P02741CRP3.10160.001460.02345
P02753RBP4−1.6150.000160.01004
P02765AHSG−1.02880.000080.01004
P02766TTR−1.2320.000550.01417
P02768ALB−0.94850.000820.01687
P02776PF4−1.77380.000370.01255
P02787TF−0.95030.000130.01004
P04075ALDOA1.60740.002540.031
P04196HRG−0.97780.001460.02345
P05155SERPING10.42390.003030.0331
P06727APOA4−1.20040.003030.0331
P07900HSP90AA11.1730.005090.04836
P0DJI8SAA13.89540.001760.02512
P0DJI9SAA24.10520.002540.031
P18428LBP0.97990.001460.02345
P19823ITIH2−0.59390.000670.01574
P27169PON1−0.69050.000370.01255
P29401TKT1.36810.00430.04407
P29622SERPINA4−1.08060.002120.02936
P33908MAN1A11.33830.002540.031
P35858IGFALS−0.72910.000820.01687
Q14624ITIH42.40590.000670.01574
Q5SWA0PIK3C2B2.03390.000550.01417
Q5T7N2L1TD1−0.62340.003030.0331
Q6UXB8PI16−1.49810.001760.02512
Q92954PRG41.19840.000370.01255
Q9NR34MAN1C17.79970.005080.04836
V9GYM3APOA2−0.96250.000040.01004

Abbreviation: FC, fold‐change (Log2 fold change was calculated.)

Samples obtained immediately after patients progressed to severe COVID‐19 (i.e., high‐flow O2 start) for the severe COVID‐19 group and samples at admission for non‐severe COVID‐19 group were assessed.

FIGURE 2

(A) Volcano plot analysis of differentially expressed proteins (DEPs) between two groups. The expression difference in protein between severe groups and non‐severe groups is plotted on the x‐axis, and false discovery rate – adjusted p value is plotted on the y‐axis (‐log10 scale). Upregulated and downregulated proteins in the severe group are represented red and blue, respectively. (B) Gene ontology enrichment analysis. Heatmap of 46 proteins that are differentially abundant in severe COVID‐19. Protein levels are presented on a low to high scale (blue – white – red).

Differentially expressed proteins between non‐severe and severe COVID‐19 patient groups Abbreviation: FC, fold‐change (Log2 fold change was calculated.) Samples obtained immediately after patients progressed to severe COVID‐19 (i.e., high‐flow O2 start) for the severe COVID‐19 group and samples at admission for non‐severe COVID‐19 group were assessed. (A) Volcano plot analysis of differentially expressed proteins (DEPs) between two groups. The expression difference in protein between severe groups and non‐severe groups is plotted on the x‐axis, and false discovery rate – adjusted p value is plotted on the y‐axis (‐log10 scale). Upregulated and downregulated proteins in the severe group are represented red and blue, respectively. (B) Gene ontology enrichment analysis. Heatmap of 46 proteins that are differentially abundant in severe COVID‐19. Protein levels are presented on a low to high scale (blue – white – red).

Early prognostic biomarkers for progression to severe COVID‐19

We revealed that several DEPs (IGLV3‐19, BNC2, CRP, LBP, RBP4, TTR, APOC1, APOA2, KNG1, PF4, and SERPINA4) were candidate predictors of progression to severe COVID‐19. Notably, IGLV3‐19, a variable domain of immunoglobulin light chains, and BNC2, an activator of genes in the IFN signalling pathway [21], are novel candidate markers that have not yet been reported in SARS‐CoV‐2 infection. Figure 3 shows the significant linear correlation between the abundance of each protein at admission and the risk probability of progression to severe COVID‐19. The levels of IGLV3‐19, CRP, LBP, and the levels of BNC2, RBP4, TTR, APOC1, APOA2, KNG1, PF4, and SERPINA4 at admission were positively and negatively correlated with the risk of clinical deterioration to severe COVID‐19, respectively (p < 0.05, Table S4).
FIGURE 3

Estimated probability of disease progression to severe COVID‐19, according to serum level of each protein. The risk probability was estimated using a Cox proportional hazard regression model. Data sampled at the time of admission for both severe and non‐severe groups were assessed. HR, hazard ratio; CI, confidence interval.

Estimated probability of disease progression to severe COVID‐19, according to serum level of each protein. The risk probability was estimated using a Cox proportional hazard regression model. Data sampled at the time of admission for both severe and non‐severe groups were assessed. HR, hazard ratio; CI, confidence interval. Next, the BCV of each candidate protein was determined, and cumulative progression rates to severe COVID‐19 were compared between the patients with high and low serum protein levels (Figure 4). Patients in the high‐IGLV3‐19, high‐CRP, and high‐LBP groups showed significantly higher progression rates, respectively, while patients in the low‐BNC2, low‐RBP4, low‐TTR, low‐APOC1, low‐APOA2, low‐KNG1, low‐PF4, and low‐SERPINA4 groups showed significantly higher progression rates (log‐rank p < 0.05).
FIGURE 4

Kaplan‐Meier estimates of cumulative clinical progression to severe COVID‐19. Cumulative progression estimates to severe COVID‐19 are present in patients with high serum levels (coloured in blue) and in patients with low serum levels (coloured in red). The best cut‐off value of each protein was determined using maximally selected log‐rank statistics. Data sampled at the time of admission for both severe and non‐severe groups were assessed. HR, hazard ratio; CI, confidence interval.

Kaplan‐Meier estimates of cumulative clinical progression to severe COVID‐19. Cumulative progression estimates to severe COVID‐19 are present in patients with high serum levels (coloured in blue) and in patients with low serum levels (coloured in red). The best cut‐off value of each protein was determined using maximally selected log‐rank statistics. Data sampled at the time of admission for both severe and non‐severe groups were assessed. HR, hazard ratio; CI, confidence interval.

Proteome dynamics during disease progression to severe COVID‐19

To further investigate the dynamics of DEPs in patients with disease progression to severe COVID‐19, we examined longitudinal changes in serum DEPs sampled at sequential time points (median, 7 time points; IQR, 6–11 time points) from each patient during hospitalization (Figure S1, Table S1). The longitudinal changes in 40 DEPs were significantly different between non‐severe patients and patients who were progressing to severe COVID‐19 (adjusted p < 0.05; Table 3, Figure S5). The abundances of CRP, SAA1, SAA2, SERPINA3, ITIH4, FIX, FX, SERPINA1, SERPING1, IGLC2, AGT, TKT, MAN1A1, PIK3C2B, PRG4, and MAN1C1 increased as disease progressed (p < 0.05); the level of LBP tended to increase. The abundance of RBP4, TTR, ALB, TF, LEFTY2, AHSG, SEPP1, A2M, KNG1, HRG, SERPINA4, BNC2, APOA1, APOA2, APOA4, APOC1, APOM, PON1, HGFAC, GC, ITIH2, IGHALS, and L1TD1 persistently decreased as the disease progressed to severe COVID‐19 (p < 0.05).
TABLE 3

Linear mixed effect model to identify which longitudinal changes of candidate protein markers were predictive of disease progression

ProteinsNon‐severe COVID‐19Severe COVID‐19Non‐severe vs. severe
BSE p valueBSE p valueFDR adjusted p value
IGLC2−0.0030.0040.5220.0770.0330.0230.026
LEFTY2−0.0010.0020.793−0.0860.0170.000<0.001
BNC20.0010.0030.801−0.0470.0200.0170.023
HGFAC0.0030.0030.253−0.0810.0220.000<0.001
SEPP1−0.0020.0020.379−0.0410.0170.0190.033
GC0.0000.0020.963−0.0640.0160.000<0.001
APOC10.0020.0030.513−0.0680.0230.0040.006
APOM0.0020.0020.331−0.0510.0160.0020.003
F90.0030.0030.2750.1030.0210.000<0.001
F10−0.0040.0020.0080.0620.0120.000<0.001
SERPINA1−0.0090.0030.0040.0990.0230.000<0.001
SERPINA3−0.0070.0040.0510.1750.0270.000<0.001
AGT−0.0080.0030.0040.1040.0200.000<0.001
A2M0.0090.0020.000−0.0560.0140.000<0.001
KNG1−0.0050.0020.004−0.0360.0130.0060.026
APOA10.0010.0020.653−0.0930.0170.000<0.001
CRP−0.0270.0150.0640.2540.1080.0210.016
RBP40.0110.0040.011−0.0910.0320.0050.003
AHSG0.0040.0030.156−0.1360.0220.000<0.001
TTR0.0110.0040.011−0.0810.0320.0130.008
ALB0.0050.0030.133−0.1290.0260.000<0.001
TF0.0030.0030.237−0.1100.0190.000<0.001
HRG0.0120.0040.002−0.0720.0280.0110.005
SERPING10.0000.0020.8210.0660.0160.000<0.001
APOA4−0.0020.0040.594−0.0820.0330.0140.025
SAA1−0.0250.0160.1060.2430.1160.0380.030
SAA2−0.0350.0170.0350.2300.1230.0650.041
LBP−0.0170.0040.0000.0590.0330.0760.030
ITIH20.0020.0020.424−0.0760.0140.000<0.001
PON10.0060.0020.011−0.0820.0170.000<0.001
TKT−0.0050.0060.3990.1510.0460.0010.002
SERPINA40.0100.0040.022−0.0840.0320.0090.006
MAN1A1−0.0080.0060.1330.1450.0420.0010.001
IGFALS−0.0010.0030.606−0.0870.0190.000<0.001
ITIH4−0.0070.0080.4040.2510.0600.000<0.001
PIK3C2B−0.0070.0070.2910.1050.0500.0380.034
L1TD10.0050.0020.026−0.0800.0170.000<0.001
PRG4−0.0030.0050.6140.1560.0410.000<0.001
MAN1C10.0490.0250.0500.6880.1840.0000.001
APOA20.0020.0030.547−0.1150.0200.000<0.001
IGLV3‐190.0070.0080.3640.0330.0580.5680.695
IGHA2−0.0060.0020.0000.0110.0120.3660.178
PF4−0.0120.0060.054−0.0960.0450.0360.077
ALDOA−0.0030.0080.7270.0220.0590.7130.707
HSP90AA10.0090.0070.1960.0790.0530.1420.211
PI160.0190.0100.0450.0010.0710.9940.808

Abbreviations: B, β‐estimate; SE, standard error; FDR, false discovery rate.

Linear mixed effect model to identify which longitudinal changes of candidate protein markers were predictive of disease progression Abbreviations: B, β‐estimate; SE, standard error; FDR, false discovery rate.

DISCUSSION

Here, we performed proteome profiling using serum samples from patients with COVID‐19 at admission and serially collected more during hospitalization. We aimed to identify potential early prognostic protein markers that could stratify high‐risk patients prior to disease progression. We then further explored DEPs, which exhibit longitudinal changes that effectively reflect the clinical progression of the disease. We found that IGLV3‐19 and BNC2 could be potentially important prognostic factor for severe COVID‐19. We observed increased abundance of IGLV3‐19 at admission in patients who finally progressed to severe COVID‐19; however, we did not find evidence of paired abundance in other immunoglobulin segments (i.e., IGHV families) in this study. Further studies of serum from larger cohorts of patients are required to more precisely understand immunoglobulin expression in severe COVID‐19. Nevertheless, the increased abundance of IGLV3‐19 in severe COVID‐19 is a characteristic immune response that can be detected in advance of disease progression. This finding may help direct the development of vaccines as well as therapeutic strategies [22]. BNC2 might play a role in the upregulation of interferon‐stimulated genes (ISGs) in the IFN signalling pathway [21]. There is a reportedly impaired IFN type 1 response with low levels of IFN‐1 and ISGs associated with persistent blood viral load in severe COVID‐19 patients [23, 24]. We speculate that dysregulation of BNC2 might affect IFN type 1 response and reflect a high risk for progression to severe COVID‐19. Of interest, which has not been reported yet, MAN1A1 and MAN1C1 persistently increased in the severe group subjects, and was significantly associated with the risk of disease progression. These are Golgi alpha‐1,2‐mannosidases, which are primarily involved in N‐glycosylation that covalently adds glycan residues to the asparagine of proteins. This post‐translational modification is increased in certain pathological conditions [25]. We speculate that MAN1A1 and MAN1C1 may reflect glycoproteomic changes in the host immune process in a patient with SARS‐CoV‐2 infection, which allows for stratification of the severity of inflammatory reactions in COVID‐19 patients. Understanding significant changes in these proteins might provide insights for therapeutic strategies in COVID‐19. Inhibited N‐glycosylation of angiotensin I‐converting enzyme 2 (ACE2) reportedly disrupts membrane fusion between SARS‐CoV and host endosomes [26, 27]. In this regard, patients infected with SARS‐CoV‐2 might benefit from the use of inhibitors of N‐glycosylation (e.g., α‐glucosidase inhibitors), although further experimental studies are needed to confirm this. Shen et al. reported that proteins, including multiple lipoproteins (APOC1, APOM), acute phase proteins (SAA1, SAA2, SAA4, CRP, SERPINA3), and proteins involved in platelet degranulation (PPBP, PF4), differentiate between severe and non‐severe patients. Messner et al. reported 27 proteins involved in inflammation including IL‐6 signalling (TNF, IFN‐γ, ITIH4, HP, LRG1, LGALS3BP), coagulation (fibrinogen, coagulation factors, SERPINA10), acute phase response (CRP, LBP, ALB, SAA1, SAA2), and complement pathway (C1R, C1S, C8A, CFB, CFI, CFH) are differentially expressed according to disease severity. In this study, we verified that a number of proteins belonging to the acute phase response, lipid metabolism, and platelet degranulation were differently expressed between the groups of non‐severe and severe patients [11, 12]. In contrast, we could not verify differential abundance of proteins belonging to the complement system. We further identified potential biomarkers differentially expressed preceding clinical deterioration in severe group and described longitudinal dynamics of the potential biomarkers during COVID‐19 disease progression. Acute phase proteins such as CRP and LBP initiate the host innate immune response to viral infection. LBP transfers lipopolysaccharides (LPS) to CD14, which subsequently bind to the toll‐like receptor 4 (TLR4)/myeloid differentiation factor 2 (MD‐2) complex. The CD14‐LPS‐MD2‐TLR4 complex initiates the signalling cascade of pro‐inflammatory cytokines such as IL‐6, IL‐8, and TNF‐a [28]. The interactions of LBP and pro‐inflammatory cytokines in viral infections have been previously documented [12, 29]. The hepatic synthesis of RBP4 and TTR generally decreases during acute inflammatory states, and although the underlying pathomechanisms still remain to be clarified, a reduction in serum RBP4 and TTR has been reported as a predictor of poor clinical outcome in other diseases such as sepsis and critical ill patients [30, 31]. Dysregulation of multiple apolipoproteins (e.g., APOA1, APOA2, APOC1, and APOM) and suppressed platelet degranulation in the serum of patients with severe COVID‐19 have been previously reported [11, 12, 32], which is in accordance with the findings of the present study. Currently, it is unclear whether SARS‐CoV‐2 infection and the pathogenesis of severe COVID‐19 directly alter platelet activity [33]. However, considering that platelets directly recognize pathogens and release antimicrobial mediators by degranulation in viral infections, dysregulation of platelet degranulation in SARS‐CoV‐2 infection implies a possible impaired immune response in severe COVID‐19 patients [34]. Our proteomic analysis further elucidates the predictive role of initial and serial measurement of these protein abundances over clinical outcomes in patients with COVID‐19. One of the limitations of our study is that our findings have not yet been externally validated. However, we performed proteomic analysis of sequential time points from each patient, which enabled us to identify reliable DEPs that were consistently differentially expressed in severe disease status, at initial assessment, and during hospitalization. Our results should be validated in an independent patient cohort. In addition, the number of included patients was small (n = 25). Future larger prospective studies are warranted to validate these potential biomarkers. In summary, this study investigated the proteomics dynamics through longitudinal proteomic profiling on serum samples from patients with COVID‐19, and demonstrates proteins significantly changed in severe versus non‐severe disease. Our findings may give some new insights into host response to SARS‐CoV‐2 infection, which are valuable for understanding of disease progression to severe COVID‐19. Our data could be integrated with additional data in the near future to develop protein biomarkers, which might enable better predicting and monitoring progression to severe COVID‐19.

CONFLICT OF INTEREST

The authors declare no conflict of interest. Table S1 Click here for additional data file. Supporting Information Click here for additional data file.
  32 in total

1.  Proteomic Biomarkers to Guide Stratification for Covid-19 Treatment: Exemplifying a Path Forward Toward Implementation?

Authors:  Joachim Jankowski
Journal:  Proteomics       Date:  2020-11-03       Impact factor: 3.984

2.  Longitudinal proteomic profiling provides insights into host response and proteome dynamics in COVID-19 progression.

Authors:  Jee-Soo Lee; Dohyun Han; So Yeon Kim; Ki Ho Hong; Myoung-Jin Jang; Man Jin Kim; Young-Gon Kim; Jae Hyeon Park; Sung Im Cho; Wan Beom Park; Kyung Bok Lee; Ho Seob Shin; Hyeon Sae Oh; Taek Soo Kim; Sung Sup Park; Moon-Woo Seong
Journal:  Proteomics       Date:  2021-05-14       Impact factor: 3.984

3.  Circulating retinol binding protein 4 in critically ill patients before specific treatment: prognostic impact and correlation with organ function, metabolism and inflammation.

Authors:  Alexander Koch; Ralf Weiskirchen; Edouard Sanson; Henning W Zimmermann; Sebastian Voigt; Hanna Dückers; Christian Trautwein; Frank Tacke
Journal:  Crit Care       Date:  2010-10-08       Impact factor: 9.097

Review 4.  Platelets and infection - an emerging role of platelets in viral infection.

Authors:  Alice Assinger
Journal:  Front Immunol       Date:  2014-12-18       Impact factor: 7.561

5.  Proteomic and Metabolomic Characterization of COVID-19 Patient Sera.

Authors:  Bo Shen; Xiao Yi; Yaoting Sun; Xiaojie Bi; Juping Du; Chao Zhang; Sheng Quan; Fangfei Zhang; Rui Sun; Liujia Qian; Weigang Ge; Wei Liu; Shuang Liang; Hao Chen; Ying Zhang; Jun Li; Jiaqin Xu; Zebao He; Baofu Chen; Jing Wang; Haixi Yan; Yufen Zheng; Donglian Wang; Jiansheng Zhu; Ziqing Kong; Zhouyang Kang; Xiao Liang; Xuan Ding; Guan Ruan; Nan Xiang; Xue Cai; Huanhuan Gao; Lu Li; Sainan Li; Qi Xiao; Tian Lu; Yi Zhu; Huafen Liu; Haixiao Chen; Tiannan Guo
Journal:  Cell       Date:  2020-05-28       Impact factor: 41.582

6.  Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China.

Authors:  Chaolin Huang; Yeming Wang; Xingwang Li; Lili Ren; Jianping Zhao; Yi Hu; Li Zhang; Guohui Fan; Jiuyang Xu; Xiaoying Gu; Zhenshun Cheng; Ting Yu; Jiaan Xia; Yuan Wei; Wenjuan Wu; Xuelei Xie; Wen Yin; Hui Li; Min Liu; Yan Xiao; Hong Gao; Li Guo; Jungang Xie; Guangfa Wang; Rongmeng Jiang; Zhancheng Gao; Qi Jin; Jianwei Wang; Bin Cao
Journal:  Lancet       Date:  2020-01-24       Impact factor: 79.321

7.  Antimicrobial Photodynamic Therapy in the Control of COVID-19.

Authors:  Adelaide Almeida; M Amparo F Faustino; Maria G P M S Neves
Journal:  Antibiotics (Basel)       Date:  2020-06-11

8.  Comparison of serum protein profiles between major depressive disorder and bipolar disorder.

Authors:  Sang Jin Rhee; Dohyun Han; Yunna Lee; Hyeyoung Kim; Junhee Lee; Kangeun Lee; Hyunsuk Shin; Hyeyoon Kim; Tae Young Lee; Minah Kim; Se Hyun Kim; Yong Min Ahn; Jun Soo Kwon; Kyooseob Ha
Journal:  BMC Psychiatry       Date:  2020-04-03       Impact factor: 3.630

View more
  6 in total

Review 1.  Advances and Utility of the Human Plasma Proteome.

Authors:  Eric W Deutsch; Gilbert S Omenn; Zhi Sun; Michal Maes; Maria Pernemalm; Krishnan K Palaniappan; Natasha Letunica; Yves Vandenbrouck; Virginie Brun; Sheng-Ce Tao; Xiaobo Yu; Philipp E Geyer; Vera Ignjatovic; Robert L Moritz; Jochen M Schwenk
Journal:  J Proteome Res       Date:  2021-10-21       Impact factor: 5.370

2.  Dynamics of circulating calprotectin accurately predict the outcome of moderate COVID-19 patients.

Authors:  Nicolas Chapuis; Nusaibah Ibrahimi; Thibaut Belmondo; Claire Goulvestre; Anne-Emmanuelle Berger; Alice-Andrée Mariaggi; Muriel Andrieu; Camille Chenevier-Gobeaux; Arnaud Bayle; Lydia Campos; Cherifa Cheurfa; Richard Chocron; Jean-Luc Diehl; Benoît Doumenc; Jérôme Duchemin; Manon Duprat; Fabien François; Nicolas Gendron; Tristant Mirault; Frédéric Pène; Aurélien Philippe; Fanny Pommeret; Olivier Sanchez; David M Smadja; Tali-Anne Szwebel; Aymeric Silvin; Florent Ginhoux; Ludovic Lacroix; Gérôme Jules-Clément; Sarobidy Rapeteramana; Colette Mavier; Laura Steller; Barbara Perniconi; Fabrice André; Damien Drubay; Michaela Fontenay; Sophie Hüe; Stéphane Paul; Eric Solary
Journal:  EBioMedicine       Date:  2022-05-26       Impact factor: 11.205

3.  Longitudinal proteomic profiling provides insights into host response and proteome dynamics in COVID-19 progression.

Authors:  Jee-Soo Lee; Dohyun Han; So Yeon Kim; Ki Ho Hong; Myoung-Jin Jang; Man Jin Kim; Young-Gon Kim; Jae Hyeon Park; Sung Im Cho; Wan Beom Park; Kyung Bok Lee; Ho Seob Shin; Hyeon Sae Oh; Taek Soo Kim; Sung Sup Park; Moon-Woo Seong
Journal:  Proteomics       Date:  2021-05-14       Impact factor: 3.984

4.  Differential Co-Expression Network Analysis Reveals Key Hub-High Traffic Genes as Potential Therapeutic Targets for COVID-19 Pandemic.

Authors:  Aliakbar Hasankhani; Abolfazl Bahrami; Negin Sheybani; Behzad Aria; Behzad Hemati; Farhang Fatehi; Hamid Ghaem Maghami Farahani; Ghazaleh Javanmard; Mahsa Rezaee; John P Kastelic; Herman W Barkema
Journal:  Front Immunol       Date:  2021-12-15       Impact factor: 7.561

Review 5.  COVIDomics: The Proteomic and Metabolomic Signatures of COVID-19.

Authors:  Michele Costanzo; Marianna Caterino; Roberta Fedele; Armando Cevenini; Mariarca Pontillo; Lucia Barra; Margherita Ruoppolo
Journal:  Int J Mol Sci       Date:  2022-02-22       Impact factor: 5.923

6.  Combining Deep Phenotyping of Serum Proteomics and Clinical Data via Machine Learning for COVID-19 Biomarker Discovery.

Authors:  Antonio Paolo Beltrami; Maria De Martino; Emiliano Dalla; Matilde Clarissa Malfatti; Federica Caponnetto; Marta Codrich; Daniele Stefanizzi; Martina Fabris; Emanuela Sozio; Federica D'Aurizio; Carlo E M Pucillo; Leonardo A Sechi; Carlo Tascini; Francesco Curcio; Gian Luca Foresti; Claudio Piciarelli; Axel De Nardin; Gianluca Tell; Miriam Isola
Journal:  Int J Mol Sci       Date:  2022-08-15       Impact factor: 6.208

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.