Yang Chen1,2, Hangping Yao3, Nan Zhang1,2, Jie Wu3, Shuaixin Gao1, Jiangtao Guo1, Xiangyun Lu3, Linfang Cheng3, Rui Luo3, Xue Liang3, Catherine C L Wong1,2,4,5,6, Min Zheng3. 1. Center for Precision Medicine Multi-Omics Research, Peking University Health Science Center, Peking University, Beijing 100191, China. 2. School of Basic Medical Sciences, Peking University Health Science Center, Beijing 100191, China. 3. State Key Laboratory for Diagnosis and Treatment of Infectious Disease, National Clinical Research Center for Infectious Diseases, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou 310003, China. 4. Peking University First Hospital, Beijing 100034, China. 5. Peking-Tsinghua Center for Life Sciences, Beijing 100871, China. 6. Advanced Innovation Center for Human Brain Protection, Capital Medical University, Beijing 100069, China.
Abstract
The COVID-19 pandemic has become a worldwide health crisis. So far, most studies have focused on the epidemiology and pathogenesis of this infectious disease. Little attention has been given to the disease sequelae in patients recovering from COVID-19, and nothing is known about the mechanisms underlying these sequelae. Herein, we profiled the serum proteome of a cohort of COVID-19 patients in the disease onset and recovery stages. Based on the close integration of our proteomic analysis with clinical data, we propose that COVID-19 is associated with prolonged disorders in cholesterol metabolism and myocardium, even in the recovery stage. We identify potential biomarkers for these disorders. Moreover, severely affected patients presented more serious disturbances in these pathways. Our findings potentially support clinical decision-making to improve the prognosis and treatment of patients.
The COVID-19 pandemic has become a worldwide health crisis. So far, most studies have focused on the epidemiology and pathogenesis of this infectious disease. Little attention has been given to the disease sequelae in patients recovering from COVID-19, and nothing is known about the mechanisms underlying these sequelae. Herein, we profiled the serum proteome of a cohort of COVID-19patients in the disease onset and recovery stages. Based on the close integration of our proteomic analysis with clinical data, we propose that COVID-19 is associated with prolonged disorders in cholesterol metabolism and myocardium, even in the recovery stage. We identify potential biomarkers for these disorders. Moreover, severely affected patients presented more serious disturbances in these pathways. Our findings potentially support clinical decision-making to improve the prognosis and treatment of patients.
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and the disease it causes,
coronavirus disease 2019 (COVID-19), first emerged in 2019 and have proceeded to cause a
global health pandemic that continues on an unprecedented scale until today. The number of
COVID-19deaths around the world has surpassed 3.5 million, with more than 170 million
peopleinfected. Although first characterized as a primarily respiratory syndrome, COVID-19
is now known to involve multiple organ systems, including the heart, gastrointestinal tract,
kidneys, liver, muscle, and central nervous system.[1−3] Several independent proteomic studies have revealed that SARS-CoV-2infection triggers disruption of multiple biological pathways, including the immune
response, coagulation, and metabolism.[4−7] Furthermore, studies found that in patients who had
recovered from COVID-19, 87.4% reported persistence of fatigue and dyspnea. However, the
underlying mechanisms are unknown and, therefore, no strategies exist to prevent these
sequelae.[8] Thus, there is an urgent need for a systematic understanding
of the disease mechanisms on a molecular level throughout the entire course of the disease
including recovery.Here, to have a complete picture of the disease onset, progression, and recovery, we
established a long-term cohort of 10 patients with a 6 month follow-up. We carried out
quantitative proteomic analysis of patient samples and we analyzed the data in conjunction
with clinical results from regular blood tests. We found that the levels of a group of serum
proteins related to infection changed after disease onset but were restored in the recovery
stage, possibly representing disease stress reactions. Intriguingly, we found that another
group of proteins changed during the disease stage but did not return to normal in the
recovery stage, suggesting the presence of long-lasting disorders. Pathways functionally
linked to the persistently changed proteins included blood coagulation, fibrin clot
formation, cholesterol metabolism, and cardiopathy. These prolonged disturbances are more
serious in severe than moderate patients, suggesting a possible correlation of these
disorders with disease severity. We also identified potential biomarkers that could be used
as indicators to prevent those disorders in advance before clinical symptoms emerge.Our study brings attention to patient prognosis in the recovery stage. The combination of
quantitative proteomic data and regular clinical serum tests shed light on the molecular
changes that occur during both the disease and recovery stages. Our results potentially
support clinical decision-making to improve the prognosis and treatment of patients.
Experimental Section
Sample Collection
Serum samples were collected from The First Affiliated Hospital, School of Medicine,
Zhejiang University. Blood samples for research purposes were collected in pro-coagulation
vacuum tubes using standard venepuncture protocols. Serum was extracted by centrifugation
for 10 min at 3000 rpm. The serum samples were inactivated at 65 °C for 1 h and
subsequently stored at −80 °C before use. The patients with H7N9 avian
influenza virus infection were recruited between 1 April and 10 June 2013. The H1N1 flu
patients were retrospectively selected from the Inpatient Department of The First
Affiliated Hospital of Zhejiang University School of Medicine. Patients were diagnosed
according to their clinical symptoms and qPCR.The study was approved by the Ethics Committee of The First Affiliated Hospital of
Zhejiang University School of Medicine (ethical approval no. 2020IIT-255). All experiments
with live virus were performed in a biosafety level 3 laboratory approved by The First
Affiliated Hospital, School of Medicine Zhejiang University.
Depletion of Highly Abundant Proteins from Serum Samples
High-Select Top14 Abundant Protein Depletion of serum samples was conducted according to
the manufacturer instructions of “Multi Affinity Removal Column, Human-14”
(Agilent Technologies). Briefly, 20 μL of serum and 60 μL of buffer A were
mixed evenly, transferred into a Spin-X centrifuge tube filter 0.22 μm cellulose
acetate certified tube (COSTOR), and subsequently centrifuged at 16,000g
for 1 min. The filtrate was removed into 29 × 5 mm preassembled plastic springs for
High-Select Top14 Abundant Protein Depletion using a liquid chromatography system (Agilent
Technologies 1290). The depleted samples were collected and the protein concentration was
measured using a BCA Protein Assay Kit (Thermo Scientific) following the
manufacturer’s instructions.
Sample Preparation for DIA Analysis
The depleted serum sample (50 μg) was precipitated by 1/3 volume of trichloroacetic
acid (TCA) solution at 4 °C overnight and then centrifuged at
16,000g for 30 min at 4 °C. The precipitate was washed three times
with acetone and then dried with a vacuum concentrator (Labconco, USA). The dried
precipitate was dissolved in 40 μL of 8 M urea (pH 8.5), incubated with 20 mM
(2-carboxyethyl)phosphine hydrochloride (TCEP), and alkylated with 40 mM IAA. The mixture
was diluted with 200 μL of 100 mM Tris–HCl buffer (pH 8.5) to a final
concentration of 1.3 M urea and then digested with 3 μg of trypsin protease at 37
°C for 16 h. The sample was desalted using a Monospin C18 column (GL Science, Tokyo,
Japan). The desalted peptides were vacuum-centrifuged to dryness and reconstituted in
Milli-Q water with 0.1% formic acid (FA) for LC–MS analysis. Indexed retention time
(IRT) calibration peptides were spiked into the samples before DIA analysis.
Sample Preparation for Spectral Library Generation
The mixed protein (88 μg) from all samples was processed as above. Purified
peptides were dissolved in 80 μL of buffer (98% H2O and 2%
acetonitrile).
High-pH Reversed-Phase Fractionation
The 80 μL mixed peptides were fractioned into 62 fractions within 80 min on a
chromatographic system (Waters Xevo ACQUITY UPLC, USA). The first fraction was mixed with
the 62nd fraction, the second was mixed with the 61st fraction, and so on. All 31 mixed
fractions were vacuum-centrifuged to dryness and dissolved in 10 μL Milli-Q water
with 0.1% formic acid (FA). IRT peptides were spiked before data-dependent acquisition
analysis.
Liquid Chromatography
Peptide separation was carried out on a nanoElute liquid chromatography system (Bruker
Daltonics). Digested peptides (200 ng) were separated within 90 min at a flow rate of 300
nL/min on a homemade column (25 cm × 75 μm, 1.5 μm C18-AQ particles (Dr.
Maisch)). Mobile phases A and B were water and ACN with 0.1% formic acid, respectively.
The %B was linearly increased from 2 to 22% within 70 min, then increased to 37% within 8
min, then further increased to 95% within 5 min, and finally maintained at 95% for the
last 7 min.
Mass Spectrometry
All 31 fraction samples were analyzed on a hybrid TIMS quadrupole time-of-flight mass
spectrometer (Bruker timsTOF Pro) via a CaptiveSpray nanoelectrospray ion source. The mass
spectrometer was operated in data-dependent mode for generation of the spectral library
and in data-independent mode for sample analysis. The accumulation and ramp time were set
at 100 ms each and the mass spectra were recorded in the range from m/z
100 to 1700 in positive electrospray mode. The ion mobility was scanned from 0.6 to 1.6 V
s/cm2. The overall acquisition cycle of 1.16 s comprised 1 full TIMS-MS scan
and 10 PASEF MS/MS scans. We define quadrupole isolation windows as a function of the TIMS
scan time to achieve seamless and synchronous ramps for all applied voltages. Eight
windows were defined for single 100 ms TIMS scans according to the m/z
ion mobility plane. During scanning mode, the collision energy was ramped linearly as a
function of the mobility from 59 eV at 1/K0 = 1.6 V s/cm2 to 20 eV
at 1/K0 = 0.6 V s/cm2. For PRM confirmation, the peptide precursors
of selected differential protein precursors were scheduled with 120 min retention time
windows using a Q-Exactive HF mass spectrometer (Thermo Fisher Scientific, CA, USA). The
PRM quantification was performed by Skyline software.
Generation of Spectral Libraries and DIA Data Analysis
Spectral libraries were generated with Spectronaut version 14.2 (Biognosys) and all the
parameters were default. The MS/MS spectra were matched against the human UNIPROT database
(20,421 human entries, downloaded July 2019) and the SARS-CoV-2 UNIPROT database.
Statistical Analysis
OmicsBean software was used for data analysis including data imputation, normalization,
and principal component analysis (PCA). Genefilter was used in calculation of the
fold-change values of proteins. A fold change of 2, a fold change of <0.5, and a
P value (t test) of 0.05 were used to filter
differential expression proteins. Mfuzz was used to detect different subclustering models
of gene expression among groups. Venn diagram, heatmap, and network visualization were
performed using the ggplot228 package and Cytoscape v.3.5.129 implemented in the OmicsBean
workbench. Ingenuity pathway analysis (IPA) was performed to explore the downstream
effects in the dataset of significantly up- or downregulated proteins. The
z-score algorithm was used to predict the activation state (either
activated or inhibited) of each biological process. If the z-score
≤ −2, then the process is predicted to be statistically significantly
inhibited.
Data Availability
The experimental data that support the findings of this study have been deposited in
iProX (integrated proteome resources) of ProteomeXchange with the accession code
PXD023305. Reference FASTA files contain human UNIPROT database (only reviewed entries)
(20,421 human entries, downloaded July 2019) and SARS-CoV-2 UNIPROT database that combined
with SARS protein database (38 viral entries, April 2020). The latest GO database (https://www.ebi.ac.uk/QuickGO/) was used for
pathway enrichment analysis. The source data underlying Figures c–h, 4c,d, and 5d, and
Figures S1, S2c, S3, S4, and S5b are provided as a source data file. Source
data are provided with this paper.
Figure 3
Long-term disturbance of pathways related to cholesterol metabolism in COVID-19
patients, extending from the disease stage to the recovery stage. (a, b) Interaction
diagrams of the cholesterol metabolic process, regulation of cholesterol transport,
and regulation of cholesterol esterification when moderate patients (a, top) or severe
patients (b, top) are compared to healthy controls and when recovered moderate
patients (a, bottom) or recovered severe patients (b, bottom) are compared to healthy
controls. Proteins selected for analysis meet the conditions of fold change > 2 and
fold change < 0.5; P (t test) < 0.05. The
significance of the pathways represented by −log(P value)
(Fisher’s exact test) is shown by the color scale at the bottom of panel (b),
with dark blue representing the highest significance. The color bar from orange to
green represents the fold change of the protein level from increasing to decreasing.
Fold change indicates the protein level of the moderate or severe group to the healthy
group. (c, d) Box-and-whisker plots showing the relative intensities of the indicated
proteins in moderate (c) or severe (d) COVID-19 patients. These proteins are potential
markers for long-term disorders of cholesterol metabolism. One-way ANOVA was used to
analyze the data. For the healthy group, n = 10; for the moderate
COVID-19 group, n = 4; for the recovered moderate COVID-19 group,
n = 4; for the severe COVID-19 group, n = 6; for
the recovered severe COVID-19 group, n = 6. Data are presented as
median, upper and lower quartile, and min and max. (e) Long-term monitoring of total
serum cholesterol levels in COVID-19 patients throughout the disease and recovery
stages. (f–h) Comparison of serum total cholesterol levels of COVID-19 patients
(f), influenza A patients (g), and avian influenza H7N9 patients (h) in days since
disease onset. The orange lines indicate ICU admission; the blue lines indicate no ICU
admission.
Figure 4
Long-term disturbance of pathways related to the myocardium function in COVID-19
patients, extending from the disease stage to the recovery stage. (a, b) Interaction
diagrams of the regulation of cardiac muscle hypertrophy, cardiac muscle tissue
development, cardiocyte differentiation, and cardiovascular system development when
moderate patients (a, left) or severe patients (b, left) are compared to healthy
controls and when recovered moderate patients (a, right) or severe recovered patients
(b, right) are compared to healthy controls. Proteins selected for analysis meet the
conditions of fold change > 2 and fold change < 0.5; P
(t test) < 0.05. The significance of the pathways represented by
−log(P value) (Fisher’s exact test) is indicated by
the color scale (b, bottom), with dark blue representing the highest significance.
Fold change indicates the protein level of the moderate or severe group to the healthy
group. (d) Box-and-whisker plots showing the relative intensities of the indicated
proteins in moderate (c) or severe (d) COVID-19 patients. These proteins are potential
markers for long-term disorders of the myocardium. One-way ANOVA was used to analyze
the data. For the healthy group, n = 10; for the moderate COVID-19
group, n = 4; for the recovered moderate COVID-19 group,
n = 4; for the severe COVID-19 group, n = 6; for
the recovered severe COVID-19 group, n = 6. Data are presented as
median, upper and lower quartile, and min and max.
Figure 5
Quantitative proteomic analysis reveals disturbances in pathways related to
cholesterol metabolism and myocardium function when severe COVID-19 patients are
compared to moderate patients. (a) Proteomic changes in healthy controls, moderate
patients, and severe patients. Heatmap of 157 proteins that are significantly changed
when severe COVID-19 patients are compared to moderate COVID-19 patients. The color
bar from red to blue represents the fold change from increasing to decreasing of all
proteins identified in each group. Hierarchical clustering shows a clear group
differentiation according to similarity. Intensity profiles and selected enriched KEGG
pathways are indicated for the marked clusters. The color bar represents z-score
change from −1 to 1. (b, c) Interaction diagrams of regulation of
cardiovascular system development (b) or the cholesterol metabolic process, regulation
of cholesterol transport, and regulation of cholesterol esterification (c) when severe
COVID-19 patients are compared to moderate COVID-19 patients. Proteins selected for
analysis meet the condition of fold change > 2 and fold change < 0.5;
P (t test) < 0.05. The significance of the
pathways represented by −log(P value) (Fisher’s exact
test) is indicated by the color scale, with dark blue representing the most
significant difference. Fold change indicates the protein level of the severe to
moderate group. (d) Box-and-whisker plots showing the relative intensities of proteins
related to cholesterol metabolism or myocardium function in healthy controls, moderate
COVID-19 patients, or severe COVID-19 patients. One-way ANOVA was used to analyze the
data. For the healthy group, n = 10; for the moderate COVID-19 group,
n = 4; for the severe COVID-19 group, n = 6. Data
are presented as median, upper and lower quartile, and min and max.
Results
Quantitative Proteomic Profiling of COVID-19 Serum in Both Disease and Recovery
Stages
We collected serum samples from a cohort of 10 COVID-19patients and 10 healthy donors.
Four of the patients were diagnosed with moderate symptoms and six were diagnosed with
severe symptoms according to the Chinese management guideline for COVID-19 (version 6.0)
(http://www.nhc.gov.cn/yzygj/s7653p/202002/8334a8326dd94d329df351d7da8aefc2/files/b218cfeb1bc54639af227f922bf6b817.pdf).
Of note, patients were monitored by regular blood testing at multiple time points during
the entire disease process from admission to recovery in a time range of about 6 months
(Figure a and Tables S1 and S2). The samples for quantitative proteomics were taken from
each patient at two time points (Figure a). The
first sample was taken shortly after the patient was diagnosed with COVID-19, as confirmed
by a positive test for SARS-CoV-2 based on real-time PCR (RT-PCR) analysis of a
throat-swab specimen. The second sample was taken several days after the patient tested
negative for SARS-CoV-2. To tackle the specific proteomic features of severe cases, we
also obtained serum samples from a further six patients diagnosed with moderate COVID-19
to constitute a cohort containing 10 healthy controls, 10 moderate COVID-19patients, and
6 severe COVID-19patients (Figure b and
Table S2). All samples from the two cohorts were first subjected to
depletion of high-abundance serum proteins and then processed by data-independent
acquisition (DIA) mass spectrometry. The resulting quantitative proteomic data were
subsequently analyzed for enriched KEGG pathways (Figure b). The proteomic data were correlated with clinical indicators to obtain more
accurate dissection of the results.
Figure 1
Summary of COVID-19 patient samples and experimental design. (a) Timeline of the
disease course and sample acquisition in 10 COVID-19 patients including 6 severely
affected patients (upper bars) and 4 moderately affected patients (lower bars).
Further details are presented in Tables S1 and S2. For each patient, the disease stage (yellow bar) and
recovery stage (green bar) are separated by a negative nasopharyngeal swab real-time
PCR test for SARS-CoV-2. Samples were taken for proteomic studies during both the
disease stage (thick red lines) and the recovery stage (thick blue lines). The
triangles represent the time points for testing blood total cholesterol (purple
triangles) and profiling myocardial enzymes (white triangles). (b) Experimental design
for the quantitative proteomic analysis in this study. A total of 30 serum samples
were taken from three groups (10 from healthy donors and 20 from patients in both the
disease stage and the recovery stage as marked in A) and processed for quantitative
proteomic analysis. Green, yellow, and red represent healthy donors, moderate
patients, and severe patients, respectively. At the same time, by adding another six
samples from moderate patients, a total of 26 samples were analyzed from three new
groups: 10 samples from healthy donors, 10 samples from moderate patients, and 6
samples from severe patients.
Summary of COVID-19patient samples and experimental design. (a) Timeline of the
disease course and sample acquisition in 10 COVID-19patients including 6 severely
affected patients (upper bars) and 4 moderately affected patients (lower bars).
Further details are presented in Tables S1 and S2. For each patient, the disease stage (yellow bar) and
recovery stage (green bar) are separated by a negative nasopharyngeal swab real-time
PCR test for SARS-CoV-2. Samples were taken for proteomic studies during both the
disease stage (thick red lines) and the recovery stage (thick blue lines). The
triangles represent the time points for testing blood total cholesterol (purple
triangles) and profiling myocardial enzymes (white triangles). (b) Experimental design
for the quantitative proteomic analysis in this study. A total of 30 serum samples
were taken from three groups (10 from healthy donors and 20 from patients in both the
disease stage and the recovery stage as marked in A) and processed for quantitative
proteomic analysis. Green, yellow, and red represent healthy donors, moderate
patients, and severe patients, respectively. At the same time, by adding another six
samples from moderate patients, a total of 26 samples were analyzed from three new
groups: 10 samples from healthy donors, 10 samples from moderate patients, and 6
samples from severe patients.The raw data were first processed using a double boundary Bayes (DBB) imputation method
and were standardized for further analysis. The data showed clear stratification of the
different patient/control groups according to principal component analysis (PCA) (Figure a). A total of 1056 proteins were identified
from all samples. The levels of 261 and 277 proteins significantly changed in moderate and
severe COVID-19patients, respectively, compared to healthy controls. Surprisingly, in
recovered moderate and recovered severe patients, 243 and 163 proteins were significantly
changed compared to healthy controls, and of these, 113 and 88 proteins overlapped with
the disease-stage proteins (Figure b). Thus, a
substantial number of proteins remain abnormal in the recovery stage. Next, subclustering
analysis was performed on the significantly changed 261 or 277 proteins corresponding to
moderate diseases to analyze their variation at health, disease, or recover stages. When
moderate COVID-19patients were compared to healthy controls, the significantly changed
proteins showed four different subclustering patterns (Figure c). Subclusters 1 and 2 contain proteins that changed due to
SARS-CoV-2 infection but were restored in the recovery stage. The enriched KEGG pathways
include proteasome, glycosaminoglycan degradation, lysosome, and so forth, which partially
represent the stress reaction to viral infection (Figure c). Among these proteins are GP70, CRP, SAA1, and SAA2, which are known to be
biomarkers for acute phase response after infection and other stimuli[9−11] (Figure S1). However, more interestingly, subclusters 3 and 4 contain
proteins that are significantly altered during infection but are not restored in the
recovery stage. The highlighted pathways include ECM–receptor interaction,
cholesterol metabolism, dilated cardiomyopathy, and hypertrophic cardiomyopathy (Figure c). Similar patterns were found for the
severe COVID-19patients (Figure d). The
proteomic analysis prompted us to look in more detail at the proteins and related pathways
that showed long-term disturbance.
Figure 2
Quantitative proteomic analysis of serum samples from the disease and recovery stages
in moderate or severe patients. (a) Principal component analysis (PCA) showing the
intergroup differences. Individuals in the healthy control group, the moderate or
severe COVID-19 groups, and the recovered moderate or severe COVID-19 groups are
indicated by colored dots in the figure. (b) Venn diagram of significantly changed
proteins (cutoff value of fold change > 2 and fold change < 0.5;
P (t test) < 0.05) in the moderate or severe
COVID-19 group compared to the healthy control group and in the recovered moderate or
recovered severe group compared to the healthy control group. Fold change indicates
the protein level of the moderate or severe group to the healthy group. (c, d) Heatmap
of proteomic changes in healthy controls, moderate patients, and recovered moderate
patients (c) and in healthy controls, severe patients, and recovered severe patients
(d). The color bar from red to blue represents the fold change from increasing to
decreasing of all proteins identified in each group. Hierarchical clustering shows a
clear group differentiation according to similarity. Intensity profiles and selected
enriched KEGG pathways are indicated for the marked clusters. The color bar represents
z-score change from −1 to 1.
Quantitative proteomic analysis of serum samples from the disease and recovery stages
in moderate or severe patients. (a) Principal component analysis (PCA) showing the
intergroup differences. Individuals in the healthy control group, the moderate or
severe COVID-19 groups, and the recovered moderate or severe COVID-19 groups are
indicated by colored dots in the figure. (b) Venn diagram of significantly changed
proteins (cutoff value of fold change > 2 and fold change < 0.5;
P (t test) < 0.05) in the moderate or severe
COVID-19 group compared to the healthy control group and in the recovered moderate or
recovered severe group compared to the healthy control group. Fold change indicates
the protein level of the moderate or severe group to the healthy group. (c, d) Heatmap
of proteomic changes in healthy controls, moderate patients, and recovered moderate
patients (c) and in healthy controls, severe patients, and recovered severe patients
(d). The color bar from red to blue represents the fold change from increasing to
decreasing of all proteins identified in each group. Hierarchical clustering shows a
clear group differentiation according to similarity. Intensity profiles and selected
enriched KEGG pathways are indicated for the marked clusters. The color bar represents
z-score change from −1 to 1.
Long-Term Disruption of Proteins Related to Cholesterol Metabolism and Myocardium
Function in COVID-19 Patients in the Disease and Recovery Stages
We found that proteins that changed significantly during infection and then remained
changed in the recovery stage were enriched in functions related to blood coagulation,
fibrin clot formation, cholesterol metabolism, and myocardium. Consistent with our
proteomic data, thrombotic complications have been reported in COVID-19patients.[12] We found that disturbed blood coagulation might be a long-lasting problem
for patients (Figure S2). More intriguingly, we found that proteins in subclusters 3 and 4
from moderate COVID-19patients were enriched in functions related to the cholesterol
metabolic process, cholesterol transport and cholesterol esterification. Specifically, the
level of APOL1, APOA2, and SERPINA12 were altered both in disease and recovery stages
(Figure a,c).
Proteins in subclusters 3 and 4 from severe COVID-19patients showed similar results, with
APOA5, APOL1, APOA2, and CSE1 identified (Figure b,d). Apolipoprotein A-II (ApoA2) and Apolipoprotein L1 (ApoL1) identified in
this study are components of high-density lipoproteins (HDLs), which are important for
cholesterol reverse transport. The disturbance of serum HDLs may suggest defect in
cholesterol reverse transport and cholesterol functions. HDLs also display major
pleiotropic functions that could play a pivotal role in acute inflammatory conditions.
Evidence suggests that HDLs are globally protective for the endothelial layer. Both HDL
composition and functionality are profoundly modified under pathological conditions. One
study showed that HDLs from COVID-19patients were less protective for endothelial cells
stimulated with TNFα than HDLs from healthy subjects.[13] The
proteins identified may be potential indicators of a prolonged cholesterol metabolism
disorder. From the clinical data obtained from patient blood samples (Figure a), we found that the serum total cholesterol level and
the low-density lipoprotein level were increased at disease onset and did not fully return
to the original values even at the 6 month follow-up, both in moderate and severe COVID-19patients (Figure e and Figure S3). Notably, this phenotype was not found in patients with influenza
A or H7N9infection. According to the total cholesterol level of 600 COVID-19patients,
1177 influenza A-infectedpatients and 522 H7N9-infectedpatients, the serum total
cholesterol showed a tendency of increase over the disease onset only for COVID-19patients (Figure f–h). It is worth noting
that the serum total cholesterol level did not exceed the generally recognized normal
range. Our study suggests that control of serum cholesterol needs to be taken into account
when considering treatments to improve the prognosis.Long-term disturbance of pathways related to cholesterol metabolism in COVID-19patients, extending from the disease stage to the recovery stage. (a, b) Interaction
diagrams of the cholesterol metabolic process, regulation of cholesterol transport,
and regulation of cholesterol esterification when moderate patients (a, top) or severe
patients (b, top) are compared to healthy controls and when recovered moderate
patients (a, bottom) or recovered severe patients (b, bottom) are compared to healthy
controls. Proteins selected for analysis meet the conditions of fold change > 2 and
fold change < 0.5; P (t test) < 0.05. The
significance of the pathways represented by −log(P value)
(Fisher’s exact test) is shown by the color scale at the bottom of panel (b),
with dark blue representing the highest significance. The color bar from orange to
green represents the fold change of the protein level from increasing to decreasing.
Fold change indicates the protein level of the moderate or severe group to the healthy
group. (c, d) Box-and-whisker plots showing the relative intensities of the indicated
proteins in moderate (c) or severe (d) COVID-19patients. These proteins are potential
markers for long-term disorders of cholesterol metabolism. One-way ANOVA was used to
analyze the data. For the healthy group, n = 10; for the moderate
COVID-19 group, n = 4; for the recovered moderate COVID-19 group,
n = 4; for the severe COVID-19 group, n = 6; for
the recovered severe COVID-19 group, n = 6. Data are presented as
median, upper and lower quartile, and min and max. (e) Long-term monitoring of total
serum cholesterol levels in COVID-19patients throughout the disease and recovery
stages. (f–h) Comparison of serum total cholesterol levels of COVID-19patients
(f), influenza A patients (g), and avian influenza H7N9patients (h) in days since
disease onset. The orange lines indicate ICU admission; the blue lines indicate no ICU
admission.We also observed long-term disturbances in the levels of proteins related to myocardium
function. Most of the related proteins were downregulated in the COVID-19 disease stage
and the downregulation persisted in the recovery stage both in moderate (Figure a) and severe
COVID-19patients (Figure b). Myocardial
infarction and pressure overload induced a pathologic response of the myocardium termed
cardiac remodeling. The cardiac myocyte growth (cardiac hypertrophy) and the deposition of
the extracellular matrix (cardiac fibrosis) are hallmarks. Cardiac remodeling involves
intense intercellular communication, wherein secreted factors such as proteases, protease
inhibitors, cytokines, and chemokines serve a central role. Among identified proteins are
TTN, MYH7, and NEXN, which function in the sarcomere, and COL14A1, COL2A1, ITGA7, MMP,
THBS4, LGALS3, and LOX, which function in cell migration and cell-to-matrix adhesions
(Figure c,d). It is worth mentioning that the
levels of myocardial enzymes detected by clinical blood tests (Figure
a) did not show a clear indication of cardiomyopathy in
moderate or severe cases (Figure S4). Thus, the serum protein markers identified in this study may
provide an early warning of cardiomyopathy before the severe heart function impairment
detected by a regular blood test.Long-term disturbance of pathways related to the myocardium function in COVID-19patients, extending from the disease stage to the recovery stage. (a, b) Interaction
diagrams of the regulation of cardiac muscle hypertrophy, cardiac muscle tissue
development, cardiocyte differentiation, and cardiovascular system development when
moderate patients (a, left) or severe patients (b, left) are compared to healthy
controls and when recovered moderate patients (a, right) or severe recovered patients
(b, right) are compared to healthy controls. Proteins selected for analysis meet the
conditions of fold change > 2 and fold change < 0.5; P
(t test) < 0.05. The significance of the pathways represented by
−log(P value) (Fisher’s exact test) is indicated by
the color scale (b, bottom), with dark blue representing the highest significance.
Fold change indicates the protein level of the moderate or severe group to the healthy
group. (d) Box-and-whisker plots showing the relative intensities of the indicated
proteins in moderate (c) or severe (d) COVID-19patients. These proteins are potential
markers for long-term disorders of the myocardium. One-way ANOVA was used to analyze
the data. For the healthy group, n = 10; for the moderate COVID-19
group, n = 4; for the recovered moderate COVID-19 group,
n = 4; for the severe COVID-19 group, n = 6; for
the recovered severe COVID-19 group, n = 6. Data are presented as
median, upper and lower quartile, and min and max.
Further Impairment of Cholesterol Metabolism and Myocardium Function in Severe
COVID-19 Patients
The degree of severity of COVID-19 may be an important factor in poor prognosis. Thus, we
aimed to focus on the specific features of severe patients by comparing them to moderate
ones. We did quantitative proteomic analysis of the cohort containing 10 healthy controls,
10 moderate COVID-19patients, and 6 severe patients (Figure b). Proteins that are significantly up- or downregulated when
comparing severe to moderate patients fall into four specific subclusters (Figure a). Specifically,
proteins with functions related to blood coagulation, cholesterol metabolism, and
cardiovascular system development, which showed prolonged disruption in the previous
analysis, were also enriched in subclusters 3 and 4 (Figure b,c and Figure S5). The levels of the identified proteins all showed significant
changes to various degrees in severe patients compared to both moderate patients and
healthy controls (Figure d). Among these
proteins are NEXN, ITGA7, HSP90AB1, APOA5, CES1, and CFL1, which we identified above to be
significantly changed in severe patients both in the disease and recovery stages. This
result provides a good hint that severe patients may have more serious sequelae due to
prolonged disturbance of the highlighted pathways.Quantitative proteomic analysis reveals disturbances in pathways related to
cholesterol metabolism and myocardium function when severe COVID-19patients are
compared to moderate patients. (a) Proteomic changes in healthy controls, moderate
patients, and severe patients. Heatmap of 157 proteins that are significantly changed
when severe COVID-19patients are compared to moderate COVID-19patients. The color
bar from red to blue represents the fold change from increasing to decreasing of all
proteins identified in each group. Hierarchical clustering shows a clear group
differentiation according to similarity. Intensity profiles and selected enriched KEGG
pathways are indicated for the marked clusters. The color bar represents z-score
change from −1 to 1. (b, c) Interaction diagrams of regulation of
cardiovascular system development (b) or the cholesterol metabolic process, regulation
of cholesterol transport, and regulation of cholesterol esterification (c) when severe
COVID-19patients are compared to moderate COVID-19patients. Proteins selected for
analysis meet the condition of fold change > 2 and fold change < 0.5;
P (t test) < 0.05. The significance of the
pathways represented by −log(P value) (Fisher’s exact
test) is indicated by the color scale, with dark blue representing the most
significant difference. Fold change indicates the protein level of the severe to
moderate group. (d) Box-and-whisker plots showing the relative intensities of proteins
related to cholesterol metabolism or myocardium function in healthy controls, moderate
COVID-19patients, or severe COVID-19patients. One-way ANOVA was used to analyze the
data. For the healthy group, n = 10; for the moderate COVID-19 group,
n = 4; for the severe COVID-19 group, n = 6. Data
are presented as median, upper and lower quartile, and min and max.The significance of some of the differentially expressed proteins identified in this
study was independently validated by parallel reaction monitoring (PRM), a targeted
quantitative MS method well suited for determining protein amounts in complex
mixtures[14,15]
(Figure S6). The validation results indicate that the intensities of APOA2,
FGA, SOD2, APOL1, and CFL1 are significantly changed, consistent with the trends shown in
Figures –5.
Discussion
In this study, we applied mass spectrometry (MS)-based proteomics, which quickly delivered
substantial amounts of biological information from serum samples. Our approach provided a
molecular atlas of serum proteins in an untargeted fashion, which does not depend on prior
knowledge of the disease. This unbiased proteomic analysis, closely integrated with clinical
symptoms, allows us to more accurately dissect the data. We obtained evidence for prolonged
disturbances of pathways related to cholesterol metabolism and myocardium function, which
provide insights into the mechanisms underlying the sequelae of COVID-19. This discovery
enables us to identify clinically applicable markers and therapeutic targets and to design
preventive strategies.We found that cholesterol metabolism pathways were significantly affected in COVID-19patients, both in moderate and severe cases, and were not restored during the recovery
stage. Cholesterol is an abundant lipid in the membranes of eukaryotic cells. Cholesterol
synthesis, uptake, or transport is fundamental for infection by viruses such as the
flavivirus HCV or the retrovirus HIV to facilitate viral entry, formation of replicative
complexes, and assembly or egress of new virus particles.[16−18] There are also reports of the role of cholesterol in the stability and
infectivity of influenza A.[19,20] Likewise, it is possible that cholesterol supports viral infection
through lipid subdomains on the plasma membrane that can harbor angiotensin-converting
enzyme 2 (ACE2), which acts as the receptor for the S-protein of
SARS-CoV-2.[21,22]
Cellular cholesterol synthesis or transport may also accompany the other stage of the life
cycle of SARS-CoV-2.[23] Pharmacological inhibition of cholesterol
homeostasis reduces the replication of SARS-CoV-2.[22] Our proteomic data
as well as the clinical data revealed that SARS-CoV-2 infection causes long-term
disturbances, lasting for at least 6 months, of cholesterol metabolism. Thus, more attention
should be given to monitoring cholesterol metabolism in COVID-19patients, and
cholesterol-lowering therapies might be useful for good prognosis.We found that proteins enriched in regulation of cardiac muscle hypertrophy, cardiac muscle
tissue development, cardiocyte differentiation, and cardiovascular system development were
significantly changed in COVID-19 disease and were not restored during recovery. Our finding
is in accordance with multiple reports of major cardiac complications in COVID-19patients
and of COVID-19-related myocarditis. Moreover, a recent study observed that the majority of
recovered patients presented impaired cardiac function, which indicates that COVID-19
results in long-term heart sequelae.[24] We speculate that there are
potentially various degrees of COVID-19-associated cardiac function impairments due to
single factors or combinations of multiple factors, including direct viral myocardial
damage, hypoxia, hypotension, cholesterol metabolism disorder, or enhanced inflammatory
status, and they last for a long time.[25,26] Some impairments are too preliminary to be detected from
clinical indicators in patients’ serum but could be identified by proteomic analysis,
which is a much more sensitive approach. However, so far, cardiac dysfunction only comes to
attention when there are organ-level impairments, which are relatively late for treatment.
We identified potential protein signatures, the level of which changed in advance of regular
serum cardiomyopathy indicators, and thus could be used as a warning of the long-term
disorder. Among the candidate protein markers are MYH7 and TTN, the transcriptional level of
which is significantly changed in SARS-CoV-2-infected iPSC-derived cardiac cells.[27] This may provide mechanistic support of our findings. The potential markers
identified in this study need further validation in large cohorts and could further be used
as clinical indicators to prevent cardiomyopathy in COVID-19patients, thus improving
prognosis.In addition, when severe patients were compared to moderate patients, the cholesterol
metabolism and cardiomyopathy pathways were more severely affected. Thus, we speculate that
there is a correlation between these pathways and disease severity. This prediction is
supported by a systematic review of 54 published articles that observed that cholesterol
levels are apparently related to COVID-19 severity.[28] Furthermore, it has
been reported that elevated levels of high-sensitivity troponin-I or natriuretic peptides,
which are biomarkers of cardiac damage or dysfunction, are the strongest predictor of
mortality in hospitalized patients.[29−32] In our study, pathways related to cholesterol
metabolism and myocardium function show prolonged disturbance and are also related to
disease severity. Thus, we reasoned that monitoring and prevention of these disorders are
required to improve the prognosis of severely affected COVID-19patients.Serum protein analysis has been published by multiple groups including unbiased proteomic
studies on COVID-19patient cohort, proteomic analysis of a certain class of proteins
enriched from serum and targeted group of proteins by an antibody array or aptamer. They
also reported significantly altered pathways as host-cell immune response, coagulation, and
cholesterol metabolism pathways. However, the individual proteins involved in these pathways
are not exactly the same. We compared all relevant proteins quantified in other studies with
data in our paper and found that most of the published differentially expressed proteins
show a similar trend in our studies, for example, SAA1, SAA2, CRP, C1R, SERPINA3, APOC1,
APOC2, APOC3, APOA4, APOE, GRN, ORM1, and SAP.[4−6,13,33,34] However, there
are other proteins that are not significantly changed or are not detected in our study.
Notably, FETUB and ApoA2 showed an increased level in COVID-19 compared to healthy controls,
while other studies showed a decreased level in severe COVID-19patients. This is possibly
due to the regional difference and the different races of the patient cohort, as well as the
exact time point from disease onset. The different detection method might be another
explanation.Limitations in this study lie in that changes in the serum level of proteins could not be
simply corresponded to the tissue level of the proteins and the exact mechanisms of these
proteins. The serum proteomic data only provided information on enriched pathways composed
of differentially expressed proteins in serum based on the available database. Furthermore,
the quantitation of these potential markers in tissue will provide more hints on their
functions. To go another step further, more rigorous studies tracing the origin of these
proteins, how these proteins are released, and how these proteins functions in disease
conditions by cells or a mouse model are required in the future mechanism studies.
Conclusions
In conclusion, our study closely integrates quantitative proteomics and regular clinical
serum tests in a well-established disease cohort throughout the disease and recovery stages.
We highlight long-term disturbances in pathways related to cholesterol metabolism and
cardiopathy, which could not be diagnosed based on regular clinical indicators. We also
provide potential protein markers for these impairments. Our findings may provide evidence
and indications to guide clinical treatment to improve prognosis.