| Literature DB >> 32657586 |
Anthony D Whetton1,2,3, George W Preston1,2, Semira Abubeker1,2, Nophar Geifman4.
Abstract
The emergence of novel coronavirus disease 2019 (COVID-19), caused by the SARS-CoV-2 coronavirus, has necessitated the urgent development of new diagnostic and therapeutic strategies. Rapid research and development, on an international scale, has already generated assays for detecting SARS-CoV-2 RNA and host immunoglobulins. However, the complexities of COVID-19 are such that fuller definitions of patient status, trajectory, sequelae, and responses to therapy are now required. There is accumulating evidence-from studies of both COVID-19 and the related disease SARS-that protein biomarkers could help to provide this definition. Proteins associated with blood coagulation (D-dimer), cell damage (lactate dehydrogenase), and the inflammatory response (e.g., C-reactive protein) have already been identified as possible predictors of COVID-19 severity or mortality. Proteomics technologies, with their ability to detect many proteins per analysis, have begun to extend these early findings. To be effective, proteomics strategies must include not only methods for comprehensive data acquisition (e.g., using mass spectrometry) but also informatics approaches via which to derive actionable information from large data sets. Here we review applications of proteomics to COVID-19 and SARS and outline how pipelines involving technologies such as artificial intelligence could be of value for research on these diseases.Entities:
Keywords: COVID-19; SARS-CoV-2; artificial intelligence; assay; biomarker; diagnosis; marker; prognosis; proteomics
Mesh:
Substances:
Year: 2020 PMID: 32657586 PMCID: PMC7384384 DOI: 10.1021/acs.jproteome.0c00326
Source DB: PubMed Journal: J Proteome Res ISSN: 1535-3893 Impact factor: 4.466
Figure 1Protein biomarkers required in COVID-19. Proteins are the main stimulants of cellular mechanisms and are responsible for cellular homeostasis. The disruption of these cellular mechanisms is generally associated with many disease phenotypes.[8] Therefore, in a complex multiorgan disease such as is associated with SARS-CoV-2, establishing the underlying proteins involved in the various stages of this disease will pave a way to the discovery of new biomarkers in the diagnosis and prognosis of COVID-19 and its complications. Markers for prognosis, diagnosis, and chronic effects are among those required. The second wave of disease depicted here infers that monitoring in the population for (re)infection would require biomarkers of sufficient specificity. In this respect, a generic inflammatory biomarker response would not be sufficiently specific. The measurement of IgG and IgM antibodies directed against the virus do offer opportunities for screening; however, these generally do not appear for several days following the onset of symptoms and are not always observed during the usual screening window.
Selected COVID-19 and SARS Studies in Which Proteins/Peptides in Biospecimens Were Analyzeda
| proteins/peptides
displaying differential abundance | |||||
|---|---|---|---|---|---|
| study description and reference | methods | comparison to which changes refer | proteins/peptides with increased abundance | proteins/peptides with decreased abundance | comments/significance |
| Single-center prospective study of
COVID-19 patients (Berlin, Germany)[ | Bottom-up serum proteomics. Automated sample preparation and LC-MS/MS with SWATH-MS acquisition. Targeted data analysis. | Correlation of protein abundance with disease severity (abundance increasing or decreasing with increasing severity) | Actins beta and gamma-1 (group), Alpha-1B-glycoprotein, C-reactive protein*, Complement component C8 alpha chain, Complement factors B*, H, and I, Complement subcomponents C1r and C1s, Fibrinogen alpha, beta and gamma chains, Galectin-3-binding protein, Haptoglobin*, Interalpha-trypsin inhibitor heavy chains H3* and H4*, Leucine-rich alpha-2-glycoprotein*, Lipopolysaccharide-binding protein*, Monocyte differentiation antigen CD14, Protein Z-dependent protease inhibitor*, Serum amyloid A-1 protein*, Serum amyloid A-1 and A-2 proteins group | Albumin*, Apolipoprotein A-1, Apolipoprotein C-1*, Gelsolin, Transferrin* | The authors highlight differences associated
with the complement
system, blood coagulation, and inflammation. Twenty-seven correlations
(those identified opposite) were reproduced in a separate group consisting
of COVID-19 patients and healthy individuals. Some of the results
of another study[ |
| Multiomics study of 65 COVID-19 patients (28 with severe disease)
and controls (Taizhou, China)[ | Untargeted bottom-up serum proteomics with labeling (Thermo Fisher Scientific TMTpro 16plex reagents). Fractionation, then LC-MS/MS with data-dependent acquisition. Database searching. | 21 patients with severe disease compared with 37 patients with nonsevere disease | Alpha-1-acid glycoproteins 1 and 2,
Alpha-1-antichymotrypsin,
Carboxypeptidase N catalytic chain, Coagulation factor V, Complement
components C2, C6, and C9, Hyaluronan-binding protein 2, Mannosyl-oligosaccharide
1,2-alpha-mannosidase IA, Out at first protein homologue, Plasma protease
C1 inhibitor, Plastin-2, Serum amyloid A-2 and A-4 proteins, von Willebrand
factor, and nine proteins also observed by Messner et al.[ | The largest observed differences (increases)
were for serum
amyloid A proteins, which are participants in the acute-phase response.
An association between serum amyloid A-1 protein abundance and disease
severity was also observed by Messner et al.[ | |
| Study of 6 COVID-19 patients
(3 with severe disease), plus
recovered individuals and healthy controls (Beijing, China)[ | Untargeted, bottom-up urine proteomics without labeling. LC-MS/MS with data-dependent acquisition. Database searching. Pathway analysis. | Two separate comparisons: 3 COVID-19 patients with mild disease compared with 32 healthy controls; 3 COVID-19 patients with severe disease compared with 32 healthy controls. | 86 proteins (patients with mild disease compared with healthy individuals) or 83 proteins (patients with severe disease compared with healthy individuals) | 100 proteins (patients with mild disease compared with healthy individuals) or 172 proteins (patients with severe disease compared with healthy individuals) | This study is notable for
its focus on urinary proteins. However,
the number of patients was very low, and the age ranges of the patients
and controls were substantially different. For full lists of proteins,
readers are directed to the supplementary tables accompanying Li and
coauthors’ preprint.[ |
| Study of 15 SARS-CoV-2-infected individuals and 15 uninfected
controls[ | Bottom-up proteomics of material from naso-oropharyngeal swabs. Real-time RT-PCR to determine presence/absence of SARS-CoV-2, then proteomic sample preparation and LC-MS/MS with SONAR acquisition. Protein–protein interaction analysis. | Infected individuals compared with uninfected individuals | Azurocidin (2.0), Cathepsin G (3.7), Ceruloplasmin (5.1), Gamma-enolase (2.1), Gap junction delta-2 protein (3.2), Hemopexin (2.7), Histone H3.1t (3.2), Immunoglobulin heavy constant alpha 1 (2.0), Immunoglobulin heavy constant mu (3.4), Immunoglobulin kappa light chain (2.8), Myeloblastin (29), Myeloperoxidase (3.7), Neutrophil elastase (2.9), Transcobalamin-1 (2.5), Transketolase (2.1), Vitronectin (2.7) | Tubulin alpha-1C chain (0.41) | The proteins displaying differential abundance were found to be enriched for associations with neutrophil activity. |
| Retrospective, single-center study of 99 COVID-19 patients
(Wuhan, China)[ | Analyses of unspecified blood products using unspecified methods. | Measured concentrations compared with reference values. The named proteins are those for which the abundance was increased/decreased in >50% of cases. | C-reactive protein, Ferritin, Interleukin-6, Lactate dehydrogenase | Albumin, Hemoglobin | Abundance of alanine aminotransferase, aspartate aminotransferase, D-dimer, myoglobin, or procalcitonin was also increased, but in ≤50% of cases. Abundance of creatine kinase was increased in some cases and decreased in others. |
| Study of a cohort
of 41 COVID-19 patients, 13 of whom were
admitted to an ICU (Wuhan China)[ | Blood for cytokine analyses collected an average of 4 days after hospital admission. Analysis of plasma using human cytokine 27-plex immunoassay (Bio-Rad). Other methods not specified. | ICU patients compared with non-ICU patients | Alanine aminotransferase (1.8), D-dimer (4.8), Granulocyte colony-stimulating factor, Interleukin-2, Interleukin-7, Interleukin-10, Lactate dehydrogenase (1.4), Macrophage inflammatory protein-1 alpha, Monocyte chemoattractant protein-1, Tumour necrosis factor alpha | Albumin (0.80) | The results of the cytokine analysis
have been interpreted
as evidence of a cytokine storm syndrome.[ |
| Single-center
study of 12 COVID-19 patients, 6
of whom developed ARDS (Shenzen, China)[ | Quantification of angiotensin II in plasma using an ELISA (Cloud-Clone) | 12 COVID-19 patients compared with 8 healthy individuals | Angiotensin II | This result appears to reflect perturbation of the renin-angiotensin system by SARS-CoV-2. The authors also observed that the abundance of angiotensin II was correlated with viral load. | |
| Blood collected at the time of hospital admission or soon after. Analysis of unspecified blood products using unspecified methods. | Correlation of protein abundance with lung injury (Murray score) | C-reactive protein, Lactate dehydrogenase | Albumin | These results were consistent with the findings of various other COVID-19 studies. | |
| Retrospective, multicenter
study of 150 COVID-19 patients,
68 of whom died (Wuhan, China)[ | Analyses done an average of 10–12 days after the onset of symptoms. Analysis of unspecified blood products using unspecified methods. | Nonsurvivors compared with patients who were discharged | C-reactive protein (3.7), Cardiac tropinin (8.7), Ferritin (2.1), Interleukin-6 (1.7), Myoglobin (3.3) | Albumin (0.88) | Cardiac troponin had a markedly increased abundance
in the
blood of nonsurvivors. (See also the meta-analysis by Lippi et al.[ |
| Study of a cohort of 40 COVID-19 patients, 20 of
whom were
admitted to an ICU (Paris, France)[ | Blood collection on hospital admission. Analysis of plasma using various immunoassays and coagulation tests. | ICU patients compared with non-ICU patients | Angiopoietin-2 (1.5), C-reactive protein (1.9), D-dimer (1.5), Fibrinogen (1.2), Soluble E-selectin (1.4) | The authors suggest that angiopoietin-2 (a participant in endothelial
cell activation[ | |
| Study of 138 COVID-19 patients, 36 of whom were
admitted to
an ICU (Wuhan, China)[ | Analysis of unspecified blood products using unspecified methods | ICU patients compared with non-ICU patients. For procalcitonin, the increase was in the proportion of patients having a concentration greater than or equal to a threshold value. | Alanine aminotransferase (1.5), Aspartate aminotransferase (1.8), Creatine kinase-MB (1.4), D-dimer (2.5), Hypersensitive troponin I (2.2), Lactate dehydrogenase (2.1), Procalcitonin | Among nonsurvivors, the abundance of D-dimer was observed to increase over the course of the disease. The levels of two different markers of cardiac injury were both increased. Comparisons are somewhat complicated by the fact that ICU patients tended to be older and have comorbidities. | |
| Meta-analysis of COVID-19
studies, focusing on cardiac troponins[ | Four-study meta-analysis including 341 COVID-19 patients (123 severe cases and 218 nonsevere cases) | Severe cases compared with nonsevere cases | Cardiac troponin I | The authors suggest that markers such as cardiac troponin I could be quantified at the time of hospital admission, monitored thereafter, and used to predict disease worsening associated with myocardial damage. | |
| Meta-analysis of COVID-19 studies focusing on procalcitonin[ | Four-study meta-analysis. Analysis of odds ratios. | Patients with severe disease compared with patients with nonsevere disease, testing for an association between increased procalcitonin concentration and the odds of severe disease. | Procalcitonin | The heterogeneity of the included results was moderate. The authors suggest that an increased concentration of procalcitonin may be an indicator of bacterial coinfection in COVID-19 patients. | |
| Meta-analysis of COVID-19 studies focusing on interleukin-6[ | Nine-study meta-analysis. Analysis of standardized mean differences. Sensitivity analysis. | Patients with severe disease compared with patients with nonsevere disease. | Interleukin-6 | The heterogeneity of the included results was high. See also
the reports by Huang et al. (no significant difference between ICU
and non-ICU patients)[ | |
| Study of 39 SARS patients
and 39 controls[ | Blood collected 3–7 days after onset of fever. Array-based analysis of serum proteins. ProteinChip arrays (Ciphergen Biosystems) with mass spectrometric detection. | SARS patients compared with non-SARS patients with SARS-like symptoms. As well as differing between groups, abundances of the named proteins were also correlated with clinical measures such as the concentration of lactate dehydrogenase. | Beta-thromboglobulin, Complement C3c, Immunoglobulin heavy constant gamma 1, Immunoglobulin kappa light chain plus ≤48 unidentified proteins | Fibrinogen alpha chain, Platelet factor 4 plus ≤53 unidentified proteins | Beta-thromboglobulin and platelet factor 4
are specific markers
of platelet activity.[ |
| Study of 13 SARS patients
and various controls (Hong Kong)[ | Blood collected at the time of hospital admission. Determination of α1-antitrypsin in serum using nephelometry. Untargeted analysis of serum proteins using 2-D gel electrophoresis and either silver staining or in-gel digestion and mass spectrometry. | SARS patients compared with patients with other types of pneumonia | Total α1-antitrypsin | Untargeted analyses revealed that the composition of the SARS patients’ α1-antitrypsin was qualitatively altered. The authors suggest that degradation of this protein to a truncated form may contribute to the development of ARDS in SARS patients. | |
Abbreviations not defined in the main text: 2-D, two-dimensional; ARDS, acute respiratory distress syndrome; ICU, intensive care unit.
Named proteins are those for which a statistically significant difference or correlation was observed. Where necessary, UniProt identifiers were converted to names via the Swiss-Prot human proteome (downloaded from https://www.uniprot.org on June 24, 2020). Where possible, a relative abundance (rounded to two significant figures) is given in parentheses after the name of the protein. A value of 2.0, for example, would mean that the protein was twice as abundant in one group as in another. Values were either extracted directly from articles or calculated using reported mean or median concentrations. Where Messner et al.[39] reproduced one of Shen and coauthors’[38] results, this is indicated by an asterisk (*).
Figure 2Network analysis of the interactions between cytokines, cells, and diseases as captured from PubMed abstracts. Associations captured from MeSH terms in SARS- and coronavirus-related PubMed abstracts. (A) Joint network with SARS on the left and coronavirus on the right. (B) Subnetwork of first-neighbors of the disease node “common cold”. (C) Subnetwork of first-neighbors of the disease node “gastroenteritis”. (D) Subnetwork of first-neighbors of the disease node “pneumonia”. Yellow nodes represent cytokines, blue nodes represent diseases, and green nodes represent cell types. Purple edges represent associations derived from coronavirus-related PubMed abstracts, while pink edges represents those derived from SARS-related abstracts and gray edges represent those connections found in both corpuses. The size of the nodes corresponds to the count of that entity within both corpuses.
Figure 3Integrated process for the rapid development of algorithms or markers of trajectory in infectious diseases; see also Figure . In 2010, 8% (524 million) of the world’s population was reported to be aged 65 or older. This figure is expected to triple by 2050 to about 1.5 billion. The regenerative or host defense capabilities for protection against SARS-CoV-2 infection diminish with age. This has led to the use of algorithms consisting of multiple markers, and factors like comorbidities, age, or sex can be used to develop “score”-based indicators of risk, prognosis, or trajectory. This is one reason for the rapid deployment of the pipeline described here inclusive of artificial intelligence.