| Literature DB >> 35953823 |
Jinlong Gao1,2, Jiale He1,2, Fangfei Zhang1,2, Qi Xiao1,2, Xue Cai1,2, Xiao Yi1,2, Siqi Zheng1,2, Ying Zhang3, Donglian Wang3, Guangjun Zhu3, Jing Wang3, Bo Shen3, Markus Ralser4,5, Tiannan Guo6,7, Yi Zhu8,9.
Abstract
BACKGROUND: Classification of disease severity is crucial for the management of COVID-19. Several studies have shown that individual proteins can be used to classify the severity of COVID-19. Here, we aimed to investigate whether integrating four types of protein context data, namely, protein complexes, stoichiometric ratios, pathways and network degrees will improve the severity classification of COVID-19.Entities:
Keywords: COVID-19; Protein complex; Proteomics; Severe cases; Stoichiometric ratio
Year: 2022 PMID: 35953823 PMCID: PMC9366758 DOI: 10.1186/s12014-022-09370-0
Source DB: PubMed Journal: Clin Proteomics ISSN: 1542-6416 Impact factor: 5.000
Fig. 1Study overview. In general, 331 proteins were identified from 54 serum samples of COVID-19 patients. Subsequently, five kinds of 868 features were derived from these proteins. The top 25 differential features were selected for the machine learning model, which was further validated in two test datasets
Information of 40 patients in the training set and all total patients
| Study total (40) | Study non-severe (25) | Study severe (15) | All total (144) | All non-severe (108) | All severe (36) | |
|---|---|---|---|---|---|---|
| Sex (male/female) | 17/23 | 8/17 | 9/6 | 77/67 | 57/51 | 20/16 |
| Age | 51.1 ± 17.3 | 48 ± 18.9 | 56.3 ± 13.1 | 47.6 ± 14.6 | 45 ± 14.2 | 55.5 ± 12.8 |
| BMI | 23.9 ± 3.3 | 22.7 ± 3.2 | 26.1 ± 2.1 | 24.2 ± 3.1 | 23.9 ± 3.2 | 25.5 ± 2.3 |
| Onset admission | 7.5 ± 4.4 | 7.5 ± 4.7 | 7.4 ± 4.1 | 7 ± 4.2 | 6.8 ± 3.9 | 7.9 ± 4.9 |
| Admission discharge | 27.5 ± 8.9 | 27.6 ± 10 | 27.2 ± 7 | 21.6 ± 9.4 | 20.5 ± 9.7 | 24.7 ± 7.8 |
| Symptoms (%) | ||||||
| Fever | 27 (67) | 12 (48) | 15 (100) | 104 (72.2) | 70 (64.8) | 34 (94.4) |
| Pharyngalgia | 6 (15) | 5 (20) | 1 (6.7) | 17 (11.8) | 15(13.9) | 2 (5.6) |
| Cough | 18 (45) | 12 (48) | 6 (40) | 65 (45.1) | 47(43.5) | 18 (50) |
| Expectoration | 10 (25) | 7 (28) | 3 (20) | 26 (18.1) | 19(17.6) | 7 (19.4) |
| Fatigue | 2 (5) | 1 (4) | 1 (6.7) | 16 (11.1) | 10 (9.3) | 6 (16.7) |
| Headache | 4 (10) | 2 (8) | 2 (13.3) | 16 (11.1) | 9 (8.3) | 7 (19.4) |
| Diarrhea | 1 (2.5) | 0 (0) | 1 (6.7) | 6 (4.2) | 3 (2.8) | 3 (8.3) |
| Chest tightness | 4 (10) | 2 (8) | 2 (13.3) | 11 (7.6) | 7 (6.5) | 4 (11.1) |
| Comorbidity (%) | ||||||
| Hypertension | 8 (20) | 4 (16) | 4 (26.7) | 22 (15.3) | 14 (13) | 8 (22.2) |
| Diabetes | 6 (15) | 2 (8) | 4 (26.7) | 14 (9.7) | 9 (8.3) | 5 (13.9) |
| Hyperlipidemia | 2 (5) | 1 (4) | 1 (6.7) | 3 (2.1) | 2 (1.9) | 1 (2.8) |
| Cardiovascular disease | 1 (2.5) | 0 (0) | 1 (6.7) | 3 (2.1) | 1 (0.9) | 2 (5.6) |
| Kidney disease | 1 (2.5) | 0 (0) | 1 (6.7) | 2 (1.4) | 1 (0.9) | 1 (2.8) |
| Digestive system | 3 (7.5) | 2 (8) | 1 (6.7) | 7 (4.9) | 6 (5.6) | 1 (2.8) |
**p < 0.01, study severe vs study non severe
#p < 0.05 and ##p < 0.01, all severe vs all non-severe
Summary of the dataset used for this study
| Patients (non-severe/severe) | Samples | MS Method | Proteins | Missing (%) | |
|---|---|---|---|---|---|
| Training | 40 (25/15) | 54 | 20 min SWATH | 331 | 21.7 |
| Test 1 | 21 (6/15) | 21 | TMTpro 16plex | 894 | 35.5 |
| Test 2 | 31 (12/19) | 102 | 5 min SWATH | 229 | 12.2 |
Fig. 2The selected features for classifying COVID-19. A All Identified features ranked by log p value; B The top 25 features identified; C The heatmap of top 25 features in the training set
Fig. 3The performance of the machine learning model. A The PCA map of the training set and test sets using all features; B The comparison of AUC between the model with five types of features and the model with only one type of feature in the training set, test set 1, and the test set 2
Fig. 4The biological interpretation of the top 25 features. MAC, membrane attack complex. Red border, upregulation; green boarder, downregulation