| Literature DB >> 36237606 |
Shubham Misra1, Praveen Singh2, Manabesh Nath1, Divya Bhalla1, Shantanu Sengupta2, Amit Kumar1, Awadh K Pandit1, Praveen Aggarwal3, Achal K Srivastava1, Dheeraj Mohania4, Kameshwar Prasad1,5, Deepti Vibha1.
Abstract
Background and purposes: Recent developments in high-throughput proteomic approach have shown the potential to discover biomarkers for diagnosing acute stroke and to elucidate the pathomechanisms specific to different stroke subtypes. We aimed to determine blood-based protein biomarkers to diagnose total stroke (IS+ICH) from healthy controls, ischemic stroke (IS) from healthy controls, and intracerebral hemorrhage (ICH) from healthy control subjects within 24 h using a discovery-based SWATH-MS proteomic approach.Entities:
Keywords: SWATH-MS; blood biomarkers; intracerebral hemorrhage; ischemic stroke; proteomics; stroke
Year: 2022 PMID: 36237606 PMCID: PMC9552908 DOI: 10.3389/fneur.2022.989856
Source DB: PubMed Journal: Front Neurol ISSN: 1664-2295 Impact factor: 4.086
Figure 1Study flow diagram.
Baseline characteristics of acute stroke patients and healthy control subjects.
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|
| 1. | Age (years), Mean ± SD and Median (IQR) | 52.85 ± 10.86, | 47.60 ± 9.76, 48 (43–55.5) | 0.12 | 50.22 ± 10.53, 49 (45–59.5) | 40 | 50.20 ± 10.69, 48.5 (44.5–60) | Matched |
| 2. | Male, | 11 (55) | 14 (70) | 0.33 | 25 (62.5) | 40 | 25 (62.5) | |
| 3. | Female, | 9 (45) | 6 (30) | 15 (37.5) | 40 | 15 (37.5) | ||
| 4. | Blood sampling time from onset (in h.), Mean ± SD & Median (IQR) | 12.11 ± 6.23, | 12.46 ± 6.68, 12.58 (6.25–18.62) | 0.86 | 12.28 ± 6.38, 12.58 (6.5–17.75) | - | - | - |
| 5. | Time taken to reach hospital (in hrs.), Mean ± SD & Median (IQR) | 4.21 ± 2.98, | 6.41 ± 6.27, 3.75 (2.08–10.12) | 0.16 | 5.31 ± 4.97, 3.87 (2–5.75) | - | - | - |
| 6. | Ambulance as a mode of transport, | 6 (30) | 4 (20) | 0.53 | 10 (25) | - | - | - |
| 7. | Any surgical procedure, | 2 (10) | 5 (25) | 0.21 | 7 (17.5) | - | - | - |
|
| ||||||||
| 8. | Hypertension, | 8 (40) | 14 (70) | 0.06 | 22 (55) | 40 | 22 (55) | Matched |
| 9. | Diabetes, | 4 (20) | 1 (5) | 0.15 | 5 (12.5) | 40 | 5 (12.5) | |
| 10. | Dyslipidemia, n (%) | 4 (20) | 0 (0) |
| 4 (10) | 40 | 6 (15) | 0.50 |
| 11. | Myocardial Infarction, | 0 | 0 | – | 0 | 40 | 1 (2.5) | 0.31 |
| 12. | Atrial Fibrillation, | 0 | 0 | – | 0 | 40 | 1 (2.5) | 0.31 |
| 13. | Angina Pectoris, | 1 (5%) | 0 | 0.31 | 1 (2.5) | 30 | 1 (3.33) | 0.83 |
| 14. | Migraine, | 0 | 0 | – | 0 | 40 | 3 (7.50) | 0.08 |
| 15. | Current Smoking, | 9 (45) | 10 (50) | 0.75 | 19 (47.5) | 40 | 7 (17.50) |
|
| 17. | Alcohol Intake, | 2 (10) | 6 (30) | 0.11 | 8 (20) | 40 | 11 (27.5) | 0.43 |
| 17. | No exercise, | 18 (90) | 17 (85) | 0.63 | 35 (87.5) | 39 | 11 (28.21) |
|
| 18. | Sedentary lifestyle, | 7 (35) | 7 (35) | 1.00 | 14 (35) | 37 | 6 (16.22) | 0.06 |
| 19. | Low Education, | 13 (65) | 15 (75) | 0.49 | 28 (70) | 39 | 12 (30.77) |
|
| 20. | Low socio-economic status, | 9 (45) | 11 (55) | 0.53 | 20 (50) | 40 | 0 |
|
| 21. | Obesity, | 7 (35) | 10 (50) | 0.34 | 17 (42.5) | 38 | 24 (63.16) | 0.07 |
| 22. | Family history of stroke, | 3 (15) | 1 (5) | 0.29 | 4 (10) | 40 | 2 (5) | 0.39 |
| 23. | Family history of hypertension, | 10 (50) | 6 (30) | 0.20 | 16 (40) | 40 | 12 (30) | 0.44 |
| 24. | Family history of diabetes, | 7 (35) | 3 (15) | 0.14 | 10 (25) | 40 | 16 (40) | 0.15 |
| 25. | Family history of heart attack, | 4 (20) | 3 (15) | 0.68 | 7 (17.5) | 40 | 12 (30) | 0.19 |
|
| ||||||||
| 26. | SBP (mmHg), Mean ± SD & Median (IQR) | 152.7 ± 35.71, | 178 ± 35.29, 176 (150–214) |
| 165.35 ± 37.31, 163 (137–190) | 39 | 141.46 ± 24.55, 134 (121–155) |
|
| 27. | DBP (mmHg), Mean ± SD & Median (IQR) | 87.5 ± 17.27, | 100.10 ± 17.00, 100 (90–110) |
| 93.8 ± 18.08, 90 (82–104) | 39 | 89.95 ± 15.90, 90 (78–98) | 0.32 |
obs, observations; IS, ischemic stroke; ICH, intracerebral hemorrhage; SD, standard deviation; IQR, interquartile range; SBP, systolic blood pressure; DBP, diastolic blood pressure.
Bold values, p < 0.05.
Figure 2(A) Volcano plot depicting the log2 fold change on the x-axis and -log10 p-value on the y-axis for the upregulated and downregulated proteins in 40 total stroke cases compared to 40 healthy controls. The two vertical lines represent the threshold for log2 fold change (>0.58 and < −0.58), and the horizontal line represents the threshold for log10 p-value (>1.3). The red dots on the graph indicate the proteins which are significantly upregulated (log2 fold change>0.58; log10 p-value>1.3), and blue dots indicate the proteins which are significantly downregulated (log2 fold change < −0.58; log10 p-value>1.3). (B) Feature selection using the Boruta random forest depicting important features/proteins to differentiate 40 total stroke and 40 healthy controls. The blue bars on the graph indicate shadow features for minimum, average, and maximum shadow feature values. The red bars on the graph indicate the proteins which were rejected as irrelevant features, yellow bars indicate the proteins which were marked tentative as uncertain features, and green bars indicate the proteins which were marked confirmed as valid features and identified as important proteins after the Boruta random forest feature selection analysis.
List of significantly differentially expressed proteins between total stroke and healthy controls within 24 h of symptom onset using fold change with p-value and Boruta random forest feature selection criteria.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| 1 | Q15848 | Adiponectin (GN |
|
|
|
| 2 | P00450 | Ceruloplasmin (GN |
|
|
|
| 3 | P04433 | Ig kappa chain V-III region VG (Fragment) |
|
| Rejected |
| 4 | P00739 | Haptoglobin-related protein (GN |
|
| Rejected |
| 5 | P20851 | C4b-binding protein beta chain (GN |
|
| Rejected |
| 6 | P04275 | von Willebrand factor (GN |
|
|
|
| 7 | P02750 | Leucine-rich alpha-2-glycoprotein (GN |
|
|
|
| 8 | P05090 | Apolipoprotein D (GN |
|
| Rejected |
| 9 | P02763 | Alpha-1-acid glycoprotein 1 (GN |
|
| Rejected |
| 10 | P04430 | Ig kappa chain V-I region BAN |
|
| Rejected |
| 11 | P05160 | Coagulation factor XIII B chain (GN |
|
|
|
| 12 | P06331 | Ig heavy chain V-II region ARH-77 |
|
|
|
| 13 | P04114 | Apolipoprotein B-100 (GN |
|
| Rejected |
| 14 | P01009 | Alpha-1-antitrypsin (GN |
|
|
|
| 15 | P01023 | Alpha-2-macroglobulin (GN |
|
| Rejected |
| 16 | P02746 | Complement C1q subcomponent subunit B (GN |
|
| Rejected |
| 17 | P02786 | Transferrin receptor protein 1 (GN |
|
|
|
| 18 | Q92820 | Gamma-glutamyl hydrolase (GN | 1.28 |
|
|
| 19 | Q06033 | Inter-alpha-trypsin inhibitor heavy chain H3 (GN | 1.26 |
|
|
| 20 | P02790 | Hemopexin (GN | 1.20 |
|
|
| 21 | P18428 | Lipopolysaccharide-binding protein (GN | 1.20 |
|
|
| 22 | P01011 | Alpha-1-antichymotrypsin (GN | 1.03 |
|
|
| 23 | P08185 | Corticosteroid-binding globulin (GN | 0.99 |
|
|
| 24 | P19827 | Inter-alpha-trypsin inhibitor heavy chain H1 (GN | 0.98 | 0.06 |
|
| 25 | Q9UK55 | Protein Z-dependent protease inhibitor (GN | 0.85 | 0.073 |
|
| 26 | P08697 | Alpha-2-antiplasmin (GN | 0.78 |
|
|
| 27 | P01859 | Ig gamma-2 chain C region (GN |
|
| Rejected |
| 28 | P05155 | Plasma protease C1 inhibitor (GN |
|
|
|
| 29 | P06318 | Ig lambda chain V-VI region WLT |
|
|
|
| 30 | P08253 | 72 kDa type IV collagenase (GN |
|
| Rejected |
| 31 | P0DJI8 | Serum amyloid A-1 protein (GN |
|
| Rejected |
*Fold change is a ratio representing the change of protein concentration between total stroke cases and healthy control subjects.
Bold values, Fold change >1.5 or < 0.67, p-value < 0.05 and confirmed/tentative in Boruta random forest.
Figure 3(A) Heatmap showing the distribution and expression pattern of 31 significantly differentially expressed proteins between total stroke and healthy controls. Each column represents a different subject, while each row represents the UniProt ID of different proteins. The blue squares represent upregulated, and the red squares represent downregulated proteins in total stroke compared to healthy controls. White squares represent no change in protein expression. (B) Protein–protein interaction network analysis of differentially expressed proteins between total stroke and healthy controls. The color of the nodes represents the level of degree of interaction between the proteins ranging from 0 to 19, with dark green representing a high degree of interaction (toward 19) and light green representing a low degree of interaction (toward zero). The color of edges represents the interaction score ranging from zero to one, with dark red edges representing an interaction score with high confidence (toward one) and light red edges representing an interaction score with low confidence (toward zero).
Figure 4Functional enrichment analysis of differentially expressed proteins between total stroke and healthy controls.
Figure 5(A) Volcano plot depicting the log2 fold change on the x-axis and –log10 p-value on the y-axis for the upregulated and downregulated proteins in 20 IS cases compared to 20 healthy controls. (B) Feature selection using the Boruta random forest depicting important features/proteins to differentiate 20 IS and 20 healthy controls.
List of significantly differentially expressed proteins between ischemic stroke and healthy controls using fold change with p-value and Boruta random forest feature selection criteria.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| 1 | Q15848 | Adiponectin (GN |
|
| Rejected |
| 2 | P05160 | Coagulation factor XIII B chain (GN |
|
| Rejected |
| 3 | P05090 | Apolipoprotein D (GN |
|
|
|
| 4 | P04003 | C4b-binding protein alpha chain (GN |
|
| Rejected |
| 5 | P00450 | Ceruloplasmin (GN |
|
| Rejected |
| 6 | P04114 | Apolipoprotein B-100 (GN |
|
|
|
| 7 | P01009 | Alpha-1-antitrypsin (GN |
|
|
|
| 8 | P01023 | Alpha-2-macroglobulin (GN |
|
|
|
| 9 | P02746 | Complement C1q subcomponent subunit B (GN |
|
| Rejected |
| 10 | P02786 | Transferrin receptor protein 1 (GN |
|
|
|
| 11 | P04275 | von Willebrand factor (GN |
|
| Rejected |
| 12 | Q13790 | Apolipoprotein F (GN | 1.38 |
|
|
| 13 | Q9UNW1 | Multiple inositol polyphosphate phosphatase 1 (GN | 1.18 | 0.07 |
|
| 14 | Q9Y2I9 | TBC1 domain family member 30 (GN | 0.74 |
|
|
| 15 | Q99972 | Myocilin (GN |
|
|
|
| 16 | P17936 | Insulin-like growth factor-binding protein 3 (GN |
|
| Rejected |
*Fold change is a ratio representing the change of protein concentration between IS cases and healthy control subjects.
Bold values: Fold change >1.5 or < 0.67, p-value < 0.05 and confirmed/tentative in Boruta random forest.
Figure 6(A) Heatmap showing the distribution and expression pattern of 16 significantly differentially expressed proteins between IS and healthy controls. (B) Protein–protein interaction network analysis of differentially expressed proteins between IS and healthy controls.
Figure 7Functional enrichment analysis of differentially expressed proteins between ischemic stroke and healthy controls.
Figure 8(A) Volcano plot depicting the log2 fold change on the x-axis and –log10 p-value on the y-axis for the upregulated and downregulated proteins in 20 ICH cases compared to 20 healthy controls. (B) Feature selection using the Boruta random forest depicting important features/proteins to differentiate 20 ICH and 20 healthy controls.
List of significantly differentially expressed proteins between intracerebral hemorrhage and healthy controls using fold change with p-value and Boruta random forest feature selection criteria.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| 1 | P01861 | Ig gamma-4 chain C region (GN |
|
| Rejected |
| 2 | P27487 | Dipeptidyl peptidase 4 (GN |
|
| Rejected |
| 3 | P01880 | Ig delta chain C region (GN |
|
| Rejected |
| 4 | Q71U36 | Tubulin alpha-1A chain (GN |
|
| Rejected |
| 5 | P00450 | Ceruloplasmin (GN |
|
|
|
| 6 | P13796 | Plastin-2 (GN |
|
| Rejected |
| 7 | P01009 | Alpha-1-antitrypsin (GN |
|
| Rejected |
| 8 | P02649 | Apolipoprotein E (GN |
|
|
|
| 9 | P01625 | Ig kappa chain V-IV region Len |
|
| Rejected |
| 10 | P36955 | Pigment epithelium-derived factor (GN |
|
|
|
| 11 | P04275 | von Willebrand factor (GN |
|
|
|
| 12 | P02750 | Leucine-rich alpha-2-glycoprotein (GN |
|
|
|
| 13 | P01833 | Polymeric immunoglobulin receptor (GN |
|
|
|
| 14 | P09871 | Complement C1s subcomponent (GN |
|
|
|
| 15 | P01613 | Ig kappa chain V-I region Ni |
|
| Rejected |
| 16 | P05160 | Coagulation factor XIII B chain (GN |
|
| Rejected |
| 17 | P01701 | Ig lambda chain V-I region NEW |
|
| Rejected |
| 18 | P04430 | Ig kappa chain V-I region BAN |
|
| Rejected |
| 19 | P10643 | Complement component C7 (GN |
|
| Rejected |
| 20 | P02647 | Apolipoprotein A-I (GN |
|
| Rejected |
| 21 | P02763 | Alpha-1-acid glycoprotein 1 (GN |
|
| Rejected |
| 22 | P04217 | Alpha-1B-glycoprotein (GN |
|
|
|
| 23 | Q06033 | Inter-alpha-trypsin inhibitor heavy chain H3 (GN |
|
|
|
| 24 | P00747 | Plasminogen (GN |
|
| Rejected |
| 25 | P24592 | Insulin-like growth factor-binding protein 6 (GN |
|
|
|
| 26 | P19827 | Inter-alpha-trypsin inhibitor heavy chain H1 (GN | 1.48 |
|
|
| 27 | Q9UK55 | Protein Z-dependent protease inhibitor (GN | 1.44 |
|
|
| 28 | P08697 | Alpha-2-antiplasmin (GN | 1.42 |
|
|
| 29 | P10909 | Clusterin (GN | 1.27 |
|
|
| 30 | P06331 | Ig heavy chain V-II region ARH-77 | 1.23 |
|
|
| 31 | P18428 | Lipopolysaccharide-binding protein (GN | 1.23 |
|
|
| 32 | P05155 | Plasma protease C1 inhibitor (GN | 1.01 |
|
|
| 33 | P02654 | Apolipoprotein C-I (GN | 0.99 |
|
|
| 34 | P02652 | Apolipoprotein A-II (GN | 0.94 |
|
|
| 35 | P00746 | Complement factor D (GN |
|
| Rejected |
| 36 | P25311 | Zinc-alpha-2-glycoprotein (GN |
|
| Rejected |
| 37 | Q08380 | Galectin-3-binding protein (GN |
|
| Rejected |
| 38 | Q15485 | Ficolin-2 (GN |
|
| Rejected |
| 39 | B9A064 | Immunoglobulin lambda-like polypeptide 5 (GN |
|
|
|
| 40 | P35443 | Thrombospondin-4 (GN |
|
|
|
| 41 | P98160 | Basement membrane-specific heparan sulfate proteoglycan core protein (GN |
|
| Rejected |
*Fold change is a ratio representing the change of protein concentration between ICH cases and healthy control subjects.
Bold values: Fold change >1.5 or < 0.67, p-value < 0.05 and confirmed/tentative in Boruta random forest.
Figure 9(A) Heatmap showing the distribution and expression pattern of 41 significantly differentially expressed proteins between ICH and healthy controls. (B) Protein–protein interaction network analysis of differentially expressed proteins between ICH and healthy controls.
Figure 10Functional enrichment analysis of differentially expressed proteins between intracerebral hemorrhage and healthy controls.