| Literature DB >> 35603148 |
Congqi Hu1, Zhao Dai2, Jia Xu2, Lianyu Zhao2, Yanping Xu3, Meilin Li2, Jiahui Yu2, Lu Zhang2, Hui Deng2, Lijuan Liu1, Mingying Zhang1, Jiarong Huang4, Linping Wu4, Guangxing Chen1,3.
Abstract
Rheumatoid arthritis (RA) causes serious disability and productivity loss, and there is an urgent need for appropriate biomarkers for diagnosis, treatment assessment, and prognosis evaluation. To identify serum markers of RA, we performed mass spectrometry (MS)-based proteomics, and we obtained 24 important markers in normal and RA patient samples using a random forest machine learning model and 11 protein-protein interaction (PPI) network topological analysis methods. Markers were reanalyzed using additional proteomics datasets, immune infiltration status, tissue specificity, subcellular localization, correlation analysis with disease activity-based diagnostic indications, and diagnostic receiver-operating characteristic analysis. We discovered that ORM1 in serum is significantly differentially expressed in normal and RA patient samples, which is positively correlated with disease activity, and is closely related to CD56dim natural killer cell, effector memory CD8+T cell, and natural killer cell in the pathological mechanism, which can be better utilized for future research on RA. This study supplies a comprehensive strategy for discovering potential serum biomarkers of RA and provides a different perspective for comprehending the pathological mechanism of RA, identifying potential therapeutic targets, and disease management.Entities:
Keywords: ORM1; biomarker; proteomics; rheumatoid arthritis; serum
Mesh:
Substances:
Year: 2022 PMID: 35603148 PMCID: PMC9120366 DOI: 10.3389/fimmu.2022.865425
Source DB: PubMed Journal: Front Immunol ISSN: 1664-3224 Impact factor: 8.786
Figure 1Schematic diagram about the analysis of RA potential proteins based on LC-MS proteomics. (A) Identification of differentially expressed proteins. (B) Multiple-analysis methods to assess the importance of differentially expressed proteins.
Figure 2Identification of differentially expressed proteins. (A) Random forest machine learning method combined with the PPI network importance analysis algorithm to screen differential proteins. (B) Enrichment results of 24 differential proteins. The blue size represents the number of counts corresponding to the result.
Literature summary of differential proteins.
| Official Full Name | Symbol | Total articles | RA related articles | PMID | Level Change |
|---|---|---|---|---|---|
| Orosomucoid 1 |
| 3,701 | 5 | 26915672 | Up |
| Haptoglobin |
| 96,237 | 996 | 4029960 | Up |
| Transthyretin |
| 13,127 | 61 | 30308029 | Down |
| C-Reactive Protein |
| 108,764 | 6,602 | 33385862 | Up |
| Ceruloplasmin |
| 139,242 | 649 | 26001728 | Up |
| Fibrinogen Beta Chain |
| 1,920 | 15 | 26059223 | Up |
| Fibronectin 1 |
| 25,482 | 14 | 34764682 | Down |
| Apolipoprotein A2 |
| 3,263 | 3 | 12027302 | Down |
| Apolipoprotein C3 |
| 1,230 | 2 | 31966382 | Down |
| Serum Amyloid A1 |
| 986 | 52 | 17985847 | Up |
| Fibrinogen Gamma Chain |
| 2,003 | 11 | 32408093 | Up |
| Fibrinogen Alpha Chain |
| 3,606 | 10 | 22267327 | Up |
| Serpin Family A Member 1 |
| 909 | 9 | 34712268 | Up |
| Apolipoprotein B |
| 20,164 | 93 | 29997113 | Down |
| Apolipoprotein E |
| 33,596 | 53 | 32253242 | Up |
| Von Willebrand Factor |
| 24,468 | 129 | 25973092 | Up |
| Alpha-2-Macroglobulin |
| 5,871 | 113 | 10343526 | Up |
| Apolipoprotein A4 |
| 1,212 | 1 | 2547867 | Up |
| GC Vitamin D Binding Protein |
| 158,417 | 727 | 3874814 | Down |
| Histidine Rich Glycoprotein |
| 1,662 | 6 | 29246875 | Up |
| Complement C4A |
| 1,813 | 44 | 22076784 | Down |
| Complement C4B |
| 3,242 | 79 | 22076784 | Up |
| Apolipoprotein A1 |
| 15,810 | 60 | 31694752 | Down |
| Lipoprotein A |
| 16,481 | 114 | 19369465 | Up |
Figure 3Combine other proteomics datasets to analyze differential proteins. (A) Crossover situation in 5 proteomics datasets. (B) Visually display differential proteins co-expressed (greater than 2) in multiple proteomics datasets. Different shapes and colors represent different datasets. (C, D) Enrichment results of 30 differential proteins in 5 proteomics datasets. The green size represents the number of counts corresponding to the result.
Figure 4Multi-angle analysis of 24 differentially expressed proteins. (A) Immune infiltration analysis protein expression in 23 immune cells in healthy controls and RA patient. (B) Correlation analysis of 24 proteins with statistically significant immune cells. (C) Tissue specificity of 24 differential proteins. (D) Subcellular localization of 24 differential proteins. *p < 0.05, **p < 0.01, ***p < 0.001. ns, no significance.
Clinical indicator information.
| Characteristic | DAP | DIP |
| Method |
|---|---|---|---|---|
| n | 7 | 7 | ||
| SJC, n (%) | 0.051 | Chisq.test | ||
| 0 | 0 (0%) | 3 (21.4%) | ||
| 1 | 0 (0%) | 2 (14.3%) | ||
| 3 | 2 (14.3%) | 0 (0%) | ||
| 4 | 2 (14.3%) | 0 (0%) | ||
| 5 | 0 (0%) | 2 (14.3%) | ||
| 7 | 1 (7.1%) | 0 (0%) | ||
| 13 | 1 (7.1%) | 0 (0%) | ||
| 28 | 1 (7.1%) | 0 (0%) | ||
| TJC, n (%) | 0.051 | Chisq.test | ||
| 0 | 0 (0%) | 5 (35.7%) | ||
| 1 | 0 (0%) | 1 (7.1%) | ||
| 3 | 0 (0%) | 1 (7.1%) | ||
| 5 | 3 (21.4%) | 0 (0%) | ||
| 6 | 1 (7.1%) | 0 (0%) | ||
| 7 | 1 (7.1%) | 0 (0%) | ||
| 10 | 1 (7.1%) | 0 (0%) | ||
| 28 | 1 (7.1%) | 0 (0%) | ||
| PGA, n (%) | 0.051 | Chisq.test | ||
| 0.5 | 0 (0%) | 1 (7.1%) | ||
| 1 | 0 (0%) | 3 (21.4%) | ||
| 3 | 0 (0%) | 2 (14.3%) | ||
| 4 | 0 (0%) | 1 (7.1%) | ||
| 5 | 4 (28.6%) | 0 (0%) | ||
| 6 | 1 (7.1%) | 0 (0%) | ||
| 7 | 1 (7.1%) | 0 (0%) | ||
| 9 | 1 (7.1%) | 0 (0%) | ||
| MDGA, n (%) | 0.215 | Chisq.test | ||
| 0.5 | 0 (0%) | 1 (7.1%) | ||
| 1 | 0 (0%) | 2 (14.3%) | ||
| 3 | 1 (7.1%) | 3 (21.4%) | ||
| 4 | 2 (14.3%) | 1 (7.1%) | ||
| 5 | 1 (7.1%) | 0 (0%) | ||
| 6 | 2 (14.3%) | 0 (0%) | ||
| 9 | 1 (7.1%) | 0 (0%) | ||
| GH, n (%) | 0.101 | Chisq.test | ||
| 5 | 0 (0%) | 1 (7.1%) | ||
| 10 | 0 (0%) | 3 (21.4%) | ||
| 30 | 0 (0%) | 2 (14.3%) | ||
| 40 | 1 (7.1%) | 1 (7.1%) | ||
| 50 | 3 (21.4%) | 0 (0%) | ||
| 60 | 1 (7.1%) | 0 (0%) | ||
| 70 | 1 (7.1%) | 0 (0%) | ||
| 90 | 1 (7.1%) | 0 (0%) | ||
| ESR, mean ± SD | 46.57 ± 15.87 | 25.71 ± 12.53 | 0.018 | T test |
| CRP, median (IQR) | 9.32 (6.47, 46.8) | 3.53 (2.01, 13.59) | 0.097 | Wilcoxon |
| CDAI, median (IQR) | 22 (18.5, 28.5) | 7 (2.5, 9.5) | 0.002 | Wilcoxon |
| SDAI, median (IQR) | 22.93 (18.97, 34.95) | 7.14 (2.76, 11.36) | 0.002 | Wilcoxon |
| DAS28-ESR, median (IQR) | 5.5 (5.16, 5.88) | 2.33 (2.27, 3.66) | < 0.001 | Wilcoxon |
| DAS28-CRP, mean ± SD | 5.16 ± 1.41 | 2.4 ± 1 | 0.001 | T test |
| VAS, median (IQR) | 5 (4.5, 5.5) | 0.5 (0.5, 1.5) | 0.004 | Wilcoxon |
| HAQ, median (IQR) | 0.5 (0.32, 3.56) | 0 (0, 1.13) | 0.195 | Wilcoxon |
According to different data types, different analysis methods are selected.
Figure 5Correlation analysis between RA laboratory diagnostic indicators and disease activity. Positive and negative signs indicate the direction of the correlation. A positive sign indicates a positive correlation (red), and a negative sign indicates a negative correlation (blue). *p < 0.05, **p < 0.01.
Figure 6Summarize a variety of analysis methods to select the most meaningful protein as the key protein of RA.
Summary of human proteome datasets for biomarker analysis.
| Official full name | Symbol | Total articles | RA-related articles | PMID | Level change |
|---|---|---|---|---|---|
| Orosomucoid 1 |
| 3,555 | 1 | 26915672 | Up |
| Haptoglobin |
| 10,621 | 80 | 4029960 | Up |
| Transthyretin |
| 10,864 | 13 | 30308029 | Down |
| C-reactive protein |
| 91,372 | 3,271 | 33385862 | Up |
| Ceruloplasmin |
| 9,192 | 55 | 26001728 | Up |
| Fibrinogen beta chain |
| 1,453 | 6 | 26059223 | Up |
| Fibronectin 1 |
| 24,248 | 2 | 34764682 | Down |
| Apolipoprotein A2 |
| 3,112 | NA | NA | Down |
| Apolipoprotein C3 |
| 734 | NA | NA | Down |
| Serum amyloid A1 |
| 444 | 2 | 17985847 | Up |
| Fibrinogen gamma chain |
| 441 | NA | NA | Up |
| Fibrinogen alpha chain |
| 2,059 | NA | NA | Up |
| Serpin family A member 1 |
| 4 | 2 | 34712268 | Up |
| Apolipoprotein B |
| 15,507 | 36 | 29997113 | Down |
| Apolipoprotein E |
| 27,281 | 15 | 32253242 | Up |
| Von Willebrand factor |
| 22,539 | 43 | 25973092 | Up |
| Alpha-2-macroglobulin |
| 5,061 | 43 | 10343526 | Up |
| Apolipoprotein A4 |
| 1,046 | NA | NA | Up |
| GC vitamin D-binding protein |
| 849 | 1 | 3874814 | Down |
| Histidine-rich glycoprotein |
| 876 | 1 | 29246875 | Up |
| Complement C4A |
| 1,224 | 3 | 22076784 | Down |
| Complement C4B |
| 2,889 | 1 | 22076784 | Up |
| Apolipoprotein A1 |
| 14,603 | 15 | 31694752 | Down |
| Lipoprotein A |
| 9,068 | 37 | 19369465 | Up |
NA, not available.