| Literature DB >> 25144639 |
Angelique Stalmach1, Hanna Johnsson2, Iain B McInnes3, Holger Husi1, Julie Klein4, Mohammed Dakna4, William Mullen1, Harald Mischak5, Duncan Porter3.
Abstract
Early diagnosis and treatment of rheumatoid arthritis are associated with improved outcomes but current diagnostic tools such as rheumatoid factor or anti-citrullinated protein antibodies have shown limited sensitivity. In this pilot study we set out to establish a panel of urinary biomarkers associated with rheumatoid arthritis using capillary electrophoresis coupled to mass spectrometry. We compared the urinary proteome of 33 participants of the Scottish Early Rheumatoid Arthritis inception cohort study with 30 healthy controls and identified 292 potential rheumatoid arthritis-specific peptides. Amongst them, 39 were used to create a classifier model using support vector machine algorithms. Specific peptidic fragments were differentially excreted between groups; fragments of protein S100-A9 and gelsolin were less abundant in rheumatoid arthritis while fragments of uromodulin, complement C3 and fibrinogen were all increasingly excreted. The model generated was subsequently tested in an independent test-set of 31 samples. The classifier demonstrated a sensitivity of 88% and a specificity of 93% in diagnosing the condition, with an area under the receiver operating characteristic curve of 0.93 (p<0.0001). These preliminary results suggest that urinary biomarkers could be useful in the early diagnosis of rheumatoid arthritis. Further studies are currently being undertaken in larger cohorts of patients with rheumatoid arthritis and other athridities to assess the potential of the urinary peptide based classifier in the early detection of rheumatoid arthritis.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25144639 PMCID: PMC4140712 DOI: 10.1371/journal.pone.0104625
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Baseline characteristics of the training and test set populations (case and control)1.
| RA cohort | Controls | |||
| Training set (n = 33) | Test set (n = 16) | Training set (n = 30) | Test set (n = 15) | |
| Age (Years) | 59 (39; 65) | 58 (55; 68) | 31 (23; 60) | 36 (29; 58) |
| Female | 23 (70%) | 13 (76%) | 11 (37%) | 10 (67%) |
| Duration of disease (Days) | 113 (74; 261) | 91 (54; 225) | nd | nd |
| ACPA positive | 16 (48%) | 9 (56%) | nd | nd |
| ACPA (Units) | 5.7 (3; 161) | 47 (2; 214) | nd | nd |
| RF | 11 (73%) | 6 (86%) | nd | nd |
| RF | 15 (11; 30) | 20 (18; 55) | nd | nd |
| DAS28 score | 5.5 (4.3; 6.2) | 4.5 (3.9; 5.5) | nd | nd |
| HAQ score | 1.1 (0.8; 1.9) | 1.1 (0.5; 1.6) | nd | nd |
| CRP abnormal | 19 (63%) | 11 (73%) | nd | nd |
| CRP (mg/l) | 20.5 (6. 0; 41.0) | 19 (8.3; 38.5) | nd | nd |
nd, not determined; ACPA, anti-citrullinated protein antibodies; RF, rheumatoid factor; DAS28, 28 joint count disease activity score; HAQ, health assessment questionnaire score; CRP, C-reactive protein.
Differences between training set and test set within both RA and control groups were not statistically significant (Mann-Whitney for continuous values and Chi Square for categorical values; p<0.05) with the exception for the proportion of female in the control group between the training and test sets (p<0.05).
Difference in the median age value between groups is statistically significant between RA and control groups of the training set (p = 0.0023) and between RA and control groups of the test set (p = 0.0059).
Difference in the gender distribution between groups is statistically significant between RA and control groups of the training set (p<0.01) but not between RA and control groups of the test set (p>0.05) (Chi Square test).
Data missing for 18 patients in the training set and 9 patients in test set, percentage refers to proportion of patients tested.
Figure 1Work flow used for the determination of urinary biomarkers associated with RA.
Urinary peptides which were significantly less abundant in patients with RA.
| Fold change | Sequence | Identification |
| 0.018 | PpGpPGKNGDDGEAGKPG | Collagen alpha-1(I) chain |
| 0.047 | SpGERGETGPpGPA | Collagen alpha-1(III) chain |
| 0.074 | VADEAQVQKVKELEDLEHLQ | Carboxypeptidase A1 |
| 0.117 | PpGKNGDDGEAGKPGRpGERGppGP | Collagen alpha-1(I) chain |
| 0.138 | pGLPGKAGASGFPGTKGEMGmmGPPGPpGP | Collagen alpha-5(IV) chain |
| 0.138 | HAHKLRVDPVNF | Hemoglobin subunit alpha |
| 0.151 | GEAGKpGEQGVpGDLGApGP | Collagen alpha-1(I) chain |
| 0.151 | TGLSmDGGGSPKGDVDP | Sodium/potassium-transporting ATPase subunit gamma |
| 0.176 | VVHTNYDEY | Alpha-1-microglobulin |
| 0.183 | EAGENQKQPEKNAGPTAR | C-X-C motif chemokine 16 |
| 0.270 | TTLASHSTK | Mucin-1 subunit alpha |
| 0.312 | NpGPPGpSGSpGKDGPpGPAG | Collagen alpha-1(III) chain |
| 0.383 | EDLDTNADKQLSFEEF | Protein S100-A9 |
| 0.399 | NRGERGSEGSPGHpGQPGPpGPPGApGP | Collagen alpha-1(III) chain |
| 0.421 | PpGKNGDDGEAGKPGRpGERGppGPQ | Collagen alpha-1(I) chain |
| 0.432 | EGSpGRDGSpGAKGDRG | Collagen alpha-1(I) chain |
| 0.439 | GSpGSpGPDGKTGPpGPAG | Collagen alpha-1(I) chain |
| 0.456 | LSSHIANVERVPFDAATLHTSTA | Gelsolin |
| 0.460 | DQGPVGRTGEVGAVGpPGFAGEKGPSGEAGTAGPpGTpGP | Collagen alpha-2(I) chain |
| 0.460 | GLpGTGGpPGENGKpGEPGPKG | Collagen alpha-1(III) chain |
| 0.466 | SDGLAHLDNLKG | Hemoglobin subunit delta |
| 0.504 | DGVPGKDGPRGP | Collagen alpha-1(III) chain |
| 0.512 | SpGSPGPDGKTGpP | Collagen alpha-1(I) chain |
| 0.514 | DGPpGRDGQpGHKG | Collagen alpha-2(I) chain |
| 0.540 | ApGPAGSRGApGPQGpRGDKGETGERG | Collagen alpha-1(III) chain |
| 0.548 | DpGKNGDKG | Collagen alpha-2(I) chain |
| 0.579 | pPGADGQPGAKGEpGDAGAKGDAGPpGPAGPAGPPGPIG | Collagen alpha-1(I) chain |
| 0.580 | GEHNPFKGAI | T calcium channel alpha 1G subunit variant 249 |
| 0.617 | DDGEAGKpGRpG | Collagen alpha-1(I) chain |
| 0.623 | GKNGDDGEAGKPGRpGERGPpGp | Collagen alpha-1(I) chain |
| 0.632 | SpGSPGPDGKTGPpGPAG | Collagen alpha-1(I) chain |
| 0.667 | PpGPPGPpGPPGPPS | Collagen alpha-1(I) chain |
| 0.700 | pPGADGQpGAKGEPGDAGAKGDAGPpGPAGPAGPpGPIG | Collagen alpha-1(I) chain |
| 0.733 | pPGEAGKpGEQGVPGDLG | Collagen alpha-1(I) chain |
*Peptides not included in the RA classifying biomarker model.
Urinary peptides which were found in significantly higher concentration in patients with RA.
| Fold change | Sequence | Identification |
| 1.398 | DGQpGAKGEpGDAGAKGDAGPpGP | Collagen alpha-1(I) chain |
| 1.452 | EpGSpGENGApGQmGPR | Collagen alpha-1(I) chain |
| 1.522 | NSGEpGApGSKGDTGAKGEpGpVG | Collagen alpha-1(I) chain |
| 1.613 | SGHPGSPGSPGYQGPpGEPGQAGPSGPpGP | Collagen alpha-1(III) chain |
| 1.705 | ApGGKGDAGApGERGPpG | Collagen alpha-1(III) chain |
| 1.732 | NGEpGGKGERGApGEKGEGGPpG | Collagen alpha-1(III) chain |
| 1.749 | PAPAPPPEPERPKEVE | Myosin light chain 3 |
| 1.816 | AGERGHPGAPGpSGSpGLPGVPGSMGDMVNYDEIK | Collagen alpha-1(XVI) chain |
| 1.857 | KGDRGETGpAGPPGApGAPGAPGPVGP | Collagen alpha-1(I) chain |
| 1.960 | NGApGEAGRDGNpGNDGPpG | Collagen alpha-2(I) chain |
| 1.984 | PpGDEGEmAIISQKGTpGEpGP | Collagen alpha-4(IV) chain |
| 2.074 | ADGQpGAKGEpGDAGAKGDAGppGP | Collagen alpha-1(I) chain |
| 2.142 | SGSVIDQSRVLNLGPITRK | Uromodulin |
| 2.421 | QGKTGpPGPPGVVGpQGPTGETGPMGERGHpGPpGP | Collagen alpha-1(V) chain |
| 2.426 | NGEpGGKGERGApGEKGEGGppG | Collagen alpha-1(III) chain |
| 2.434 | GPpGEAGKpGEQGVP | Collagen alpha-1(I) chain |
| 2.607 | GPpGKNGDDGEAGKPG | Collagen alpha-1(I) chain |
| 2.942 | TPEEKSAVTALWGKVNVDEV | Hemoglobin subunit beta |
| 3.084 | IDQSRVLNLGPITRK | Uromodulin |
| 3.242 | ADGQpGAKGEpGDAGAKGDAGPpGPAGP | Collagen alpha-1(I) chain |
| 3.681 | SGEpGApGSKGDTGAKGEpGP | Collagen alpha-1(I) chain |
| 3.699 | GEVGpAGSpGSNGApGQRGEPGPQGHAGAQGPPGpPG | Collagen alpha-1(III) chain |
| 3.910 | GppGPpGPAGKEG | Collagen alpha-1(I) chain |
| 3.928 | VIDQSRVLNLGPIT | Uromodulin |
| 4.016 | SGSVIDQSRVL | Uromodulin |
| 4.080 | NSGEpGApGSKGDTG | Collagen alpha-1(I) chain |
| 4.547 | GPpGPTGPGGDKGDTGPpGP | Collagen alpha-1(III) chain |
| 5.569 | LSMDGGGSPKGDVDP | Sodium/potassium-transporting ATPase subunit gamma |
| 7.564 | GDpGPpGPpGPpG | Collagen alpha-1(XV) chain |
| 8.523 | pGPQGPLGKPGAPGEPGPQG | Collagen alpha-1(VIII) chain |
| 8.928 | FGASAGTGDLSDNHDIISMK | Vesicular integral-membrane protein VIP36 |
| 11.494 | EGVQKEDIPPADLSDQVPDTESETRILLQGTPVA | Complement C3 |
| 15.096 | RPGApGPAGARGNDGATGAAGPPGPTGpAGpP | Collagen alpha-1(I) chain |
| 16.970 | DEAGSEADHEGTHSTKRGHAKS | Fibrinogen alpha chain |
| 23.336 | FDSDPITVTVPVEV | Clusterin |
| 27.407 | NPPKPMPNPNPNHPSSSGS | CD99 antigen |
*Peptides not included in the RA classifying biomarker model.
Figure 2Urinary polypeptide signatures in cases and controls from the validation set based on 39 significantly different sequenced peptides.
Normalized molecular weight (500–15000 Da) in logarithmic scale is plotted against normalized migration time (18–45 minutes). The mean signal intensity of the polypeptide peak is given in 3-dimensional depiction (n = 15 controls and 16 cases).
Figure 3Graphical representation of the frequency distribution of proteases with modified activity associated with RA.
Percentage frequency of peptide occurrences in the down-regulation group is plotted on the x-axis, whereas the percentage frequency of occurrences in the up-regulated group is plotted on the y-axis. Circled data points represent the proteases which activity is the most affected in RA compared to that of healthy controls (see Table 4).
Predictive analysis of changes in protease activity associated with peptides differentially regulated in RA 1.
| Protease | occ(up)[N(up) = 67] | occ(down)[N(down) = 64] | % frequency(up) | % frequency(down) | % frequencydifference ratio | Frequencyscores |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| Prolyl endopeptidase | 10 | 4 | 14.9 | 6.3 | 41.0 | 245.8 |
| ADAMTS4 | 6 | 12 | 9.0 | 18.8 | 35.4 |
|
| Kallikrein 4 | 6 | 2 | 9.0 | 3.1 | 48.3 | 193.1 |
| Granzyme A | 11 | 6 | 16.4 | 9.4 | 27.3 | 136.5 |
| KLK3 | 0 | 1 | 0.0 | 1.6 | 100.0 |
|
| MMP20 | 7 | 4 | 10.5 | 6.3 | 25.1 | 75.4 |
| Cathepsin L1 | 24 | 18 | 35.8 | 28.1 | 12.0 | 72.2 |
| Thimet oligopeptidase | 2 | 4 | 3.0 | 6.3 | 35.4 |
|
| MMP25 | 4 | 2 | 6.0 | 3.1 | 31.3 | 62.6 |
| Thrombin | 4 | 2 | 6.0 | 3.1 | 31.3 | 62.6 |
| Signal peptidase complex catalytic subunit | 8 | 11 | 11.9 | 17.2 | 18.0 |
|
| MMP14 | 30 | 35 | 44.8 | 54.7 | 10.0 |
|
| Kallikrein 2 | 5 | 3 | 7.5 | 4.7 | 22.8 | 45.7 |
| MMP12 | 63 | 69 | 94.0 | 107.8 | 6.8 |
|
| MMP1 | 26 | 30 | 38.8 | 46.9 | 9.4 |
|
| Cathepsin S | 15 | 12 | 22.4 | 18.8 | 8.8 | 26.5 |
| Triptidyl-peptidase 1 | 3 | 4 | 4.5 | 6.3 | 16.5 | −16.5 |
| ADAMTS5 | 18 | 20 | 26.9 | 31.3 | 7.5 | −15.1 |
| Kallikrein 5 | 4 | 3 | 6.0 | 4.7 | 12.0 | 12.0 |
| MMP2 | 28 | 30 | 41.8 | 46.9 | 5.7 | −11.5 |
| Cathepsin K | 16 | 14 | 23.9 | 21.9 | 4.4 | 8.8 |
| MMP7 | 42 | 43 | 62.7 | 67.2 | 3.5 | −3.5 |
| Meprin A | 38 | 36 | 56.7 | 56.3 | 0.4 | 0.8 |
| Calpain 2 | 10 | 10 | 14.9 | 15.6 | 2.3 | 0.0 |
| Neprilysin | 10 | 10 | 14.9 | 15.6 | 2.3 | 0.0 |
Frequency distribution analysis based on all peptides (n = 131).
Mathematical calculations are based on the following parameter and calculations:
occ(up) = Sum of all occurrences for each individual protease in the up-regulated peptides,
occ(down) = Sum of all occurrences for each individual protease in the down-regulated peptides,
N(up) = Total number of peptides being up-regulated,
N(down) = Total number of peptides being down-regulated,
% frequency(up) = (occ(up)/N(up)) * 100.
% frequency(down) = (occ(down)/N(down)) * 100.
% frequency difference ratio = | ((freq%(up) − freq%(down))/(freq%(up)+freq%(down)) * 100 |.
Frequency scores = %freq * (occ(up)-occ(down)).