| Literature DB >> 32645927 |
Yury E Glazyrin1,2, Dmitry V Veprintsev2, Irina A Ler3, Maria L Rossovskaya3, Svetlana A Varygina3, Sofia L Glizer3,4, Tatiana N Zamay1, Marina M Petrova4, Zoran Minic5, Maxim V Berezovski5, Anna S Kichkailo1,2.
Abstract
Diabetic nephropathy, hypertension, and glomerulonephritis are the most common causes of chronic kidney diseases (CKD). Since CKD of various origins may not become apparent until kidney function is significantly impaired, a differential diagnosis and an appropriate treatment are needed at the very early stages. Conventional biomarkers may not have sufficient separation capabilities, while a full-proteomic approach may be used for these purposes. In the current study, several machine learning algorithms were examined for the differential diagnosis of CKD of three origins. The tested dataset was based on whole proteomic data obtained after the mass spectrometric analysis of plasma and urine samples of 34 CKD patients and the use of label-free quantification approach. The k-nearest-neighbors algorithm showed the possibility of separation of a healthy group from renal patients in general by proteomics data of plasma with high confidence (97.8%). This algorithm has also be proven to be the best of the three tested for distinguishing the groups of patients with diabetic nephropathy and glomerulonephritis according to proteomics data of plasma (96.3% of correct decisions). The group of hypertensive nephropathy could not be reliably separated according to plasma data, whereas analysis of entire proteomics data of urine did not allow differentiating the three diseases. Nevertheless, the group of hypertensive nephropathy was reliably separated from all other renal patients using the k-nearest-neighbors classifier "one against all" with 100% of accuracy by urine proteome data. The tested algorithms show good abilities to differentiate the various groups across proteomic data sets, which may help to avoid invasive intervention for the verification of the glomerulonephritis subtypes, as well as to differentiate hypertensive and diabetic nephropathy in the early stages based not on individual biomarkers, but on the whole proteomic composition of urine and blood.Entities:
Keywords: chronic kidney disease; differential diagnosis; label-free quantification; machine learning; mass spectrometry; proteomics
Mesh:
Substances:
Year: 2020 PMID: 32645927 PMCID: PMC7369970 DOI: 10.3390/ijms21134802
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Principal characteristics of the chronic kidney diseases (CKD) patients.
| Variable | Group D | Group G | Group H |
|---|---|---|---|
| Sex, | |||
| Male | 6 (40) | 8 (57) | 2 (40) |
| Female | 9 (60) | 6 (43) | 3 (60) |
| Age, mean ± SD (years) | 62.11 ± 17.91 | 48.46 ± 16.83 | 60.6 ± 14.15 |
| Stage of renal disease, | |||
| Stage 1 | 6 (40) | 7 (50) | 0 (0) |
| Stage 2 + 3 | 7 (46.7) | 2 (14.3) | 0 (0) |
| Stage 4 + 5 | 2 (13.3) | 5 (35.7) | 5 (100) |
| Level of proteinuria | |||
| Below 30 mg/l, | 0 (0) | 0 (0) | 0 (0) |
| 30 - 300 mg/l, | 5 (33.4) | 5 (35.7) | 1 (20) |
| Above 300 mg/l, | 10 (66.6) | 9 (64.3) | 4 (80) |
| Presence of hypertension, | 8 (53) | 10 (71.4) | 5 (100) |
| eGFR, mean ± SD (mL/min) | 39.56 ± 15.27 | 68.38 ± 48.36 | 14.8 ± 7.19 |
| Biopsy tested patients, | 0 (0) | 2 (14.3) | 0 (0) |
| Type of glomerulonephritis | |||
| Chronic glomerulonephritis, | 10 (71.4) | ||
| Nephrotic syndrome, | 4 (28.6) |
Abbreviations: SD, standard deviation; eGFR, estimated glomerular filtration rate.
Figure 1Distribution of urine proteins by molecular weight in the three studied groups of patients.
Figure 2Scheme of two-stage differential diagnosis of chronic kidney disease using proteomic data of urine and plasma.