| Literature DB >> 35887259 |
Alexey S Kononikhin1, Natalia V Zakharova2, Savva D Semenov3, Anna E Bugrova1,2, Alexander G Brzhozovskiy1, Maria I Indeykina1,2, Yana B Fedorova4, Igor V Kolykhalov4, Polina A Strelnikova2,3, Anna Yu Ikonnikova5, Dmitry A Gryadunov5, Svetlana I Gavrilova4, Evgeny N Nikolaev1.
Abstract
Early recognition of the risk of Alzheimer's disease (AD) onset is a global challenge that requires the development of reliable and affordable screening methods for wide-scale application. Proteomic studies of blood plasma are of particular relevance; however, the currently proposed differentiating markers are poorly consistent. The targeted quantitative multiple reaction monitoring (MRM) assay of the reported candidate biomarkers (CBs) can contribute to the creation of a consistent marker panel. An MRM-MS analysis of 149 nondepleted EDTA-plasma samples (MHRC, Russia) of patients with AD (n = 47), mild cognitive impairment (MCI, n = 36), vascular dementia (n = 8), frontotemporal dementia (n = 15), and an elderly control group (n = 43) was performed using the BAK 125 kit (MRM Proteomics Inc., Canada). Statistical analysis revealed a significant decrease in the levels of afamin, apolipoprotein E, biotinidase, and serum paraoxonase/arylesterase 1 associated with AD. Different training algorithms for machine learning were performed to identify the protein panels and build corresponding classifiers for the AD prognosis. Machine learning revealed 31 proteins that are important for AD differentiation and mostly include reported earlier CBs. The best-performing classifiers reached 80% accuracy, 79.4% sensitivity and 83.6% specificity and were able to assess the risk of developing AD over the next 3 years for patients with MCI. Overall, this study demonstrates the high potential of the MRM approach combined with machine learning to confirm the significance of previously identified CBs and to propose consistent protein marker panels.Entities:
Keywords: Alzheimer’s disease; machine learning; mass spectrometry; multiple reaction monitoring; targeted proteomics
Mesh:
Substances:
Year: 2022 PMID: 35887259 PMCID: PMC9318764 DOI: 10.3390/ijms23147907
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 6.208
Subject demographics.
| Control | MCI (nc/c) | AD (ps/s) | VD | FTD | |
|---|---|---|---|---|---|
| N | 39 | 32 (23/9) | 37 (13/24) | 6 | 11 |
| Age (years) | 67.6 ± 8.0 | 70.7 ± 7.4/76.1 ± 7.9 | 66.4 ± 4.6/78.4 ± 5.8 | 73.6 ± 7.8 | 64.4 ± 11.4 |
| Sex (%, F) | 69.2 | 70.6 (78.3/44.4) | 54.1 (57.1/52.2) | 66.7 | 63.6 |
| 10.0 | 12.5/33.3 | 46.2/41.7 | 50.0 | 18.2 | |
| e2/e3 | 17.5 | 12.5/11.1 | 0/8.3 | 16.7 | 45.5 |
| e3/e3 | 72.5 | 75/55.6 | 53.8/50.0 | 33.3 | 36.4 |
| e2/e4 | 2.5 | 0/0 | 0/4.2 | 0 | 0 |
| e3/e4 | 7.5 | 12.5/33.3 | 15.4/20.8 | 50.0 | 18.2 |
| e4/e4 | 0 | 0/0 | 30.8/16.7 | 0 | 0 |
| MMSE | 29.5 ± 0.7 | 28.7 ± 1.5/25.9 ± 4 | 14.7 ± 6.6/17.6 ± 4.8 | 22.0 ± 3.7 | 16.9 ± 8.6 |
| CDT | 9.9 ± 0.29 | 9.5 ± 1.1/8.6 ± 1.6 | 4.7 ± 2.6/5.5 ± 2.7 | 8.3 ± 1.7 | 5.7 ± 3.9 |
| BNT | 53.3 ± 1.64 | 51 ± 3.4/45.7 ± 5.7 | 23.6 ± 16/26.5 ± 17.2 | 38.5 ± 7.2 | 18.7 ± 19.7 |
| LMWT | |||||
| NM | 7.98 ± 1.14 | 7.7 ± 1.2/5.7 ± 1.9 | 2.3 ± 1.9/3.4 ± 2.2 | 5.3 ± 1.7 | 3.1 ± 2.8 |
| DM | 6.81 ± 1.71 | 6.4 ± 1.9/4.5 ± 2.7 | 0.26 ± 0.6/1.6 ± 2 | 3.1 ± 1.6 | 1.9 ± 3.0 |
| MDRS | |||||
| Sound associations | 17.9 ± 3.4 | 15.4 ± 4/12.5 ± 5.6 | 6.1 ± 4.7/6.5 ± 4.6 | 12.3 ± 3.0 | 3.3 ± 4.1 |
| Categorial associations | 19.8 ± 1.7 | 17.1 ± 4.4/12.3 ± 5.3 | 7.6 ± 5.4/7.0 ± 5.2 | 12.8 ± 2.9 | 3.6 ± 4.1 |
| Cardiovascular diseases (%) | 65.8 | 70.8/55.6 | 71.4/87.0 | 100.0 | 63.6 |
| Diabetes mellitus (%) | 0.0 | 12.5/11.1 | 7.1/13.0 | 33.3 | 9.1 |
| Gastrointestinal pathologies (%) | 21.1 | 16.7/44.4 | 35.7/26.1 | 66.7 | 36.4 |
| Genitourinary pathologies (%) | 15.8 | 25.0/55.5 | 13.3/39.1 | 50.0 | 9.1 |
| Nicergoline usage (%) | 0 | 13.8/18.2 | 10.5/7.1 | 0 | 26.7 |
| Choline alfoscerate usage (%) | 0 | 65.5/63.6 | 15.8/28.6 | 0 | 0 |
| Donepezil usage (%) | 0 | 0/9.1 | 26.3/28.6 | 25 | 26.7 |
| Memantine usage (%) | 0 | 3.4/0 | 73.7/67.9 | 50 | 73.3 |
| Rivastigmine usage (%) | 0 | 0/0 | 26.3/21.4 | 0 | 0 |
| Quetiapine usage (%) | 0 | 0/0 | 10.5/17.9 | 0 | 33.3 |
Abbreviations: AD—Alzheimer’s disease (ps—pre-senile, s—senile); APOE—apolipoprotein E gene (genotype); BNT—Boston naming test; CDT—clock drawing test; FTD—frontotemporal dementia; LMWT—Luria memory words test (NM—non-intermediate memorization, DM—delayed memorization); MCI—mild cognitive impairment (nc—non-converter, c—converter); MDRS—Mattis dementia rating scale; MMSE—Mini Mental State Examination; VD—vascular dementia. All average values are given ± SD.
Figure 1The results of study participants using psychometric tests and scales: (A) Mini Mental State Examination (MMSE), (B) Boston naming test (BNT), (C) Mattis dementia rating scale (MDRS) and categorical associations subtest. MCI—mild cognitive impairment (nc—non-converter, c—converter); AD—Alzheimer’s disease (mild, moderate, and severe subgroups); VD—vascular dementia and FTD—frontotemporal dementia. Lines inside the boxes show medians; box flanges—25–75 percentiles; whisker range ± SD. Significantly different results, with p-values < 0.01 according to the Mann–Whitney U test, are shown with the following symbols: * significantly different to the control group; ** to the control and MCI groups; *** to the control, MCI and AD groups; **** to the control and AD groups; ***** to the control, MCI, AD and VD groups; # to the control and MCI-nc; “F”—to FTD; “V”—to VD; “!”—significant differences between subgroups.
Figure 2Differentiating proteins between the control and AD groups with uncorrected p-values ≤ 0.01 according to the Mann–Whitney U test. Lines inside boxes—medians; box flanges—25–75 percentiles; whisker range ± SD.
Figure 3The relative ratings of the quantified proteins (AD vs. control). (A) The ordering of 22 proteins with uncorrected p-values of <0.05 by the effect sizes (Cohen’s d). (B) The ordering by the mean feature importance of the proteins included as classifiers in this study. The abbreviation for the name of proteins along the Y axis is given in accordance with Table 2. The color saturation represents the value of feature importance/p-value of the feature.
Proteins selected in this study for the binary classification of AD vs. the controls.
| Protein Name | Abbr. | UniProt ID | Other Cohorts | Effect Size | Relative Import. | ||
|---|---|---|---|---|---|---|---|
| 1 | Afamin | AFAM | P43652 | 3 [ | 1.7 × 10−4 | 0.840 | 0.0318 |
| 2 | Apolipoprotein E | APOE | P02649 | 2–5 [ | 3.1 × 10−3 | 0.601 | 0.0310 |
| 3 | Serum paraoxonase/arylesterase 1 | PON1 | P27169 | 1 [ | 3.6 × 10−3 | 0.605 | 0.0257 |
| 4 | Fibrinogen beta chain | FGB | P02675 | 1–2 [ | 0.0192 | −0.339 | 0.0220 |
| 5 | Biotinidase | BTD | P43251 | 1 [ | 2.7 × 10−3 | 0.714 | 0.0201 |
| 6 | Pregnancy zone protein | PZP | P20742 | 1 [ | 0.0161 | 0.403 | 0.0192 |
| 7 | Attractin | ATRN | O75882 | 1 [ | 0.0200 | 0.548 | 0.0191 |
| 8 | Fibrinogen gamma chain | FGG | P02679 | 3–4 [ | 0.0263 | −0.331 | 0.0191 |
| 9 | Apolipoprotein A-IV | APOA4 | P06727 | 3 [ | 0.0134 | 0.548 | 0.0187 |
| 10 | Vitronectin | VTNC | P04004 | 1–3 [ | 0.0128 | 0.395 | 0.0185 |
| 11 | Cathelicidin antimicrobial peptide | CAMP | P49913 | - | 0.0176 | 0.473 | 0.0181 |
| 12 | Complement C1q subcomponent subunit B | C1QB | P02746 | - | 6.7 × 10−3 | −0.638 | 0.0180 |
| 13 | Alpha-1-acid glycoprotein 1 | A1AG1 | P02763 | 1 [ | 0.0566 | −0.447 | 0.0178 |
| 14 | Lipopolysaccharide-binding protein | LBP | P18428 | - | 0.0577 | 0.332 | 0.0169 |
| 15 | Fibronectin | FINC | P02751 | 1–3 [ | 0.0157 | −0.308 | 0.0167 |
| 16 | Complement C5 | CO5 | P01031 | 1 [ | 0.0679 | −0.382 | 0.0165 |
| 17 | Tenascin | TENA | P24821 | 1–3 [ | 0.0232 | 0.476 | 0.0152 |
| 18 | Alpha-2-antiplasmin | A2AP | P08697 | - | 0.0102 | 0.557 | 0.0151 |
| 19 | Fibrinogen alpha chain | FGA | P02671 | 2–3 [ | 0.103 | −0.250 | 0.0150 |
| 20 | Apolipoprotein C-II | APOC2 | P02655 | 1 [ | 0.0516 | 0.530 | 0.0149 |
| 21 | Fibulin-1 | FBLN1 | P23142 | - | 0.433 | 0.292 | 0.0144 |
| 22 | Adipocyte plasma membrane-associated protein | APMAP | Q9HDC0 | - | 0.127 | −0.351 | 0.0143 |
| 23 | Serotransferrin | TRFE | P02787 | 2 [ | 0.286 | 0.243 | 0.0141 |
| 24 | Metalloproteinase inhibitor 2 | TIMP2 | P16035 | - | 0.175 | 0.386 | 0.0138 |
| 25 | Alpha-1-antichymotrypsin | AACT | P01011 | 1 [ | 0.0883 | −0.475 | 0.0128 |
| 26 | Peroxiredoxin-2 | PRDX2 | P32119 | 1 [ | 0.199 | −0.009 | 0.0126 |
| 27 | Apolipoprotein C-IV | APOC4 | P55056 | - | 0.0274 | 0.396 | 0.0125 |
| 28 | Vascular cell adhesion protein 1 | VCAM1 | P19320 | 3 [ | 0.0443 | −0.323 | 0.0125 |
| 29 | Plasminogen activator inhibitor 1 | PAI1 | P05121 | 1 [ | 0.0343 | 0.392 | 0.0123 |
| 30 | Beta-2-glycoprotein 1 | APOH | P02749 | 2–3 [ | 1.0 | 0.007 | 0.0122 |
| 31 | Cystatin-C | CYTC | P01034 | 1 [ | 0.217 | −0.420 | 0.0113 |
Includes 1–16—proteins; Set-25—1–25 proteins; Set-28—1–28 proteins and Set-31—1–31 proteins.
Figure 4Differentiation of the AD and control samples using the obtained protein panels and developed classifiers. (A) ROC curves for the differentiation of AD vs. the controls for classifiers generated with 4 algorithms and 2 sets of proteins. (B) Distribution of the control and AD samples according to the probability of AD using two classifiers with the best performances.
Figure 5The distribution of MCI, VD and FTD patients according to the probability of AD development using the RF-31 classifier. (A) Distribution of MCI patient samples with a known three-year prognosis (samples of MCI-nc patients with <3 years of observation are excluded). (B) The distribution of the VD and FTD samples.
Figure 6Venn diagram showing proteins differing between AD, FTD and the control at the uncorrected p-value of ≤ 0.05. Bold type shows proteins with a p-value ≤ 0.01.