| Literature DB >> 18769539 |
Martín Gómez Ravetti1, Pablo Moscato.
Abstract
BACKGROUND: Alzheimer's disease (AD) is a progressive brain disease with a huge cost to human lives. The impact of the disease is also a growing concern for the governments of developing countries, in particular due to the increasingly high number of elderly citizens at risk. Alzheimer's is the most common form of dementia, a common term for memory loss and other cognitive impairments. There is no current cure for AD, but there are drug and non-drug based approaches for its treatment. In general the drug-treatments are directed at slowing the progression of symptoms. They have proved to be effective in a large group of patients but success is directly correlated with identifying the disease carriers at its early stages. This justifies the need for timely and accurate forms of diagnosis via molecular means. We report here a 5-protein biomarker molecular signature that achieves, on average, a 96% total accuracy in predicting clinical AD. The signature is composed of the abundances of IL-1alpha, IL-3, EGF, TNF-alpha and G-CSF. METHODOLOGY/PRINCIPALEntities:
Mesh:
Substances:
Year: 2008 PMID: 18769539 PMCID: PMC2518833 DOI: 10.1371/journal.pone.0003111
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Number of errors from the 18-genes randomly selected signatures on the “AD” validation test set.
| Seed Number | S18-1 | S18-2 | S18-3 | S18-4 |
| S18-6 | S18-7 | S18-8 |
| S18-10 |
| 76 | 18 | 14 | 11 | 18 |
| 18 | 11 | 10 |
| 10 |
| 144 | 18 | 15 | 12 | 19 |
| 17 | 13 | 11 |
| 13 |
| 121 | 18 | 15 | 10 | 22 |
| 19 | 11 | 8 |
| 13 |
| 83 | 17 | 14 | 11 | 21 |
| 18 | 13 | 12 |
| 15 |
| 33 | 20 | 18 | 12 | 20 |
| 16 | 11 | 11 |
| 15 |
| 51 | 15 | 16 | 11 | 21 |
| 17 | 12 | 8 |
| 15 |
| 162 | 15 | 13 | 13 | 20 |
| 21 | 14 | 8 |
| 13 |
| 37 | 13 | 14 | 11 | 21 |
| 20 | 10 | 9 |
| 11 |
| 136 | 17 | 16 | 13 | 22 |
| 20 | 10 | 10 |
| 14 |
| 60 | 18 | 10 | 11 | 17 |
| 18 | 10 | 9 |
| 15 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
The Random forest algorithm was used as classifier. For each signature 10 runs with different seeds were done. We used the WEKA software implementation, and the algorithm was allowed to generate 150 trees. The best and worst signatures are highlighted in bold text. In two cases we found signatures that classify above 90%, comparable with the results of Ray et al. that report on 91% AD predictability as a result of their proposed methodology.
Number of errors from the 6-genes randomly selected signatures on the “AD” validation test set.
| Seed Number | S6-1 | S6-2 | S6-3 | S6-4 | S6-5 | S6-6 | S6-7 | S6-8 | S6-9 | S6-10 |
| 76 | 40 | 34 | 20 | 31 | 31 | 32 | 29 | 32 | 24 | 34 |
| 144 | 40 | 32 | 19 | 34 | 32 | 33 | 30 | 31 | 23 | 33 |
| 121 | 38 | 37 | 18 | 33 | 35 | 30 | 28 | 32 | 27 | 31 |
| 83 | 40 | 33 | 19 | 31 | 33 | 34 | 27 | 27 | 24 | 31 |
| 33 | 41 | 33 | 17 | 35 | 33 | 30 | 27 | 28 | 27 | 29 |
| 51 | 39 | 33 | 19 | 28 | 34 | 30 | 28 | 28 | 24 | 30 |
| 162 | 41 | 35 | 19 | 31 | 36 | 34 | 28 | 27 | 26 | 33 |
| 37 | 40 | 33 | 17 | 32 | 31 | 29 | 27 | 35 | 24 | 32 |
| 136 | 42 | 36 | 19 | 34 | 34 | 32 | 30 | 34 | 24 | 26 |
| 60 | 40 | 35 | 17 | 28 | 27 | 31 | 29 | 32 | 23 | 29 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
The Random forest algorithm was used as classifier, for each signature 10 runs with different seeds were done. We used the WEKA software implementation, and the algorithm was allowed to generate 150 trees. The best and worst signatures are highlighted in bold text. This result shows what it is expected, that a 6-signature, when the biomarkers are randomly chosen, is performing significantly worse than the panel of 18 biomarkers selected by Ray et. al. Now the best result (81.5%) is worse than the average result of a random 18-signature (86%).
Random experiments report.
| 18-gene random signatures | 6-gene random signatures | |
| Average Error | 15.14 | 30.59 |
| Best Signature (average) | 6.2 | 17 |
| Worst Signature (average) | 25.7 | 40.5 |
| Standard Deviation | 5.36 | 6.21 |
| Accuracy Average |
|
|
The table shows the average results of the 20 random signatures for each size, also including the best and worst results and the standard deviation. The accuracy average is calculated considering the error average over the 92 samples of “AD” validation test set.
Figure 1Histograms of the number of errors of the random forest classifier using 20 randomly selected signatures with 18 proteins.
The arrow indicates the results under the same conditions of the 18-protein signature proposed by Ray et al.
Figure 2Histograms of the number of errors considering the random forest classifier and the 20 randomly selected signatures with 6 proteins.
The arrow indicates the results under the same conditions of our 6-protein signature.
Protein name for each signature used in the computational experiment.
| Protein Name | Entrez GeneID | Official gene name provided by HUGO Gene Nomenclature Committee (HGNC) | In signature | |||
| 18 | 10 | 6 | 5 | |||
| ANG-2 | 285 | angiopoietin 2 | x | |||
| CCL5/RANTES | 6352 | chemokine (C-C motif) ligand 5 | x | |||
| CCL7/MCP-3 | 6354 | chemokine (C-C motif) ligand 7 | x | x | ||
| CCL15/MIP-1δ | 6359 | chemokine (C-C motif) ligand 15 | x | x | ||
| CCL18/PARC | 6362 | chemokine (C-C motif) ligand 18 (pulmonary and activation-regulated) | x | |||
| CXCL8/IL-8 | 3576 | interleukin 8 | x | |||
| EGF | 1950 | epidermal growth factor (beta-urogastrone) | x | x | x | x |
| G-CSF | 1440 | colony stimulating factor 3 (granulocyte) | x | x | x | x |
| GDNF | 2668 | glial cell derived neurotrophic factor | x | |||
| ICAM-1 | 3383 | intercellular adhesion molecule 1 (CD54), human rhinovirus receptor | x | |||
| IGFBP-6 | 3489 | insulin-like growth factor binding protein 6 | x | |||
| IL-1α | 3552 | interleukin 1, alpha | x | x | x | x |
| IL-3 | 3562 | interleukin 3 (colony-stimulating factor, multiple) | x | x | x | x |
| IL-6 | 3569 | interleukin 6 (interferon, beta 2) | x | x | ||
| IL-11 | 3589 | interleukin 11 | x | x | ||
| M-CSF | 1435 | colony stimulating factor 1 (macrophage) | x | |||
| PDGF-BB | 5155 | platelet-derived growth factor beta polypeptide (simian sarcoma viral (v-sis) oncogene homolog) | x | x | ||
| TNF-α | 7124 | tumor necrosis factor (TNF superfamily, member 2) | x | x | x | x |
| TRAIL R4 | 8793 | tumor necrosis factor receptor superfamily, member 10d, decoy with truncated death domain | x | |||
Report of the results of the 24 classifiers when using the 18-Protein biomarker.
| Classifier | Grand Total | OVERALL (“AD”+“MCI”) | Test Set “AD” | Test Set “MCI” | |||
| AD Er. | NAD Er. | AD Er. | NAD Er. | AD Er. | NAD Er. | ||
| Dataset size | 139 | 64 | 75 | 42 | 50 | 22 | 25 |
|
| 21 | 7 | 14 | 4 | 6 | 3 | 8 |
|
| 20 | 5 | 15 | 2 | 6 | 3 | 9 |
|
| 25 | 10 | 15 | 5 | 6 | 5 | 9 |
|
| 27 | 11 | 16 | 6 | 7 | 5 | 9 |
|
| 21.7 | 10.1 | 11.6 | 4 | 3.3 | 6.1 | 8.3 |
|
| 27 | 7 | 20 | 3 | 7 | 4 | 13 |
|
| 23 | 4 | 19 | 1 | 5 | 3 | 14 |
|
| 23 | 4 | 19 | 1 | 5 | 3 | 14 |
|
| 23 | 4 | 19 | 1 | 5 | 3 | 14 |
|
| 21 | 5 | 16 | 2 | 3 | 3 | 13 |
|
| 21 | 5 | 16 | 2 | 3 | 3 | 13 |
|
| 28 | 5 | 23 | 2 | 11 | 3 | 12 |
|
| 28 | 15 | 13 | 5 | 3 | 10 | 10 |
|
| 23 | 9 | 14 | 4 | 3 | 5 | 11 |
|
| 28 | 14 | 14 | 5 | 4 | 9 | 10 |
|
| 23.1 | 7.9 | 15.2 | 3.3 | 5.2 | 4.6 | 10 |
|
| 27 | 11 | 16 | 6 | 7 | 5 | 9 |
|
| 26.1 | 10.1 | 16 | 4.4 | 5.5 | 5.7 | 10.5 |
|
| 24 | 13 | 11 | 3 | 2 | 10 | 9 |
|
| 25 | 10 | 15 | 5 | 6 | 5 | 9 |
|
| 26 | 13 | 13 | 5 | 4 | 8 | 9 |
|
| 25 | 14 | 11 | 7 | 2 | 7 | 9 |
|
| 24.3 | 9.3 | 15 | 4.1 | 4 | 5.2 | 11 |
|
| 24 | 13 | 11 | 3 | 2 | 10 | 9 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
18-Protein Signature (Ray et al.)
Report of the results of the 24 classifiers when using the 10-Protein biomarker.
| 10-Protein Signature | |||||||
| Classifier | Grand Total | OVERALL (“AD”+“MCI”) | Test Set “AD” | Test Set “MCI” | |||
| AD Er. | NAD Er. | AD Er. | NAD Er. | AD Er. | NAD Er. | ||
| Dataset size | 139 | 64 | 75 | 42 | 50 | 22 | 25 |
|
| 23 | 5 | 18 | 3 | 8 | 2 | 10 |
|
| 23 | 7 | 16 | 2 | 6 | 5 | 10 |
|
| 23 | 4 | 19 | 1 | 8 | 3 | 11 |
|
| 24 | 6 | 18 | 1 | 9 | 5 | 9 |
|
| 21.8 | 4.9 | 16.9 | 1.2 | 6.9 | 3.7 | 10 |
|
| 28 | 7 | 21 | 1 | 8 | 6 | 13 |
|
| 31 | 6 | 25 | 2 | 12 | 4 | 13 |
|
| 31 | 6 | 25 | 2 | 12 | 4 | 13 |
|
| 31 | 6 | 25 | 2 | 12 | 4 | 13 |
|
| 28 | 6 | 22 | 3 | 9 | 3 | 13 |
|
| 28 | 6 | 22 | 3 | 9 | 3 | 13 |
|
| 39 | 3 | 36 | 0 | 18 | 3 | 18 |
|
| 28 | 15 | 13 | 5 | 3 | 10 | 10 |
|
| 22 | 4 | 18 | 1 | 8 | 3 | 10 |
|
| 23 | 8 | 15 | 1 | 5 | 7 | 10 |
|
| 25.1 | 6.7 | 18.4 | 1.6 | 8 | 5.1 | 10.4 |
|
| 24 | 6 | 18 | 1 | 9 | 5 | 9 |
|
| 25.8 | 9.9 | 15.9 | 3.3 | 6.4 | 6.6 | 9.5 |
|
| 22 | 11 | 11 | 3 | 2 | 8 | 9 |
|
| 37 | 17 | 20 | 8 | 12 | 9 | 8 |
|
| 19 | 13 | 6 | 5 | 3 | 8 | 3 |
|
| 21 | 10 | 11 | 3 | 2 | 7 | 9 |
|
| 23.9 | 9.4 | 14.5 | 2.7 | 5 | 6.7 | 9.5 |
|
| 22 | 11 | 11 | 3 | 2 | 8 | 9 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Report of the results of the 24 classifiers when using the 6-Protein biomarker.
| 6-Protein Signature | |||||||
| Classifier | Grand Total | OVERALL (“AD”+“MCI”) | Test Set “AD” | Test Set “MCI” | |||
| AD Er. | NAD Er. | AD Er. | NAD Er. | AD Er. | NAD Er. | ||
| Dataset size | 139 | 64 | 75 | 42 | 50 | 22 | 25 |
|
| 20 | 8 | 12 | 1 | 3 | 7 | 9 |
|
| 20 | 9 | 11 | 2 | 2 | 7 | 9 |
|
|
|
|
|
|
|
|
|
|
| 21 | 4 | 17 | 0 | 7 | 4 | 10 |
|
| 25.6 | 3.2 | 22.4 | 0.4 | 9 | 2.8 | 13.4 |
|
| 22 | 8 | 14 | 3 | 4 | 5 | 10 |
|
| 23 | 8 | 15 | 2 | 5 | 6 | 10 |
|
| 24 | 9 | 15 | 3 | 5 | 6 | 10 |
|
| 23 | 8 | 15 | 2 | 5 | 6 | 10 |
|
| 33 | 9 | 24 | 3 | 11 | 6 | 13 |
|
| 33 | 9 | 24 | 3 | 11 | 6 | 13 |
|
| 33 | 6 | 27 | 1 | 13 | 5 | 14 |
|
| 29 | 16 | 13 | 6 | 3 | 10 | 10 |
|
| 27 | 11 | 16 | 3 | 6 | 8 | 10 |
|
| 23 | 10 | 13 | 3 | 6 | 7 | 7 |
|
| 24.7 | 9.8 | 14.9 | 2.4 | 4.8 | 7.4 | 10.1 |
|
| 21 | 4 | 17 | 0 | 7 | 4 | 10 |
|
| 26.6 | 11.5 | 15.1 | 3.1 | 5.6 | 8.4 | 9.5 |
|
| 24 | 10 | 14 | 2 | 5 | 8 | 9 |
|
|
|
|
|
|
|
|
|
|
| 21 | 10 | 11 | 1 | 2 | 9 | 9 |
|
| 27 | 13 | 14 | 3 | 5 | 10 | 9 |
|
| 25.6 | 11.8 | 13.8 | 2.6 | 4.4 | 9.2 | 9.4 |
|
| 24 | 10 | 14 | 2 | 5 | 8 | 9 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Using this biomarker it is notable the effectiveness of predicting AD on the “AD” test set using simple classifiers as simple logistic or LMT (Logistic Model Tree) or even the same classifier used in [1] (PAM).
Report of the results of the 24 classifiers when using the 5-Protein biomarker.
| 5-Protein Signature | |||||||
| Classifier | Grand Total | OVERALL (“AD”+“MCI”) | Test Set “AD” | Test Set “MCI” | |||
| AD Er. | NAD Er. | AD Er. | NAD Er. | AD Er. | NAD Er. | ||
| Dataset size | 139 | 64 | 75 | 42 | 50 | 22 | 25 |
|
| 21 | 10 | 11 | 3 | 2 | 7 | 9 |
|
| 19 | 8 | 11 | 2 | 2 | 6 | 9 |
|
|
|
|
|
|
|
|
|
|
| 20 | 4 | 16 | 0 | 6 | 4 | 10 |
|
| 21.6 | 5.3 | 16.3 | 0.7 | 5.2 | 4.6 | 11.1 |
|
| 21 | 4 | 17 | 1 | 5 | 3 | 12 |
|
| 19 | 5 | 14 | 1 | 2 | 4 | 12 |
|
| 20 | 5 | 15 | 1 | 3 | 4 | 12 |
|
| 19 | 5 | 14 | 1 | 2 | 4 | 12 |
|
| 30 | 10 | 20 | 3 | 7 | 7 | 13 |
|
| 30 | 10 | 20 | 3 | 7 | 7 | 13 |
|
| 26 | 8 | 18 | 3 | 7 | 5 | 11 |
|
| 29 | 16 | 13 | 6 | 3 | 10 | 10 |
|
| 31 | 3 | 28 | 1 | 11 | 2 | 17 |
|
| 24 | 5 | 19 | 1 | 7 | 4 | 12 |
|
| 21.8 | 8.7 | 13.1 | 1.7 | 3.9 | 7 | 9.2 |
|
| 20 | 4 | 16 | 0 | 6 | 4 | 10 |
|
| 26.1 | 10.9 | 15.2 | 3.1 | 5.1 | 7.8 | 10.1 |
|
| 24 | 10 | 14 | 2 | 5 | 8 | 9 |
|
|
|
|
|
|
|
|
|
|
| 21 | 10 | 11 | 1 | 2 | 9 | 9 |
|
| 27 | 13 | 14 | 3 | 5 | 10 | 9 |
|
| 26.2 | 12.1 | 14.1 | 3.2 | 4.9 | 8.9 | 9.2 |
|
| 24 | 10 | 14 | 2 | 5 | 8 | 9 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Removing IL-6 from the biomarker set we have a small gain in predicting AD in both data set, if compared to the 6-protein signature. In this case, the prediction of AD on the “AD” test set achieves an average of 96% without dropping the accuracy of the prediction of NonAD.
Average results for each signature over 24 classifiers.
| Size | Overall | Overall (“AD”+“MCI”) | Test set “AD” | Test set “MCI” | ||||
| AD Er. | NAD Er. | AD Er. | NAD Er. | AD Er. | NAD Er. | |||
| 139 | 64 | 75 | 42 | 50 | 22 | 25 | ||
|
| Error Avg | 24.34 | 9.02 | 15.33 | 3.66 | 4.79 | 5.36 | 10.53 |
| Agr % | 82% | 86% |
| 91% |
|
| 58% | |
| 82% | 91% |
| ||||||
|
| Error Avg | 25.98 | 7.83 | 18.15 | 2.45 | 7.64 | 5.38 | 10.52 |
| Agr % | 81% |
| 76% | 94% | 85% |
| 58% | |
| 81% | 89% |
| ||||||
|
| Error Avg | 24.44 | 8.60 | 15.84 | 2.02 | 5.70 | 6.58 | 10.14 |
| Agr % | 82% | 87% | 79% | 95% | 89% | 70% |
| |
| 82% | 92% | 64% | ||||||
|
| Error Avg | 23.20 | 7.71 | 15.49 | 1.78 | 4.75 | 5.93 | 10.73 |
| Agr % |
|
| 79% |
|
| 73% | 57% | |
|
|
| 65% | ||||||
For each signature the average number of errors is reported and the percentage agreement is calculated over each specific population. The best results are highlighted in bold text.
The standard deviation of each test is shown on this table.
| Overall (“AD”+“MCI”) | Test set AD | Test set MCI | ||||
| AD Er. | NAD Er. | AD Er. | NAD Er. | AD Er. | NAD Er. | |
|
| 3.580 | 3.022 | 1.692 | 2.087 | 2.430 | 1.982 |
|
| 3.546 | 6.127 | 1.721 | 3.893 | 2.214 | 2.729 |
|
| 3.165 | 4.218 | 1.419 | 2.798 | 2.024 | 1.625 |
|
| 3.520 | 3.668 | 1.433 | 2.175 | 2.326 | 1.906 |
All the signatures show a very similar behaviour with a small standard deviation.
Number of errors for each classifier when considering both test sets together (139 samples).
| Method | Overall errors | |||
|
|
|
|
| |
| Simple Logistic | 25 | 25 | 18 |
|
| LMT | 25 | 25 |
|
|
| Logistic | 27 | 24 | 21 |
|
| MultiClass Classifier | 27 | 24 | 21 |
|
| Bayes Net | 27 | 28 | 22 |
|
| NBTree | 26 | 23 |
|
|
| Naïve Bayes | 23 | 30 | 23 |
|
| Naïve Bayes Up. | 23 | 30 | 23 |
|
| ClassViaRegression | 28 | 25 |
| 24 |
| Naïve Bayes Simple | 23 | 30 | 24 |
|
| Kstar | 28 | 41 | 33 |
|
| Decorate | 23.1 | 28.3 | 24.7 |
|
| SMO | 20 | 23 | 20 |
|
| Multilayer Perceptron | 21.7 | 21.8 | 25.6 |
|
| PAM | 21 | 22 |
| 21 |
| Random Committee |
| 26.3 | 26.6 |
|
| j48 |
|
|
|
|
| Ordinal Class Classifier |
|
|
|
|
| LWL |
|
| 29 | 29 |
| Random Forest |
|
| 25.6 | 26.2 |
| Part |
| 30 | 27 | 27 |
| AdaBoost |
| 31 | 27 | 31 |
| IB1 |
| 28 | 33 | 30 |
| Ibk |
| 28 | 33 | 30 |
| Average | 24.342 | 26.821 | 24.438 |
|
| Agreement % | 82% | 81% | 82% |
|
The signature with the best performance on each classifier is highlighted in bold text.
Number of errors for each classifier when considering the “AD” test set (92 samples).
| Method | “AD” test set | |||
|
|
|
|
| |
| NBTree | 9 | 8 |
|
|
| Simple Logistic | 11 | 9 |
|
|
| LMT | 11 | 20 |
|
|
| Logistic | 13 | 10 | 7 |
|
| MultiClass Classifier | 13 | 10 | 7 |
|
| PAM | 10 | 11 |
| 5 |
| SMO | 8 | 8 |
|
|
| Naïve Bayes | 6 | 14 | 7 |
|
| Naïve Bayes Up. | 6 | 14 | 7 |
|
| Bayes Net | 10 | 9 | 7 |
|
| Decorate | 8.5 | 9.6 | 7.2 |
|
| Naïve Bayes Simple | 6 | 14 | 8 |
|
| Kstar | 13 | 18 | 14 |
|
| Multilayer Perceptron | 7.3 | 8.1 | 9.4 |
|
| Random Committee | 9.9 | 9.7 | 8.7 |
|
| ClassViaRegression | 9 |
| 9 | 8 |
| Part | 9 |
| 8 | 8 |
| Random Forest | 8.1 | 7.7 |
| 8.1 |
| LWL |
|
| 9 | 9 |
| j48 |
|
| 7 | 7 |
| Ordinal Class Classifier |
|
| 7 | 7 |
| AdaBoost |
| 9 | 9 | 12 |
| IB1 |
| 12 | 14 | 10 |
| Ibk |
| 12 | 14 | 10 |
| Average | 8.45 | 10.09 | 7.72 |
|
| Agreement % | 91% | 89% | 92% |
|
The signature with the best performance on each classifier is highlighted in bold text.
Number of errors for each classifier when considering the “MCI” test set (47 samples).
| Method | “MCI” test set | |||
|
|
|
|
| |
| ClassViaRegression | 19 | 17 |
| 16 |
| Bayes Net | 17 | 19 |
|
|
| j48 | 19 |
|
|
|
| Ordinal Class Classifier | 19 |
|
|
|
| Naïve Bayes | 17 | 17 |
|
|
| Naïve Bayes Simple | 17 | 17 |
|
|
| Naïve Bayes Up. | 17 | 17 |
|
|
| Simple Logistic |
|
|
|
|
| Logistic |
|
|
|
|
| LWL |
|
|
|
|
| MultiClass Classifier |
|
|
|
|
| LMT |
| 17 |
|
|
| NBTree | 17 |
| 18 | 18 |
| Kstar |
| 21 | 19 | 16 |
| Multilayer Perceptron | 14.4 |
| 16.2 | 15.7 |
| Random Committee | 16.2 |
| 17.9 | 17.9 |
| Decorate |
| 15.5 | 17.5 | 16.2 |
| Random Forest |
|
| 18.6 | 18.1 |
| AdaBoost | 16 |
| 18 | 19 |
| Part |
|
| 19 | 19 |
| IB1 |
|
| 19 | 20 |
| Ibk |
|
| 19 | 20 |
| SMO |
| 15 | 16 | 15 |
| PAM |
| 12 | 16 | 16 |
| Average | 16.29 | 16.11 |
|
|
| Agreement % | 65% | 66% |
|
|
The signature with the best performance on each classifier is highlighted in bold text.
Figure 3Classification and prediction of clinical Alzheimer's diagnosis in subjects with Alzheimer's disease.
(a) An undirected graph, where each node corresponds a different protein belonging to the 10-protein signature we identified; each edge indicates the existence of a direct relation as obtained by searching the PubMed database, (using the Pathway Studio software). (b) Identification of the maximum clique of the graph, uncovering a robust 6-protein signature; each node on the clique has a direct relation with each other. Simple Logistic was used to classify and predict Alzheimer's (AD) and non-Alzheimer's class, in the training set (c), the blinded test set ‘AD’ (d). All the results are shown in a confusion matrix, for the training set a 10-fold cross-validation was applied 10 times, in both cases Simple Logistic was used with the default parameters of Weka package. All the p-values were calculated using the Fisher exact test.