| Literature DB >> 27418712 |
Jintao Long1, Genhua Pan2, Emmanuel Ifeachor2, Robert Belshaw3, Xinzhong Li1.
Abstract
Blood-based biomarkers for Alzheimer's disease would be very valuable because blood is a more accessible biofluid and is suitable for repeated sampling. However, currently there are no robust and reliable blood-based biomarkers for practical diagnosis. In this study we used a knowledge-based protein feature pool and two novel support vector machine embedded feature selection methods to find panels consisting of two and three biomarkers. We validated these biomarker sets using another serum cohort and an RNA profile cohort from the brain. Our panels included the proteins ECH1, NHLRC2, HOXB7, FN1, ERBB2, and SLC6A13 and demonstrated promising sensitivity (>87%), specificity (>91%), and accuracy (>89%).Entities:
Mesh:
Substances:
Year: 2016 PMID: 27418712 PMCID: PMC4932164 DOI: 10.1155/2016/4250480
Source DB: PubMed Journal: Dis Markers ISSN: 0278-0240 Impact factor: 3.434
Figure 1Workflow of Support Vector Machine Forward Selection (SVMFS).
Top 20 proteins with the largest LOOCV accuracy.
| NCBI accession ID | Protein name | LOOCV accuracy |
|---|---|---|
| BC011792.1 | ECH1 | 96.5% |
| NM_004502.2 | HOXB7 | 96.5% |
| NM_177924.1 | ASAH1 | 96.5% |
| BC030814.1 | IGKV1-5 | 95.4% |
| BC034142.1 | IGKV1-5 | 95.4% |
| BC034146.1 | IGKV1-5 | 95.4% |
| BC034937.1 | C10orf64 | 95.4% |
| NM_176884.1 | TAS2R43 | 95.4% |
| PV3366 | ERBB2 | 94.2% |
| NM_201278.1 | MTMR2 | 94.2% |
| BC038406.1 | C3orf20 | 94.2% |
| NM_152776.1 | MGC40579 | 94.2% |
| NM_014110.3 | PPP1R8 | 93.0% |
| XM_294794.1 | LOC339065 | 93.0% |
| NM_019891.1 | ERO1LB | 93.0% |
| BC068078.1 | NPM2 | 93.0% |
| NM_002613.3 | PDPK1 | 93.0% |
| NM_031268.3 | PDPK1 | 93.0% |
| BC032101.1 | JAGN1 | 93.0% |
| NM_000963.1 | PTGS2 | 93.0% |
Performances of three proposed models in datasets GSE29676 and GSE39087.
| Average LOOCV accuracy | Validation accuracy | Sensitivity | Specificity | NPV | PPV | FDR | FOR | |
|---|---|---|---|---|---|---|---|---|
| Cross-validation in GSE29676 | ||||||||
| ECH1 + NHLRC2 | 98.8% | 95.4% | 94.0% | 97.1% | 93.0% | 97.7% | 2.3% | 7.0% |
| ECH1 + HOXB7 | 97.7% | 95.6% | 95.0% | 96.3% | 94.1% | 97.1% | 2.9% | 5.9% |
| ERBB2 + FN1 + SLC6A13 | 96.5% | 89.6% | 87.9% | 91.8% | 86.4% | 93.4% | 6.6% | 13.6% |
| The Nagele model | 64.0% | 56.8% | 58.9% | 54.3% | 51.8% | 62.2% | 37.8% | 48.2% |
|
| ||||||||
| Cross-validation in GSE39087 | ||||||||
| ECH1 + NHLRC2 | 88.9% | 87.4% | 79.7% | 90.8% | 91.2% | 80.7% | 19.4% | 8.8% |
| ECH1 + HOXB7 | 97.8% | 96.9% | 96.8% | 96.9% | 98.6% | 93.8% | 6.2% | 1.4% |
| ERBB2 + FN1 + SLC6A13 | 74.4% | 69.8% | 80.9% | 65.0% | 89.4% | 51.4% | 48.6% | 10.6% |
| The Nagele model | 70.0% | 69.4% | 80.2% | 64.7% | 89.0% | 50.6% | 49.4% | 11.0% |
Figure 3Expression level of six proteins in the three proposed models under different conditions (red for AD samples and blue for healthy samples) in dataset GSE29676. The vertical coordinate of each plot represents the processed expression value and the horizontal coordinate represents different sample categories.
Figure 4Three proposed models in dataset GSE29676. Blue shaded area indicates where a sample will be classified as healthy by the prediction model. Coordinates in each plot represent the processed expression value. Red represents AD samples and blue represents healthy samples.
Figure 2ROC curves of the three proposed models in the cross-cohort validation using GSE39087.
Figure 5Expression level of six proteins in the three proposed models under different conditions (red for AD samples and blue for healthy samples) in dataset GSE39087.
Figure 6Three proposed models in dataset GSE39087.
Accuracy performances of our three proposed models in dataset GSE5281 (see Section 2 for full name of brain regions).
| EC | HIP | MTG | PC | SFG | VCX | |
|---|---|---|---|---|---|---|
| ECH1 + NHLRC2 | 95.5% | 78.3% | 60.0% | 57.1% | 68.0% | 50.0% |
| ECH1 + HOXB7 | 86.4% | 87.0% | 80.0% | 85.7% | 68.0% | 40.0% |
| ERBB2 + FN1 + SLC6A13 | 90.9% | 56.5% | 88.0% | 81.0% | 60.0% | 43.3% |