| Literature DB >> 34957393 |
Fan Zhang1,2, Melissa Petersen1,2, Leigh Johnson1,3, James Hall1,3, Sid E O'Bryant1,3.
Abstract
Driven by massive datasets that comprise biomarkers from both blood and magnetic resonance imaging (MRI), the need for advanced learning algorithms and accelerator architectures, such as GPUs and FPGAs has increased. Machine learning (ML) methods have delivered remarkable prediction for the early diagnosis of Alzheimer's disease (AD). Although ML has improved accuracy of AD prediction, the requirement for the complexity of algorithms in ML increases, for example, hyperparameters tuning, which in turn, increases its computational complexity. Thus, accelerating high performance ML for AD is an important research challenge facing these fields. This work reports a multicore high performance support vector machine (SVM) hyperparameter tuning workflow with 100 times repeated 5-fold cross-validation for speeding up ML for AD. For demonstration and evaluation purposes, the high performance hyperparameter tuning model was applied to public MRI data for AD and included demographic factors such as age, sex and education. Results showed that computational efficiency increased by 96%, which helped to shed light on future diagnostic AD biomarker applications. The high performance hyperparameter tuning model can also be applied to other ML algorithms such as random forest, logistic regression, xgboost, etc.Entities:
Keywords: alzheimer’s disease; high performance computing; hyperparameter tuning; machine learning; support vector machine
Year: 2021 PMID: 34957393 PMCID: PMC8692864 DOI: 10.3389/frai.2021.798962
Source DB: PubMed Journal: Front Artif Intell ISSN: 2624-8212
Talon3 computer nodes.
| Quanity | Memory (GB) | Cores | Description |
|---|---|---|---|
| 192 | 64 | 28 | Dell PowerEdge C6320 server with two 2.4 GHz Intel Xeon E5-2680 v4 14-core processors |
| 75 | 32 | 16 | Dell PowerEdge R420 server with two 2.1 GHz Intel Xeon E5-2450 eight-core processors |
| 64 | 64 | 16 | Dell PowerEdge R420 server with two 2.1 GHz Intel Xeon E5-2450 eight-core processors |
| 8 | 512 | 32 | Dell PowerEdge R720 server with four 2.4 GHz Intel Xeon E5-4640 eight-core processors |
| 16 | 64 | 28 | Dell PowerEdge R730 server with two 2.4 GHz Intel Xeon E5-2680 v4 14-core processors and two Nvidia Tesla K80 GPUS (4,992 GPU cores/card) |
FIGURE 1Pseudo code for parallel SVM hyperparameter tuning.
FIGURE 2Computational time vs. number of cores with SVM modeling.
Performance for testing set after hyperparameter tuning.
| Actual demented | Actual nondemented | |
|---|---|---|
| Predicted demented | 9 | 1 |
| Predicted nondemented | 3 | 16 |
| Precision/PPV | 90.00% | |
| Accuracy | 86.21% | |
| Sensitivity | 75.00% | |
| Specificity | 94.12% | |
| NPV | 84.21% | |
| AUC | 90.80% | |
| PPV12 | 63.49% | |
| NPV12 | 96.50% | |
FIGURE 3Pseudo code for parallel RF hyperparameter tuning.
FIGURE 4R script for parallel computing.
FIGURE 5Variable importance of the eight variables.
FIGURE 6Shell script for parallel computing.
FIGURE 7Computational time vs. number of cores with RF modeling.