| Literature DB >> 35898959 |
Eduardo Perez-Valero1,2, Christian Morillas1,2, Miguel A Lopez-Gordo2,3, Ismael Carrera-Muñoz4, Samuel López-Alcalde5, Rosa M Vílchez-Carrillo4.
Abstract
Early detection is crucial to control the progression of Alzheimer's disease and to postpone intellectual decline. Most current detection techniques are costly, inaccessible, or invasive. Furthermore, they require laborious analysis, what delays the start of medical treatment. To overcome this, researchers have recently investigated AD detection based on electroencephalography, a non-invasive neurophysiology technique, and machine learning algorithms. However, these approaches typically rely on manual procedures such as visual inspection, that requires additional personnel for the analysis, or on cumbersome EEG acquisition systems. In this paper, we performed a preliminary evaluation of a fully-automated approach for AD detection based on a commercial EEG acquisition system and an automated classification pipeline. For this purpose, we recorded the resting state brain activity of 26 participants from three groups: mild AD, mild cognitive impairment (MCI-non-AD), and healthy controls. First, we applied automated data-driven algorithms to reject EEG artifacts. Then, we obtained spectral, complexity, and entropy features from the preprocessed EEG segments. Finally, we assessed two binary classification problems: mild AD vs. controls, and MCI-non-AD vs. controls, through leave-one-subject-out cross-validation. The preliminary results that we obtained are comparable to the best reported in literature, what suggests that AD detection could be automatically detected through automated processing and commercial EEG systems. This is promising, since it may potentially contribute to reducing costs related to AD screening, and to shortening detection times, what may help to advance medical treatment.Entities:
Keywords: Alzheimer's disease; EEG; classification; disease detection; machine learning
Year: 2022 PMID: 35898959 PMCID: PMC9309796 DOI: 10.3389/fninf.2022.924547
Source DB: PubMed Journal: Front Neuroinform ISSN: 1662-5196 Impact factor: 3.739
Group, sex, and age distributions of the participants engaged for this study.
|
|
|
|
|
|---|---|---|---|
| NC | 7 | 1 | 67 ± 3.5 |
| MCI-non-AD | 0 | 5 | 73.4 ± 7.1 |
| Mild AD | 5 | 3 | 68.8 ± 4.9 |
The values reported in the Age column represent the mean ± the standard deviation.
Figure 1(Left) Sixteen-channel montage used for the present study (in green) in the extended 10-20 International System. We selected this montage to evenly cover the scalp area. (Right) Versatile semi-dry EEG acquisition system utilized for the data capture. The system includes a Bluetooth acquisition module placed below the occipital area and an EEG headset with semi-dry electrodes. Please, note that the right panel image does not represent the actual montage used in this study and it was included to illustrate the acquisition system and headset.
Figure 2EEG signal processing pipeline. First, we applied a FIR filter with bandpass 1–45 Hz to remove the power line interference and retain the spectral content in the desired frequency range. Then, we segmented the filtered EEG into 4-s epochs without overlapping. Finally, we performed automated artifact removal through Autoreject algorithm and ICA.
Figure 3EEG-based feature extraction and classification pipeline. First, we performed feature extraction on the 4-s clean epochs from all the participants. This yielded a feature matrix with 112 columns (features) and as many columns as the total number of epochs (N). Next, we averaged every S consecutive epochs to enhance the SNR of the features (we evaluated values of 6, 8, 10, and 12 for S). Then, we implemented a grid search cross-validation procedure to find the best hyperparameters for two possible classifiers (SVM and LR). We selected a LOSO strategy for cross-validation. Finally, we estimated the classification performance metrics across all the participants.
Figure 4Comparison of the results yielded for the two binary classification problems as a function of the number of epochs averaged during processing. Error bars indicate the standard error of the mean. For each case, we have also included the name of the best performing classifier. For the mild AD vs. Control problem and the MCI-non-AD vs. Control problem, SVM with 10 and 8 epochs used for the average, yielded the best performance, respectively.
Hyperparameter values inspected using grid-search cross-validation.
|
|
|
| |
|---|---|---|---|
|
|
| ||
| % Features to keep | [10, 25, 50, 75, 100] | 100 | 25 |
| C (SVM) | [10−4, 10−3, …, 104] | 102 | 101 |
| γ (SVM) | [Scale, auto] | auto | scale |
| C (LR) | [10−4, 10−3, …, 104] | - | - |
| Class weight (LR) | [None, balanced] | - | - |
“Best” column indicates the values of each hyperparameter that resulted in the best performance. We have not reported the best values for the LR because the SVM algorithm yielded superior results.
Confusion matrices for the two binary classification problems.
|
|
| |||||
|---|---|---|---|---|---|---|
| PAT | HC | PAT | HC | |||
| PAT | 0.88 (8) | 0.12 (0) | 0.88 (4) | 0.12 (1) | ||
| HC | 0.17 (1) | 0.83 (7) | 0.01 (0) | 0.99 (8) | ||
They were estimated from the predictions yielded by the SVM, which was the best performing classifier for both problems. The values outside and inside the parenthesis represent the results for the epoch-level and the participant-level classification, respectively. The highlighted cells denote the true positives and true negatives.
Classification metrics for the mild AD vs. NC and MCI-non-AD vs.
|
|
|
|
|
|
|---|---|---|---|---|
| Mild AD vs. NC | Mild AD | 0.83 | 0.88 | 0.86 ± 0.06 |
| NC | 0.88 | 0.83 | ||
| MCI-non-AD vs. NC | MCI-non-AD | 0.98 | 0.88 | 0.96 ± 0.03 |
| NC | 0.92 | 0.99 |
NC problems. The right-most column indicates the average cross-validation F1-score ± the standard error of the mean.