| Literature DB >> 35632105 |
Michele Alessandrini1, Giorgio Biagetti1, Paolo Crippa1, Laura Falaschetti1, Simona Luzzi2, Claudio Turchetti1.
Abstract
The use of electroencephalography (EEG) has recently grown as a means to diagnose neurodegenerative pathologies such as Alzheimer's disease (AD). AD recognition can benefit from machine learning methods that, compared with traditional manual diagnosis methods, have higher reliability and improved recognition accuracy, being able to manage large amounts of data. Nevertheless, machine learning methods may exhibit lower accuracies when faced with incomplete, corrupted, or otherwise missing data, so it is important do develop robust pre-processing techniques do deal with incomplete data. The aim of this paper is to develop an automatic classification method that can still work well with EEG data affected by artifacts, as can arise during the collection with, e.g., a wireless system that can lose packets. We show that a recurrent neural network (RNN) can operate successfully even in the case of significantly corrupted data, when it is pre-filtered by the robust principal component analysis (RPCA) algorithm. RPCA was selected because of its stated ability to remove outliers from the signal. To demonstrate this idea, we first develop an RNN which operates on EEG data, properly processed through traditional PCA; then, we use corrupted data as input and process them with RPCA to filter outlier components, showing that even with data corruption causing up to 20% erasures, the RPCA was able to increase the detection accuracy by about 5% with respect to the baseline PCA.Entities:
Keywords: Alzheimer’s disease (AD); deep neural network (DNN); electroencephalography (EEG); principal component analysis (PCA); recurrent neural network (RNN); robust PCA (RPCA)
Mesh:
Year: 2022 PMID: 35632105 PMCID: PMC9145212 DOI: 10.3390/s22103696
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
Figure 1Structure of an RNN. Reproduced under CC-BY from [36].
Figure 2LSTM cell unit. Reproduced under CC-BY from [36].
Dataset details.
| Class | Subjects | Duration (s) |
|---|---|---|
| N | 15 | 17,932 |
| AD | 20 | 28,586 |
| Total | 35 | 46,518 |
Figure 3Flow chart of the proposed algorithm for Alzheimer’s disease Recognition.
Number of data windows before and after oversampling (example for windows of 512 samples with 25% overlap).
| Class | Original | Oversampled |
|---|---|---|
| N | 5424 | 8099 |
| AD | 8099 | 8099 |
| Total | 13,523 | 16,198 |
Figure 4Steps to apply PCA reduction to 3-D input tensor: ➊ transposing second and third dimension, ➋ flattening first two dimensions, ➌ dimension reduction through PCA, ➍ converting back to original form.
Figure 5Singular value magnitude of an example input matrix, limited to the first 150 values.
Figure 6RNN architecture.
Details of RNN layers.
| Layer | Input Size | Output Size | Parameters |
|---|---|---|---|
| LSTM 1 | (-, 50, 16) | (-, 50, 8) | 832 |
| Dropout 1 | (-, 50, 8) | (-, 50, 8) | 0 |
| LSTM 2 | (-, 50, 8) | (-, 8) | 544 |
| Dropout 2 | (-, 8) | (-, 8) | 0 |
| Dense | (-, 8) | (-, 2) | 18 |
Experimental results for original temporal data, best result is displayed in bold.
| window samples | 128 | 256 | 384 | 512 | 640 | ||||||||||
| window duration (s) | 1 | 2 | 3 | 4 | 5 | ||||||||||
| window overlap (%) | 0 | 25 | 50 | 0 | 25 | 50 | 0 | 25 | 50 | 0 | 25 | 50 | 0 | 25 | 50 |
| input features | 36,493 | 48,642 | 72,961 | 18,240 | 24,310 | 36,468 | 12,157 | 16,198 | 24,294 | 9115 | 12,148 | 18,214 | 7290 | 9711 | 14,562 |
| training time (s) | 399.3 | 527.7 | 787.5 | 288.5 | 402.8 | 603.4 | 270.7 | 359.7 | 539.9 | 250.3 | 334.3 | 500.6 | 236.7 | 315.7 | 472.3 |
| test time (s) | 1.3 | 1.9 | 2.8 | 1.6 | 1.4 | 2.1 | 1.0 | 1.2 | 1.9 | 0.9 | 1.2 | 1.8 | 0.9 | 1.2 | 1.7 |
| test accuracy (%) | 56.7 | 77.5 |
| 51.9 | 45.2 | 42.3 | 50.7 | 68.6 | 57.3 | 56.9 | 51.3 | 48.1 | 59.5 | 67.5 | 52.4 |
Experimental results for PCA data using 50 principal components, best result is displayed in bold.
| window samples | 128 | 256 | 384 | 512 | 640 | ||||||||||
| window duration (s) | 1 | 2 | 3 | 4 | 5 | ||||||||||
| window overlap (%) | 0 | 25 | 50 | 0 | 25 | 50 | 0 | 25 | 50 | 0 | 25 | 50 | 0 | 25 | 50 |
| input features | 36,493 | 48,642 | 72,961 | 18,240 | 24,310 | 36,468 | 12,157 | 16,198 | 24,294 | 9115 | 12,148 | 18,214 | 7290 | 9711 | 14,562 |
| training time (s) | 267.5 | 354.4 | 531.1 | 124.8 | 179.3 | 279.5 | 91.7 | 120.3 | 179.4 | 69.3 | 90.9 | 135.1 | 55.6 | 73.5 | 109.2 |
| test time (s) | 0.5 | 1.2 | 1.7 | 1.0 | 0.3 | 0.7 | 0.2 | 0.2 | 0.4 | 0.2 | 0.2 | 0.6 | 0.1 | 0.2 | 0.2 |
| test accuracy (%) | 62.4 | 73.6 | 87.0 | 92.8 | 96.0 |
| 96.9 | 90.5 | 90.1 | 94.6 | 88.4 | 95.6 | 93.4 | 96.3 | 93.2 |
Figure 7Accuracy and loss progress for validation data with respect to training epochs.
Figure 8Confusion matrix for a sample case.
Figure 9Example of data corrupted with a hole (red).
Test accuracy for corrupted signals with different filtering techniques (p is the corruption probability, data loss represent the average number of missing samples).
| Data Loss (%) | No Filter (%) | MSPCA (%) | RPCA (%) | |
|---|---|---|---|---|
| 1 | 2 | 89.0 | 90.8 | 94.5 |
| 2 | 4 | 91.1 | 92.6 | 94.6 |
| 5 | 10 | 80.0 | 81.6 | 85.7 |
| 10 | 20 | 76.9 | 72.1 | 81.5 |
Figure 10Test accuracy for corrupted signals with different filtering techniques (p is the corruption probability).