| Literature DB >> 33251178 |
Zeng You1,2, Runhao Zeng2, Xiaoyong Lan1, Huixia Ren1,3, Zhiyang You1,2, Xue Shi1, Shipeng Zhao1,2, Yi Guo1, Xin Jiang4, Xiping Hu2.
Abstract
Classification of Alzheimer's Disease (AD) has been becoming a hot issue along with the rapidly increasing number of patients. This task remains tremendously challenging due to the limited data and the difficulties in detecting mild cognitive impairment (MCI). Existing methods use gait [or EEG (electroencephalogram)] data only to tackle this task. Although the gait data acquisition procedure is cheap and simple, the methods relying on gait data often fail to detect the slight difference between MCI and AD. The methods that use EEG data can detect the difference more precisely, but collecting EEG data from both HC (health controls) and patients is very time-consuming. More critically, these methods often convert EEG records into the frequency domain and thus inevitably lose the spatial and temporal information, which is essential to capture the connectivity and synchronization among different brain regions. This paper proposes a cascade neural network with two steps to achieve a faster and more accurate AD classification by exploiting gait and EEG data simultaneously. In the first step, we propose attention-based spatial temporal graph convolutional networks to extract the features from the skeleton sequences (i.e., gait) captured by Kinect (a commonly used sensor) to distinguish between HC and patients. In the second step, we propose spatial temporal convolutional networks to fully exploit the spatial and temporal information of EEG data and classify the patients into MCI or AD eventually. We collect gait and EEG data from 35 cognitively health controls, 35 MCI, and 17 AD patients to evaluate our proposed method. Experimental results show that our method significantly outperforms other AD diagnosis methods (91.07 vs. 68.18%) in the three-way AD classification task (HC, MCI, and AD). Moreover, we empirically found that the lower body and right upper limb are more important for the early diagnosis of AD than other body parts. We believe this interesting finding can be helpful for clinical researches.Entities:
Keywords: Alzheimer's disease; EEG; automatic diagnosis; deep learning; gait
Mesh:
Year: 2020 PMID: 33251178 PMCID: PMC7673399 DOI: 10.3389/fpubh.2020.584387
Source DB: PubMed Journal: Front Public Health ISSN: 2296-2565
Figure 1Cascade neural network for the early diagnosis of AD. We perform key point screening on gait data to form key-point skeleton sequences. Then we use attention-based spatial temporal graph convolutional networks (AST-GCN) to extract features and classify the subject into HC or MCI/AD with features. If the subject is classified into MCI/AD, we will input the EEG data into spatial temporal convolutional networks (ST-CNN) to extract features and perform MCI vs. AD binary classification.
Figure 2(A) Spatial temporal graph of skeleton sequences. (B) The “Spatial Configuration” strategy. (C) The architecture of ST-GCN.
Figure 3The structure of hourglass attention module.
The structure of spatial temporal convolutional networks, where K and K are the size of the kernel used in the spatial convolution layer and the temporal convolution layer in a ST-CNN module, respectively.
| 0 | 3 | Batch normalization | – | – | 3 |
| 1 | 3 | ST-CNN | 1 | 4 | |
| 2 | 4 | ST-CNN | 4 | 4 | |
| 3 | 4 | ST-CNN | 1 | 16 | |
| 4 | 16 | ST-CNN | 4 | 8 | |
| 6 | 8 | Flatten | – | – | |
| Classifier | Full connection | – | – | ||
| SoftMax | – | – |
C is the number of EEG channels. T is the number of time points. N is the number of classes. In the second layer, we use depthwise separable convolutions. In the 2nd and 4th ST-CNN module, we set stride to 4 as the pooling layer. The residual mechanism is used in each ST-CNN module.
The grouping criteria for HC, MCI, AD.
| MoCA | > 30 | 18 ~ 30 | 0 ~ 17 |
| MMSE | ≥ 24 | ≥ 24 | <24 |
Figure 4The deployment diagram of Kinect V2.0 devices: (A) The deployment diagram of devices in the Neurology Department. (B) The deployment diagram of devices in the Geriatrics Department. (C) The diagram of the actual data acquisition scene.
Figure 5(A) The 25 markers on human skeleton recognized by Kinect. (B) 64 EEG electrode locations in the International 10-20 System.
Comparison with other methods.
| Handcrafted features + SVM | ✓ | 63.64 | 57.73 | 55.45 | |
| Handcrafted feature + RF | ✓ | 81.82 | 57.14 | 68.18 | |
| AST-GCN(ours) | ✓ | 93.09 | 58.41 | 68.51 | |
| standard CNN | ✓ | – | 69.66 | – | |
| EEGnet | ✓ | – | 97.85 | – | |
| ResNet 18 | ✓ | – | 97.59 | – | |
| VGG 13 | ✓ | – | 96.48 | – | |
| ST-CNN(ours) | ✓ | – | 98.63 | – | |
| cascade neural network(ours) | ✓ | ✓ | |||
Standard CNN represents the model we substitute 2D convolution layers with a kernel size of K.
Ablation study of key point filtering and hourglass attention module on gait data.
| × | × | 88.18 |
| ✓ | × | 91.97 |
| × | ✓ | 90.14 |
| ✓ | ✓ | 93.09 |
Figure 6The performance comparison of the basic model on the skeleton sequences composed of different parts: (A) The performance of the basic model on the skeleton sequences composed of the lower body, the upper body, and the whole body. (B) The performance of the basic model on the datasets of skeleton sequences composed of the whole body, the lower body + the right upper limb, and the lower body + the left upper limb.
Performance comparison of the models with different hourglass attention module locations.
| Accuracy | 88.18 | 88.76 | 88.22 | 87.97 |
The bold values indicates the best performance that method obtain in that experiment.
Comparison of the performance and inference speed with different models.
| AST-GCN (gait) | AST-GCN (gait) | 74.46 | 9.42M | 7.06 |
| AST-GCN (gait) | ST-CNN (EEG) | 4.72M(4.71M+0.01M) | 3.99 | |
The bold values indicates the best performance that method obtain in that experiment.