| Literature DB >> 35557842 |
Thi Kieu Khanh Ho1, Minhee Kim2, Younghun Jeon3, Byeong C Kim4, Jae Gwan Kim2, Kun Ho Lee5,6,7, Jong-In Song3, Jeonghwan Gwak1,8,9,10.
Abstract
The timely diagnosis of Alzheimer's disease (AD) and its prodromal stages is critically important for the patients, who manifest different neurodegenerative severity and progression risks, to take intervention and early symptomatic treatments before the brain damage is shaped. As one of the promising techniques, functional near-infrared spectroscopy (fNIRS) has been widely employed to support early-stage AD diagnosis. This study aims to validate the capability of fNIRS coupled with Deep Learning (DL) models for AD multi-class classification. First, a comprehensive experimental design, including the resting, cognitive, memory, and verbal tasks was conducted. Second, to precisely evaluate the AD progression, we thoroughly examined the change of hemodynamic responses measured in the prefrontal cortex among four subject groups and among genders. Then, we adopted a set of DL architectures on an extremely imbalanced fNIRS dataset. The results indicated that the statistical difference between subject groups did exist during memory and verbal tasks. This presented the correlation of the level of hemoglobin activation and the degree of AD severity. There was also a gender effect on the hemoglobin changes due to the functional stimulation in our study. Moreover, we demonstrated the potential of distinguished DL models, which boosted the multi-class classification performance. The highest accuracy was achieved by Convolutional Neural Network-Long Short-Term Memory (CNN-LSTM) using the original dataset of three hemoglobin types (0.909 ± 0.012 on average). Compared to conventional machine learning algorithms, DL models produced a better classification performance. These findings demonstrated the capability of DL frameworks on the imbalanced class distribution analysis and validated the great potential of fNIRS-based approaches to be further contributed to the development of AD diagnosis systems.Entities:
Keywords: Alzheimer’s disease; CNN-LSTM; deep learning – artificial neural network (DL-ANN); fNIRS; multi-class classification
Year: 2022 PMID: 35557842 PMCID: PMC9087351 DOI: 10.3389/fnagi.2022.810125
Source DB: PubMed Journal: Front Aging Neurosci ISSN: 1663-4365 Impact factor: 5.702
Subject demographics.
| HC | aAD | pAD | ADD | |
| Number | 53 | 28 | 50 | 9 |
| Age ± SD (years) | 72.7 ± 5.3 | 74.5 ± 4.3 | 75.8 ± 3.9 | 75.4 ± 6.8 |
| Gender (M/F) | 21/32 | 15/13 | 31/17 | 4/5 |
| MMSE ± SD | 27.0 ± 4.2 | 26.9 ± 2.5 | 26.0 ± 3.2 | 20.2 ± 4.8 |
| Education ± SD | 9.8 ± 4.7 | 10.2 ± 5.2 | 10.6 ± 5.2 | 8.5 ± 5.3 |
FIGURE 1fNIRS device setup (A) and experimental protocol (B) (s, second; P, phonemic; S, semantic).
FIGURE 2(A) A long short-term memory (LSTM) cell internal mechanism; (B) the stacked LSTM architecture.
FIGURE 3(A) A gated recurrent unit (GRU) cell internal mechanism; (B) the stacked GRU architecture.
FIGURE 4Box charts of oxyhemoglobin (HbO) concentrations acquired by four subject groups under four conditions.
FIGURE 5Box charts of HbO concentrations acquired by Men (M) and Women (W) of four subject groups under four conditions.
Comparison of classification metric results as shown by four proposed deep learning (DL) models using original data.
| Class | Accuracy | Precision | Recall | F1-Score | ||||||||
| HbO | HbR | THb | HbO | HbR | THb | HbO | HbR | THb | HbO | HbR | THb | |
|
| ||||||||||||
|
| ||||||||||||
| HC | 0.863 | 0.853 | 0.866 | 0.884 | 0.862 | 0.870 | 0.863 | 0.853 | 0.866 | 0.848 | 0.852 | 0.862 |
| aAD | 0.826 | 0.817 | 0.779 | 0.847 | 0.879 | 0.819 | 0.826 | 0.817 | 0.779 | 0.806 | 0.810 | 0.774 |
| pAD | 0.854 | 0.861 | 0.876 | 0.878 | 0.842 | 0.912 | 0.854 | 0.861 | 0.876 | 0.849 | 0.860 | 0.875 |
| ADD | 0.774 | 0.731 | 0.786 | 0.778 | 0.864 | 0.772 | 0.754 | 0.711 | 0.736 | 0.749 | 0.710 | 0.735 |
| Mean | 0.829 | 0.815 | 0.827 | 0.847 | 0.862 | 0.843 | 0.824 | 0.810 | 0.814 | 0.813 | 0.808 | 0.811 |
|
| ||||||||||||
|
| ||||||||||||
|
| ||||||||||||
| HC | 0.897 | 0.855 | 0.875 | 0.913 | 0.835 | 0.903 | 0.897 | 0.855 | 0.875 | 0.893 | 0.850 | 0.874 |
| aAD | 0.830 | 0.836 | 0.848 | 0.904 | 0.907 | 0.927 | 0.830 | 0.836 | 0.848 | 0.837 | 0.835 | 0.836 |
| pAD | 0.867 | 0.863 | 0.879 | 0.887 | 0.900 | 0.925 | 0.867 | 0.863 | 0.879 | 0.858 | 0.874 | 0.872 |
| ADD | 0.781 | 0.761 | 0.795 | 0.764 | 0.762 | 0.747 | 0.751 | 0.731 | 0.735 | 0.751 | 0.721 | 0.732 |
| Mean | 0.844 | 0.829 | 0.849 | 0.867 | 0.851 | 0.875 | 0.836 | 0.821 | 0.834 | 0.835 | 0.820 | 0.828 |
|
| ||||||||||||
|
| ||||||||||||
|
| ||||||||||||
| HC | 0.816 | 0.765 | 0.771 | 0.839 | 0.850 | 0.810 | 0.816 | 0.765 | 0.771 | 0.812 | 0.760 | 0.774 |
| aAD | 0.792 | 0.727 | 0.741 | 0.811 | 0.808 | 0.785 | 0.792 | 0.727 | 0.741 | 0.770 | 0.719 | 0.739 |
| pAD | 0.811 | 0.779 | 0.805 | 0.837 | 0.868 | 0.849 | 0.811 | 0.779 | 0.805 | 0.808 | 0.778 | 0.805 |
| ADD | 0.705 | 0.695 | 0.705 | 0.721 | 0.764 | 0.736 | 0.705 | 0.695 | 0.705 | 0.706 | 0.697 | 0.708 |
| Mean | 0.781 | 0.742 | 0.756 | 0.802 | 0.823 | 0.795 | 0.781 | 0.742 | 0.756 | 0.774 | 0.738 | 0.756 |
|
| ||||||||||||
|
| ||||||||||||
|
| ||||||||||||
| HC | 0.918 | 0.907 | 0.917 | 0.938 | 0.905 | 0.918 | 0.918 | 0.907 | 0.917 | 0.898 | 0.902 | 0.913 |
| aAD | 0.879 | 0.855 | 0.889 | 0.865 | 0.872 | 0.917 | 0.839 | 0.855 | 0.859 | 0.835 | 0.853 | 0.864 |
| pAD | 0.880 | 0.893 | 0.907 | 0.902 | 0.890 | 0.910 | 0.880 | 0.893 | 0.907 | 0.876 | 0.894 | 0.907 |
| ADD | 0.797 | 0.799 | 0.793 | 0.781 | 0.839 | 0.771 | 0.767 | 0.799 | 0.753 | 0.767 | 0.806 | 0.746 |
| Mean | 0.868 | 0.864 | 0.877 | 0.871 | 0.877 | 0.879 | 0.851 | 0.864 | 0.859 | 0.844 | 0.864 | 0.858 |
FIGURE 6Summarized history of Cognitive Neural Network-Long Short-Term Memory (CNN-LSTM) model’s accuracies (A) and losses (B) and conclusive confusion matrix (C) of the fivefold cross validation. In the accuracy and loss curves, the solid lines indicate the mean and the shadow areas represent the ranges over the fivefolds.
FIGURE 7Classification comparisons of accuracy between eight machine learning (ML) classifiers and four DL models using original HB datasets.