Charalambos Themistocleous1, Bronte Ficek1, Kimberly Webster1, Dirk-Bart den Ouden2, Argye E Hillis1, Kyrana Tsapkini1.
Abstract
BACKGROUND: The classification of patients with primary progressive aphasia (PPA) into variants is time-consuming, costly, and requires the combined expertise of clinical neurologists, neuropsychologists, speech pathologists, and radiologists.
Keywords: Classification; machine learning; natural language processing; primary progressive aphasia
Year: 2021 PMID: 33427742 PMCID: PMC7990416 DOI: 10.3233/JAD-201101
Source DB: PubMed Journal: J Alzheimers Dis ISSN: 1387-2877 Impact factor: 4.472
Fig. 1. Process diagram of model development. The audio recordings from the picture description task are automatically analyzed acoustically and transcribed (Pipeline 1). Then, formant frequencies, the duration of vowels, tonal measures, pauses, and voice quality measurements are estimated from the processed recordings (Pipeline 2). The ratio of characters, words, characters per word and the noun-verb ratio, noun-adjective ratio, noun-adverb ratio, noun-pronoun ratio, verb-adjective ratio, verb-adverb ratio, verb-pronoun ratio, adjective-adverb ratio, adjective-pronoun ratio, and adverb-pronoun ratio are estimated from the analyzed text transcripts (Pipeline 3). The model optimization and parameter tuning are followed by model comparison and evaluation with cross-validation.
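The text measures listed for Pipeline 3 can be illustrated with a minimal sketch. It assumes the transcript has already been tagged with parts of speech; the `(word, pos)` input format, the function name `transcript_features`, the tag labels, and the handling of a zero denominator are illustrative assumptions, not details given in the caption.

```python
from collections import Counter
from itertools import combinations

# Part-of-speech classes named in the caption, in the caption's order.
POS_CLASSES = ["noun", "verb", "adjective", "adverb", "pronoun"]


def transcript_features(tagged_tokens):
    """Compute text features from (word, pos) pairs.

    The input format is hypothetical: the caption does not say which
    tagger or tag set produced the part-of-speech labels.
    """
    words = [word for word, _ in tagged_tokens]
    n_chars = sum(len(word) for word in words)
    n_words = len(words)
    counts = Counter(pos for _, pos in tagged_tokens)

    features = {
        "characters": n_chars,
        "words": n_words,
        "characters_per_word": n_chars / n_words if n_words else 0.0,
    }
    # All 10 unordered pairs of the 5 classes, e.g. "noun_verb_ratio".
    for a, b in combinations(POS_CLASSES, 2):
        denominator = counts[b]
        # Assumption: a ratio with a zero denominator is reported as None.
        features[f"{a}_{b}_ratio"] = counts[a] / denominator if denominator else None
    return features


# Hypothetical tagged transcript fragment, not data from the study.
example = [
    ("the", "determiner"), ("boy", "noun"), ("quickly", "adverb"),
    ("takes", "verb"), ("a", "determiner"), ("cookie", "noun"),
    ("and", "conjunction"), ("she", "pronoun"), ("laughs", "verb"),
]
features = transcript_features(example)
for name, value in features.items():
    print(name, value)
```

Pairing the five classes with `combinations` yields exactly the ten ratios named in the caption, in the same order.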
Demographic information of the participants for each PPA variant. For age, education, onset of the condition in years, language severity, and total severity, the mean is provided with the standard deviation in parentheses; language severity and total severity correspond to the Behavior-Comportment-Personality and Language domains of the FTLD-CDR [16].
| Variant | svPPA | lvPPA | nfvPPA |
|---|---|---|---|
| Female | 5 | 8 | 7 |
| Male | 4 | 8 | 12 |
| Total patients | 9 | 16 | 19 |
| Age | 66.59 (6.06) | 67.93 (7.55) | 69.07 (5.57) |
| Education | 16.30 (1.92) | 16.92 (2.24) | 16.42 (1.37) |
| Onset years | 6.48 (2.31) | 3.88 (3.23) | 3.49 (1.80) |
| Language severity | 2.27 (0.56) | 1.39 (0.75) | 1.77 (0.48) |
| Total severity | 7.75 (4.36) | 4.98 (2.82) | 6.04 (3.10) |
| Total words | 1265 | 2529 | 826 |
| Mean number of words | 84 (57) | 141 (133) | 75 (52) |
Fig. 2. Neural network architecture. Structure of the neural network designed for the study and feature properties, including the number of input features employed and the type and number of units and activation functions for the input, hidden, and output layers. The first layer, on top, is the input layer and consists of 350 units; the 8 layers in the middle, containing 350 units, are the hidden layers; and the final layer contains only three units, shown in different colors: activation of the green unit corresponds to the svPPA variant, of the red unit to the lvPPA variant, and of the yellow unit to the nfvPPA variant.
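The layer structure in this caption can be sketched as a plain feed-forward pass. This is only an illustration of the shapes: it assumes that each of the 8 hidden layers has 350 units and that the 350 input units correspond to a 350-value feature vector. The ReLU and softmax activations, the random initialisation, and all function names are assumptions, since the caption does not state them.

```python
import math
import random

N_FEATURES = 350          # input units, from the caption
N_HIDDEN_LAYERS = 8       # from the caption
HIDDEN_UNITS = 350        # assumed to apply to each hidden layer
VARIANTS = ["svPPA", "lvPPA", "nfvPPA"]  # one output unit per variant


def init_layer(n_in, n_out, rng):
    """Random weights and zero biases (illustrative initialisation only)."""
    scale = 1.0 / math.sqrt(n_in)
    weights = [[rng.uniform(-scale, scale) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out


def dense(x, layer):
    """Affine transform: one weighted sum plus bias per output unit."""
    weights, biases = layer
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, biases)]


def relu(values):
    """Assumed hidden-layer activation."""
    return [max(0.0, v) for v in values]


def softmax(values):
    """Assumed output activation; returns probabilities that sum to 1."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def build_network(rng):
    sizes = [N_FEATURES] + [HIDDEN_UNITS] * N_HIDDEN_LAYERS + [len(VARIANTS)]
    return [init_layer(n_in, n_out, rng) for n_in, n_out in zip(sizes, sizes[1:])]


def predict(network, x):
    for layer in network[:-1]:
        x = relu(dense(x, layer))
    probabilities = softmax(dense(x, network[-1]))
    best = max(range(len(VARIANTS)), key=lambda i: probabilities[i])
    return VARIANTS[best], probabilities


rng = random.Random(0)
network = build_network(rng)
feature_vector = [rng.random() for _ in range(N_FEATURES)]  # placeholder input
label, probabilities = predict(network, feature_vector)
print(label, [round(p, 3) for p in probabilities])
```

The network here is untrained, so the predicted label is arbitrary; the sketch only shows how a 350-value input passes through eight hidden layers to three output units, one per variant.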
Results from eight-fold cross-validation for the deep neural network (DNN), support vector machines (SVM), random forest (RF), and decision tree (DT). Shown are the mean cross-validation accuracy, the 95% confidence interval (95% CI), and the standard error (SE).
| Model | Mean | 95% CI | SE |
|---|---|---|---|
| DNN | | | |
| SVM | 45 | [31, 59] | 5 |
| RF | 58 | [43, 73] | 8 |
| DT | 57 | [38, 75] | 8 |
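The three summary statistics in this table can be derived from per-fold accuracies as in the sketch below. The use of a Student's t interval with 7 degrees of freedom is an assumption, as the source does not state how the intervals were computed, and the fold accuracies are hypothetical placeholders rather than values from the study.

```python
import math
import statistics

# Two-sided 95% critical value of Student's t with 7 degrees of freedom
# (8 folds - 1). Using a t interval is an assumption, not the stated method.
T_CRIT_DF7 = 2.365


def summarize_folds(accuracies, t_crit=T_CRIT_DF7):
    """Return mean, standard error, and confidence interval of fold scores."""
    n = len(accuracies)
    mean = statistics.fmean(accuracies)
    # Standard error of the mean: sample standard deviation / sqrt(n).
    se = statistics.stdev(accuracies) / math.sqrt(n)
    half_width = t_crit * se
    return {"mean": mean, "se": se, "ci": (mean - half_width, mean + half_width)}


# Hypothetical per-fold accuracies in percent, not values from the study.
fold_accuracies = [50.0, 60.0, 40.0, 70.0, 55.0, 65.0, 45.0, 75.0]
summary = summarize_folds(fold_accuracies)
print(f"mean={summary['mean']:.1f} se={summary['se']:.1f} "
      f"ci=[{summary['ci'][0]:.1f}, {summary['ci'][1]:.1f}]")
```

With only eight folds the interval is wide even for moderate fold-to-fold variation, which is in line with the broad intervals reported for SVM, RF, and DT.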
Normalized confusion matrices created from the output of the SVM (a), RF (b), and DT (c). The matrices show the predicted versus actual values from the evaluation.
| True class | Predicted svPPA | Predicted lvPPA | Predicted nfvPPA |
|---|---|---|---|
| (a) SVM | | | |
| svPPA | 28 | 55 | 17 |
| lvPPA | 42 | 39 | 19 |
| nfvPPA | 24 | 31 | 45 |
| (b) RF | | | |
| svPPA | 54 | 37 | 9 |
| lvPPA | 38 | 38 | 24 |
| nfvPPA | 8 | 34 | 58 |
| (c) DT | | | |
| svPPA | 50 | 47 | 3 |
| lvPPA | 35 | 35 | 30 |
| nfvPPA | 3 | 31 | 66 |
Normalized confusion matrix created from the output of the deep neural network. The confusion matrix provides the sum of scores from the 8-fold cross-validation test
| True class | Predicted svPPA | Predicted lvPPA | Predicted nfvPPA |
|---|---|---|---|
| svPPA | 64 | 30 | 6 |
| lvPPA | – | 95 | 5 |
| nfvPPA | 10 | – | 90 |
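One way to obtain a normalized confusion matrix from cross-validation output is to sum the per-fold count matrices and then convert each true-class row to percentages. The sketch below assumes row-wise normalization, which is consistent with each row of the tables summing to 100 but is not stated explicitly in the source. The per-fold counts and function names are hypothetical.

```python
VARIANTS = ["svPPA", "lvPPA", "nfvPPA"]


def sum_matrices(matrices):
    """Element-wise sum of equally sized square count matrices."""
    size = len(matrices[0])
    return [[sum(m[i][j] for m in matrices) for j in range(size)] for i in range(size)]


def normalize_rows(counts):
    """Convert each true-class row to percentages that sum to 100."""
    normalized = []
    for row in counts:
        total = sum(row)
        normalized.append([100.0 * c / total if total else 0.0 for c in row])
    return normalized


# Hypothetical per-fold counts (rows: true class, columns: predicted class).
fold_counts = [
    [[1, 1, 0], [0, 2, 0], [0, 0, 2]],
    [[2, 0, 0], [0, 1, 1], [1, 0, 2]],
]
pooled = sum_matrices(fold_counts)
percentages = normalize_rows(pooled)
for name, row in zip(VARIANTS, percentages):
    print(name, [round(p) for p in row])
```

Read this way, the diagonal of each matrix gives the percentage of patients of a variant who were assigned to that same variant.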