| Literature DB >> 35049736 |
Inês Vigo1, Luis Coelho1,2, Sara Reis1,2.
Abstract
BACKGROUND: Alzheimer's disease (AD) has paramount importance due to its rising prevalence, the impact on the patient and society, and the related healthcare costs. However, current diagnostic techniques are not designed for frequent mass screening, delaying therapeutic intervention and worsening prognoses. To be able to detect AD at an early stage, ideally at a pre-clinical stage, speech analysis emerges as a simple low-cost non-invasive procedure.Entities:
Keywords: Alzheimer’s disease (AD); classification; features; machine learning (ML); mild cognitive impairment (MCI); speech
Year: 2022 PMID: 35049736 PMCID: PMC8772820 DOI: 10.3390/bioengineering9010027
Source DB: PubMed Journal: Bioengineering (Basel) ISSN: 2306-5354
Language changes in AD (adapted from Ferris and Farlow [18] and Greta et al. [23]).
| Function | Early Stages | Moderate to Severe Stages |
|---|---|---|
| Spontaneous Speech | Fluent, grammatical | Non-fluent, echolalic |
| Paraphrastic errors | Semantics | Semantic and phonetic |
| Repetition | Intact | Very affected |
| Naming objects | Slightly affected | Very affected |
| Understanding the words | Intact | Very affected |
| Syntactical understanding | Intact | Very affected |
| Reading | Intact | Very affected |
| Writing | Very affected | |
| Semantic knowledge of words and objects | Difficulties with less used words and objects. | Very affected |
Figure 1Number of publications (within the review’s scope) by year, in absolute value.
Figure 2Flow chart of the different phases of the review.
Figure 3Flowchart of a general machine learning pipeline to process acoustic/prosodic correlates of disease. Adapted from Braga et al. [31].
List of databases, with related specifications, with Alzheimer’s patients’ speech recordings. (Table contents are sorted by language, first column, and database name, second column).
| Language | Database Name | Task | Population | Availability | Refs. | ||
|---|---|---|---|---|---|---|---|
| HC | MCI | AD | |||||
| English | DementiaBank | DF | 99 | - | 169 | Upon request | [ |
| English | Pitt Corpus | PD | 75/142 | 27/16 | 87/170 | Upon request | [ |
| English | WRAP | PD | 59/141 | 28/36 | - | Upon request | [ |
| English | - | PD | 112 | - | 98 | Undefined | [ |
| French | - | Mixed | 6/9 | 11/12 | 13/13 | Undefined | [ |
| French | - | VF, PD, SS | - | 19/25 | 12/15 | Undefined | [ |
| French | - | VF, Semantics | 5/19 | 23/24 | 8/16 | Undefined | [ |
| French | - | Reading | 16 | 16 | 16 | Undefined | [ |
| Greek | - | PD | 16/14 | - | 13/17 | Undefined | [ |
| Hungarian | BEA | SS | 13/23 | 16/32 | - | Upon request | [ |
| 25 | 25 | 25 | |||||
| Italian | - | Mixture | 48 | 48 | - | Undefined | [ |
| Mandarin | Lu Corpus | PD/SS | 4/6 | - | 6/4 | Upon request | [ |
| Mandarin | - | PD/SS | 24 | 20 | 20 | Undefined | [ |
| Portuguese | Cinderella | SS | 20 | 20 | 20 | Undefined | [ |
| Spanish | AZTITXIKI | SS | 5 | - | 5 | Undefined | [ |
| Spanish | AZTIAHORE | SS | 11/9 | - | 8/12 | Undefined | [ |
| Spanish | PGA-OREKA | VF | 26/36 | 17/21 | - | Upon request | [ |
| Mini-PGA | PD | 4/8 | - | 1/5 | |||
| Spanish | - | Reading | 30/68 | - | 14/33 | Undefined | [ |
| Swedish | Gothenburg | PD | 13/23 | 15/16 | - | Undefined | [ |
| Swedish | - | Mixed | 12/14 | 8/21 | - | Upon request | [ |
| Swedish | - | Reading | 11/19 | 12/13 | - | Undefined | [ |
| Turkish | - | SS/Interview | 31/20 | - | 18/10 | Undefined | [ |
| Turkish | - | SS/Interview | 12/15 | 17/10 | Undefined | [ | |
| Turkish | - | SS | 12/15 | - | 17/10 | Undefined | [ |
Legend: M: Males; F: Females; HC: Healthy Controls; MCI: Mild Cognitive Impairment; AD: Alzheimer’s Disease; SS: Spontaneous Speech; VF: Verbal Fluency; PD: Picture Description; PGA: Gipuzkoa Alzheimer Project; WRAP: Wisconsin Registry for Alzheimer’s Prevention.
Linguistic features that have been used for AD detection. The features are organized by type. For each feature name, the number of occurrences/usages is provided inside parenthesis.
| Feature Type | Feature Name |
|---|---|
| Occurrence frequency | Words (3); Verbs (2); Nouns, Predicates (1); Coordinate and Subordinate Phrases (2); Reduced phrases (2); Incomplete Phrases/Ideas (3); Filling words (1); Unique words (2); Revisions/Repetitions (1); Word Replacement (2) |
| Time/Duration | Total speech (3); Speech Rate (3); Speech time (2); Average of syllables (2); Pauses (4); Maximum pause (2). |
| Parts of speech ratio | Nouns/Verbs (2); Pronouns/Substantives (1); Determinants/Substantives (2); Type/Token (2); Silence/Speaking (4); Hesitation/Speaking (3). |
| Semantic density | The density of the idea (1); Efficiency of the idea (1); Density of information (2); Density of the sentences (1). |
| POS (Parts-of-Speech) | Text tags (4). |
| Complexity | The entropy of words (1); Honore’s Statistics (1). |
| Lexical Variation | Variation: nominal (2), adjective (1), modifier (1), adverb (1), verbal (1), word (1); Brunet’s Index (1). |
Acoustic features that have been used for AD detection. The features are organized by type. For each feature name, the number of occurrences/usages is provided inside parenthesis.
| Feature Type | Feature Name |
|---|---|
| Hesitations | Filled Pauses (2); Silent Pauses (4); Long Pauses (3); Short Pauses (3); Voice Breaks (5). |
| Voice Segments | Period (4); Average duration (4); Accentuation (2). |
| Frequency | Fundamental frequency (8); Short term energy (7); Spectral centroid (1); Autocorrelation (2); Variation of voice frequencies (2). |
| Regularity | Jitter (11); Shimmer (11); Intensity (6); Square Energy Operator (1); Teager-Kaiser Energy Operator (1); Root Mean Square Amplitude (2). |
| Noise | Harmonic-Noise ratio (3); Noise-Harmonic ratio (2). |
| Phonetics | Articulation dynamics (1); the rate of articulation (1); Pause rate (5). |
| Intensity | From the voice segments (1); From the pause segments (1); |
| Timbre | Formant’s Structure (6); Formant’s Frequency (8). |
Most significantly used classification models.
| Model | Characterization | References | |
|---|---|---|---|
| NB | Consists of a network, composed of a main node with other associated descending nodes that follow Bayes’ theorem [ | [ | |
| SVM | Consists of building the hyperplane with maximum margin capable of optimally separating two classes of a data set [ | [ | |
| RF | Relies on the creation of a large number of uncorrelated decision trees based on the average random selection of predictor variables [ | [ | |
| DT | Consists of building a decision tree where each node in the tree specifies a test on an attribute, each branch descending from that node corresponds to one of the possible values for that attribute, and each leaf represents class labels associated with the instance. The instances of the training set are classified following the path from the root to a leaf, according to the result of the tests along the path [ | [ | |
| KNN | Based on the memory principle in the sense that it stores all cases and classifies new cases based on similar measures [ | [ | |
| LR | A model capable of finding an equation that predicts an outcome for a binary variable from one or more response variables [ | [ | |
| LDA | It is a discriminatory approach based on the differences between samples of certain groups. Unsupervised learning technique where the objective is to maximize the relationship between the variance between groups and the variance within the same group [ | [ | |
| ANN | DNN | Naturally inspired models. Supervised learning approach based on a theory of association (pattern recognition) between cognitive elements [ | [ |
| CNN | |||
| RNN | |||
| MLP | |||
NB: Naive Bayes; RF: Random Forest; LDA: Linear Discriminant Analysis; SVM: Support Vector Machine; DT: Decision Trees; ANN: Artificial Neural Networks; RNN: Recurrent Neural Network; CNN: Convolutional Neural Networks; MLP: Multilayer Perceptron; KNN: k-Nearest Neighbors; DNN: Deep Neural Networks; LR: Logistic Regression.
Figure 4Prevalence of classification models.
Evaluation models for classification models.
| Model | Method | Reference |
|---|---|---|
| Cross Validation | k-Fold | [ |
| Leave-pair-out | [ | |
| Leave-one-out | [ | |
| Split Evaluation | 90–10% | [ |
| 80–20% | [ | |
| Random Sub-Sampling | - | [ |
Figure 5Mean accuracy by classification model.