| Literature DB >> 31401078 |
Kyriaki Neophytou1, Robert W Wiley2, Brenda Rapp3, Kyrana Tsapkini4.
Abstract
Currently, variant subtyping in primary progressive aphasia (PPA) requires an expert neurologist and extensive language and cognitive testing. Spelling impairments appear early in the development of the disorder, and the three PPA variants (non-fluent - nfvPPA; semantic - svPPA; logopenic - lvPPA) reportedly show fairly distinct spelling profiles. Given the theoretical and empirical evidence indicating that spelling may serve as a proxy for spoken language, the current study aimed to determine whether spelling performance alone, when evaluated with advanced statistical analyses, allows for accurate PPA variant classification. A spelling to dictation task (with real words and pseudowords) was administered to 33 PPA individuals: 17 lvPPA, 10 nfvPPA, 6 svPPA. Using machine learning classification algorithms, we obtained pairwise variant classification accuracies that ranged between 67 and 100%. In additional analyses that assumed no prior knowledge of each case's variant, classification accuracies ranged between 59 and 70%. To our knowledge, this is the first time that all the PPA variants, including the most challenging logopenic variant, have been classified with such high accuracy when using information from a single language task. These results underscore the rich structure of the spelling process and support the use of a spelling task in PPA variant classification. Published by Elsevier Ltd.Entities:
Keywords: Language; Logopenic variant; Non-fluent variant; Primary progressive aphasia; Semantic variant; Spelling; Variant classification
Mesh:
Year: 2019 PMID: 31401078 PMCID: PMC6817413 DOI: 10.1016/j.neuropsychologia.2019.107157
Source DB: PubMed Journal: Neuropsychologia ISSN: 0028-3932 Impact factor: 3.139
Fig. 1.The cognitive architecture of spelling.
Description of the variables evaluated in Step 1 of the analysis that served to obtain beta coefficient values for each variable for each participant on the basis of single-participant LMEM fitting.
| Variable | Description |
|---|---|
|
| The lexicality status of the letter string, i.e., whether it was a real word or a pseudoword (coded as 1 for words and 0 for pseudowords). |
|
| The length of each letter string in terms of the number of letters, ranging from 3 to 8 letters. |
|
| The position of a letter in a word (starting from the left), scaled to the length of the word. Since the variable is scaled with the range limited from 0 to 1, the first letter of a word will always have a value of 0, the final letter a value of 1, and the letters falling in between will have values that will sum up to 1. For instance, the four letters of a 4-letter word were coded as follows: letter 1 = 0; letter 2 = 0.33; letter 3 = 0.66, letter 4 = 1. This variable quantifies if there was an increase or a decrease in performance across letter positions. |
|
| Position-quadratic is the quadratic form of
the linear variable Position. To calculate the values for this variable,
the linear position values were transformed into quadratic values. For
instance, the four letters of a 4-letter word were coded as follows:
letter 1 = (02) = 0; letter 2 = (0.332) = .11;
letter 3 = (0.662) = 0.44, letter 4 = (12) = 1. By
performing a regression with a quadratic term (i.e., a non-linear term),
we investigate the possibility of the relation between position and
accuracy being non-linear. Since an equation with a quadratic term gives
a parabola, we can see whether performance on the two ends of a word is
better than the middle positions (i.e. an upward parabola), as has been
reported for some individuals (e.g., |
|
| The imageability rating (ranging from 174 to
360) of each word from the MRC database ( |
|
| The written frequency of each word (ranging
from 0.72 to 2614.04) from the CELEX database ( |
|
| The probability of the graphemic
representation for each phoneme in each real word. Values ranged from
0.01 to 96.47 and were extracted from a digitized version of the |
|
| The number of words that differed from a
target word by changing one letter while preserving the identity and
positions of the rest (e.g., |
|
| The number of words that differ from a target
word by changing one |
Mean, standard deviation and ranges of coefficient values from the LMEM's for the nine variables of interest per variant. Underlined variables correspond to the variables subsequently selected in Step 2 and reported in Table 4. For lexicality, positive coefficients indicate better performance for words, as opposed to pseudowords. For the continuous variables (i.e., length, position, imageability, frequency, PG probability, Orthographic N., Phonological N.) positive coefficients indicate better performance for higher values on those variables. For position-quadratic, positive coefficients indicate better performance on the two ends of a word compared to the middle positions.
| Variant/Variable |
|
|
| Position- |
| Frequency | PG Probability | Orthographic N. | Phonological N. | |
|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
| 1.29 (1.04) |
| 0.37 (0.38) | 0.43 (0.29) | 0.64 (0.57) | 0.11 (0.94) |
|
|
|
|
| −0.2 |
| −0.12 | 0.00 | −0.07 | −1.65 | |
|
|
|
|
| 3.23 |
| 1.26 | 1.10 | 2.08 | 2.72 | |
|
|
|
|
|
| 0.97 (0.90) |
| 0.31 (0.50) | 0.56 (0.32) | 0.84 (0.94) | 0.06 (0.82) |
|
|
|
|
| −0.23 |
| −0.58 | 0.02 | −0.42 | −0.9 | |
|
|
|
|
| 2.86 |
| 1.28 | 1.03 | 2.47 | 1.77 | |
|
|
|
|
|
| 1.52 (1.18) |
| 0.54 (0.46) | 0.81 (0.19) | 0.33 (0.99) | 0.05 (1.40) |
|
|
|
|
| −0.05 |
| −0.17 | 0.51 | −1.17 | −0.79 | |
|
|
|
|
| 2.74 |
| 1.13 | 1.00 | 1.54 | 2.88 |
Mean and standard deviation values, as well as ranges of percentages for word-pseudoword grapheme accuracy and each error type for each variant group. Underlined variables correspond to the variables subsequently selected in Step 2 and reported in Table 4.
| Variant/Error Type |
| Lexical Substitution |
| Pseudoword for word | Pseudoword for pseudoword | |
|---|---|---|---|---|---|---|
|
|
|
| 0.06 (0.05) |
| 0.21 (0.21) | 0.44 (0.18) |
|
|
| 0 |
| 0.01 | 0.26 | |
|
|
| 0.16 |
| 0.71 | 0.79 | |
|
|
|
| 0.11 (0.08) |
| 0.16 (0.15) | 0.6 (0.29) |
|
| 0.45 | 0.01 |
| 0 | 0.11 | |
|
|
| 0.26 |
| 0.42 | 0.89 | |
|
|
|
| 0.04 (0.03) |
| 0.15 (0.17) | 0.31 (0.28) |
|
|
| 0 |
| 0.04 | 0.05 | |
|
|
| 0.07 |
| 0.48 | 0.68 |
Variables that differed across the three variants, based on the p-values obtained from variant pairwise ANOVAs after application of the (arbitrary) threshold of p < .05.
| Variable/Pairwise comparison | Informative variables |
|---|---|
|
| x |
|
| x |
|
| x |
|
| |
|
| x |
|
| |
|
| |
|
| |
|
| x |
|
| |
|
| x |
|
| |
|
|
Classification accuracy based on binomial leave-one-out cross classification (see text for details).
| A: nfvPPA vs svPPA (N=16) | B: lvPPA vs
svPPA | C: lvPPA vs nfvPPA (N=27) | |||
|---|---|---|---|---|---|
|
| 94% |
| 74% |
| 77% |
|
| 100% |
| 76% |
| 82% |
|
| 83% |
| 67% |
| 70% |
Fig. 2.Distribution of accuracy values across the 1000 permutations per pairwise comparison. The thin, red line indicates the accuracy value obtained by the actual data. (For interpretation of the references to colour in this figure legend, the reader is referred to the Web version of this article.)
Confusion matrix with classification accuracy from Step 4 analysis - white: accurately classified cases; light-grey: misclassified cases, dark-grey:unclassified cases.
| PREDICTED LABEL | TARGET LABEL | |||
|---|---|---|---|---|
|
|
|
| ||
|
| 70% (n=7) | 17% (n=1) | 12% (n=2) | |
|
| 0% (n=0) | 66% (n=4) | 18% (n=3) | |
|
| 30% (n=3) | 17% (n=1) | 59% (n=10) | |
|
| 0% (n=0) | 0% (n=0) | 12% (n=2) | |
Fig. 3.Distribution of classification accuracy values across the 1000 random permutations of variant label assignments, across the three variants. The thin, red line indicates the accuracy value obtained from the actual data. (For interpretation of the references to colour in this figure legend, the reader is referred to the Web version of this article.)
|
| |||
|---|---|---|---|
|
|
|
| |
|
| optional -
| core | optional -
|
|
| optional -
| core | |
|
| core | ||
|
| optional -
| optional -
| core |
|
| core | ||
|
| core | optional -
| |
|
| optional -
| ||
| Impaired object knowledge | optional -
| optional -
| optional -
|
|
| optional -
| ||
|
| optional -
| ||
| ID/ | Grammaticality of sentence
pro- | Effortful | Word | Single
Word | Naming | Repetition | Syntax
Comprehension | Object | Surface
Dyslexia/ | Phonemic | Classification |
|---|---|---|---|---|---|---|---|---|---|---|---|
| ABK | Sentence Anagrams: A: 0/5; P: 0/5; Agrammatic Speech | YES | YES | 10/30 | 7/37 | total: 16/40; A:7, P:3, SR:5, OR:1 | 15/15 | YES | NO | nfvPPA | |
|
| Sentence Anagrams: A: 5/5; P: 5/5 | YES | NO | 100% | 29/30 | 36/37 | total: 35/40; A:10, P:10, SR:10, OR:5 | 15/15 | NO | NO | nfvPPA |
|
| Sentence Anagrams: A:5/5; P: 3/5 | NO | YES | 26/30 | 31/37 | total: 29/40; A:8, P:7, SR:9, OR:5 | 15/15 | NO | NO | lvPPA | |
|
| Sentence Anagrams: A: 5/5; P: 0/5 | 59% | 3/30 | 12/37 | total: 27/40; A:6, P:8, SR:7, OR:6 | 14/15 | lvPPA | ||||
|
| Sentence Anagrams: A: 4/5; P: 0/5; Agrammatic Speech | YES | YES | 100% | 4/30 | 11/37 | total: 26/40; A:8, P:5, SR:9, OR:4 | 15/15 | YES | YES | nfvPPA |
|
| Sentence Anagrams: A: 5/5; P: 4/5 | NO | YES | 100% | 16/30 | 27/37 | total: 33/40; A:8, P:7, SR:9, OR:9 | 15/15 | NO | NO | lvPPA |
|
| Sentence Anagrams: A: 4/5; P: 0/5; Agrammatic Speech | YES | YES | 18/30 | 31/37 | total: 25/40; A:8, P:5, SR:6, OR:6 | 15/15 | NO | NO | nfvPPA | |
|
| Sentence Anagrams: A: 0/5; P: 0/5 | NO | YES | 25% | 0/30 | 34/37 | total: 24/40; A:8, P:2, SR:7, OR:5 | 10/15 | YES | NO | svPPA |
|
| Sentence Anagrams: A: 5/5; P: 5/5 | NO | YES |
| 1/30 | 35/37 | total: 39/40; A:9, P:10, SR:10, OR:10 | 12/15 | NO | NO | svPPA |
|
| Sentence Anagrams:A: 0/5; P: 0/5 | YES | YES | 24/30 | 33/37 | total: 21/40; A:9, P:5, SR:5, OR:2 | 15/15 | NO | YES | lvPPA | |
|
| Sentence Anagrams: A: 5/5; P: 2/5 | NO | YES | 92% | 22/30 | 23/37 | total: 31/40; A:8, P:9, SR:6, OR:8 | 14/15 | NO | Yes | lvPPA |
|
| Sentence Anagrams: | YES | YES | 25/30 | 28/37 | total: 31/40; A:10, P:8, SR:9, OR:4 | 15/15 | YES | NO | nfvPPA | |
|
| Sentence Anagrams: 0/10; Agrammatic Speech | YES | NO | 23/30 | 23/37 | total: 20/40; A:7, P:6, SR:3, OR:4 | 12/15 | YES | NO | nfvPPA | |
|
| Sentence Anagrams: A: 5/5; P: 4/5 | NO | YES | 95% | 28/30 | 26/37 | total: 32/40; A:10, P:9, SR:10, OR:3 | 15/15 | YES | NO | lvPPA |
|
| Sentence Anagrams: A: 2/5; P: 0/5 | YES | YES | (word comprehension was intact, based on physician's notes) | 14/30 | 30/37 | total: 32/40; A:9, P:9, SR:7, OR:7 | 13/15 | NO | NO | Mixed |
|
| Sentence Anagrams:A: 2/5; P: 0/5 | NO | YES | 63% | 1/30 | 10/37 | total: 19/40; A:6, P:2, SR:7, OR:4 | 15/15 | YES | YES | lvPPA |
|
| Sentence Anagrams: A: 5/5; P: 2/5; Agrammatic Speech | YES | NO | 100% | 27/30 | 35/37 | total: 33/40; A:9, P:10, SR:9, OR:5 | 15/15 | NO | NO | nfvPPA |
|
| Sentence Anagrams: A: 4/5; P: 0/5 | NO | 37% | 1/30 | 0/37 | total: 17/40; A:3, P:4, SR:6, OR:4 | 15/15 | YES | lvPPA | ||
|
| Sentence Anagrams: A: 5/5; P: 4/5 | NO | YES | 24/30 | 36/37 | total: 35/40; A:10,P:10, SR:10, OR:5 | 15/15 | NO | YES | Unclassified | |
|
| Sentence Anagrams: A: 5/5; P: 0/5; Agrammatic Speech | NO | YES | 16/30 | 24/37 | total: 22/40; A:9, P:5, SR:6, OR:2 | 15/15 | YES | NO | lvPPA | |
| JEE | Sentence Anagrams: A: 5/5; P: 0/5 | NO | YES | 0/30 | 30/37 | total: 23/40; A:7, P:6, SR:5, OR:5 | 13/15 | YES | NO | Mixed | |
|
| Sentence Anagrams: A: 5/5; P: 5/5 | NO | NO | 25/30 | 35/37 | total: 40/40; A:10, P:10, SR:10, OR:10 | 15/15 | NO | NO | Unclassified | |
|
| Sentence Anagrams: A: 3/5; P: 1/5 | 35% | 2/30 | 33/37 | total: 20/40; A:8, P:4, SR:3, OR:5 | 13/15 | YES | svPPA | |||
|
| Sentence Anagrams: A: 5/5; P: 5/5 | NO | YES | 85% | 4/30 | 37/37 | total: 37/40; A:10, P:9, SR:9, OR:9 | 15/15 | NO | NO | svPPA |
|
| Sentence Anagrams: A: 5/5; P: 3/5 | NO | YES | 95% | 12/30 | 21/37 | 15/15 | NO | lvPPA | ||
|
| Sentence Anagrams: A: 5/5; P: 4/5 | NO | YES | 97% | 24/30 | 37/37 | total: 34/40; A:10, P:10, SR:10, OR:4 | 15/15 | NO | NO | lvPPA |
|
| Sentence Anagrams: A: 2/5; P: 1/5; Agrammatic Speech | NO | YES | 95% | 28/30 | 29/37 | total: 27/40; A:7, P:7, SR:7, OR:6 | 14/15 | NO | NO | lvPPA |
|
| Sentence Anagrams: A: 5/5; P: 4/5 | NO | YES | 100% | 5/30 | 35/37 | total: 39/40; A:9, P:10, SR:10, OR:10 | 11/15 | NO | NO | Unclassified |
|
| Sentence Anagrams: A: 5/5; P: 0/5 | NO | NO | 100% | 28/30 | 35/37 | total: 18/40; A:7, P:1, SR:6, OR:4 | 15/15 | NO | NO | nfvPPA |
|
| Agrammatic Speech | YES | 80% | 2/30 | 19/37 | total: 15/40; A:4, P:3, SR:6, OR:2 | 13/15 | YES | YES | Mixed | |
|
| Sentence Anagrams:A: 1/5; P: 0/5 | NO | YES | 60% | 9/30 | 4/37 | total: 27/40; A:7, P:7, SR:5, OR:8 | 12/15 | YES | NO | Mixed |
|
| Sentence Anagrams: A: 2/5; P: 4/5 | YES | YES | 95% | 25/30 | 28/37 | total: 29/40; A:9, P:7, SR:5, OR:8 | 15/15 | NO | lvPPA | |
|
| Sentence Anagrams: A: 5/5; P: 4/5 | YES | YES | 14/30 | 29/37 | total: 34/40; A:10, P:9, SR:8, OR:7 | 15/15 | YES | NO | lvPPA | |
|
| Sentence Anagrams: A: 4/5; P: 0/5 | NO | YES | 27/30 | 32/37 | total: 21/40; A:5, P:8, SR:4, OR:4 | 8/15 | NO | NO | lvPPA | |
|
| Sentence Anagrams: A: 1/5; P: 0/5 | NO | YES | 75% | 0/30 | 33/37 | total: 27/40; A:8, P:6, SR:10, OR:3 | 11/15 | YES | NO | svPPA |
|
| Sentence Anagrams: A: 5/5; P: 0/5 | NO | YES |
| 2/30 | 31/37 | total: 25/40; A:9, P:2, SR:9, OR:5 | 14/15 | NO | NO | svPPA |
|
| Sentence Anagrams: A: 4/5; P: 0/5 | 62% | 25/30 | total: 21/40; A:7, P:3, SR:9, OR:2 | 15/15 | nfvPPA | |||||
|
| Agrammatic Speech | YES | YES | 73% | 14/30 | 15/37 | 15/15 | YES | NO | nfvPPA | |
|
| Sentence Anagrams: A: 5/5; P: 5/5 | 100% | 9/30 | 36/37 | total: 38/40; A:10, P:9, SR:10, OR:9 | 15/15 | Unclassified | ||||
|
| Sentence Anagrams: A: 5/5; P: 4/5 | NO | YES | 98% | 29/30 | 31/37 | total: 34/40; A:9, P:8, SR:10, OR:4 | 15/15 | NO | NO | lvPPA |
|
| Sentence Anagrams: A: 5/5; P: 5/5 | NO | YES | 8/30 | 32/37 | total: 35/40; A:10, P:10, SR:10, OR:5 | 15/15 | NO | NO | lvPPA | |
|
| Sentence Anagrams: A: 4/5; P: 4/5 | NO | YES | 100% | 30/30 | NO | YES | Unclassified |
Predicted variant for each pairwise comparison (L = lvPPA; NF = nfvPPA; S = svPPA, U = unclassified). Final classification is defined by the number of overlapping pairwise variant predictions: 2 overlapping predictions = accurately classified (✓) or misclassified (x); 0 overlapping predictions = unclassified (?)
| ID | Actual Variant | lvPPA vs svPPA | nfvPPA vs svPPA | lvPPA vs nfvPPA | Final Classification | Classification Accuracy |
|---|---|---|---|---|---|---|
| ABK | nfvPPA | L | NF | NF | NF | ✓ |
| BIN | nfvPPA | L | L | NF | L | x |
| BLR | lvPPA | S | L | S | S | x |
| BNR | lvPPA | L | L | NF | L | ✓ |
| CBN | nfvPPA | L | NF | NF | NF | ✓ |
| CBT | lvPPA | S | L | S | S | x |
| CKI | nfvPPA | L | NF | NF | NF | ✓ |
| DCN | svPPA | S | L | S | S | ✓ |
| DEK | svPPA | S | L | S | S | ✓ |
| DME | lvPPA | L | NF | NF | NF | x |
| DNE | lvPPA | L | L | NF | L | ✓ |
| DPD | nfvPPA | L | L | NF | L | x |
| DPZ | nfvPPA | L | NF | NF | NF | ✓ |
| DRS | lvPPA | S | L | S | S | x |
| DUE | lvPPA | L | L | NF | L | ✓ |
| ERM | nfvPPA | L | NF | NF | NF | ✓ |
| GFS | lvPPA | L | L | NF | L | ✓ |
| IJN | lvPPA | L | L | S | L | ✓ |
| JJN | svPPA | L | NF | NF | NF | x |
| JKA | svPPA | S | L | S | S | ✓ |
| JRD | lvPPA | L | L | NF | L | ✓ |
| JRE | lvPPA | L | L | S | L | ✓ |
| JSS | lvPPA | L | L | NF | L | ✓ |
| KBG | nfvPPA | L | L | NF | L | x |
| MOR | lvPPA | L | NF | NF | NF | x |
| MPI | lvPPA | L | L | NF | L | ✓ |
| MVR | lvPPA | L | NF | S | U | ? |
| NCG | svPPA | L | L | S | L | x |
| RFH | svPPA | S | L | S | S | ✓ |
| RVT | nfvPPA | L | NF | NF | NF | ✓ |
| SKR | nfvPPA | L | NF | NF | NF | ✓ |
| TBD | lvPPA | L | L | NF | L | ✓ |
| TBE | lvPPA | S | L | NF | U | ? |