| Literature DB >> 31156496 |
Najoung Kim1, Jung-Ho Kim1, Maria K Wolters2, Sarah E MacPherson3, Jong C Park1.
Abstract
In neuropsychological assessment, semantic fluency is a widely accepted measure of executive function and access to semantic memory. While fluency scores are typically reported as the number of unique words produced, several alternative manual scoring methods have been proposed that provide additional insights into performance, such as clusters of semantically related items. Many automatic scoring methods yield metrics that are difficult to relate to the theories behind manual scoring methods, and most require manually-curated linguistic ontologies or large corpus infrastructure. In this paper, we propose a novel automatic scoring method based on Wikipedia, Backlink-VSM, which is easily adaptable to any of the 61 languages with more than 100k Wikipedia entries, can account for cultural differences in semantic relatedness, and covers a wide range of item categories. Our Backlink-VSM method combines relational knowledge as represented by links between Wikipedia entries (Backlink model) with a semantic proximity metric derived from distributional representations (vector space model; VSM). Backlink-VSM yields measures that approximate manual clustering and switching analyses, providing a straightforward link to the substantial literature that uses these metrics. We illustrate our approach with examples from two languages (English and Korean), and two commonly used categories of items (animals and fruits). For both Korean and English, we show that the measures generated by our automatic scoring procedure correlate well with manual annotations. We also successfully replicate findings that older adults produce significantly fewer switches compared to younger adults. Furthermore, our automatic scoring procedure outperforms the manual scoring method and a WordNet-based model in separating younger and older participants measured by binary classification accuracy for both English and Korean datasets. Our method also generalizes to a different category (fruit), demonstrating its adaptability.Entities:
Keywords: category fluency test; executive function; relation extraction; semantic fluency; semantic memory; verbal fluency; word embeddings
Year: 2019 PMID: 31156496 PMCID: PMC6532534 DOI: 10.3389/fpsyg.2019.01020
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Figure 1Example of cosine similarity between adjacent words in a semantic fluency sequence.
Correlation between switch counts and cluster sizes determined by manual scoring and by the selected best settings for automatic scoring algorithms.
| English | WordNet | 0.561 | 0.402 | 0.454 | 0.113 (4.871) |
| VSM | 0.657 | 0.568 | 0.355 | 0.221 | |
| Backlink | 0.618 | 0.242 | 0.149 (1.949) | 0.261 | |
| Korean | WordNet | 0.535 | 0.425 | 0.323 | 0.196 |
| VSM | 0.572 | 0.141 (2.000) | 0.140 (1.205) | 0.258 | |
| VSM (KDiff) | 0.525 | 0.105 (2.026) | 0.027 (1.262) | 0.219 | |
| Backlink | 0.554 | −0.0153 (3.488) | 0.072 (2.871) | 0.031 (6.990) |
Values inside parentheses denote the average of the actual values
(p < 0.05,
p < 0.01, and
p < 0.001).
Figure 2Example of a backlink relation.
Figure 3Age and gender distributions of all English and Korean participants (Blue and red bars represent male and female, respectively. The data labels for 1 are omitted). (A) Age and gender distribution of English participants. (B) Age and gender distribution of Korean participants.
Age and gender distributions of younger and older group English and Korean participants.
| Younger | 13 | 26 | 28 | 16 |
| (20–29) | ||||
| Older | 24 | 48 | 6 | 5 |
| (50+) | ||||
Inter-annotator agreements for manual analysis.
| English | 0.962 | 0.895 | 0.736 | 0.888 |
| Korean | 0.972 | 0.866 | 0.638 | 0.937 |
Correlation (Spearman's ρ) between switch counts determined by VSM and manual scoring.
| English | all | 0.657 | 0.342 | −0.054 (median) |
| 0.390 | −0.010 (75th) | |||
| younger | 0.750 | 0.285 | −0.005 (median) | |
| 0.333 | −0.046 (75th) | |||
| older | 0.562 | 0.305 | −0.236 | |
| 0.434 | −0.275 | |||
| Korean | all | 0.572 | 0.565 | 0.173 (median) |
| 0.373 | 0.499 | |||
| younger | 0.598 | 0.604 | 0.263 (median) | |
| 0.482 | 0.587 | |||
| older | −0.012 (median) | 0.535 | −0.163 (median) | |
| 0.021 (25th) | 0.074 (75th) | |||
| Korean (KDiff) | all | 0.525 | 0.518 | 0.144 (median) |
| 0.327 | 0.429 | |||
| younger | 0.560 | 0.544 | 0.224 (median) | |
| 0.417 | 0.499 | |||
| older | −0.026 (median) | 0.664 | −0.141 (median) | |
| 0.129 (25th) | −0.069 (75th) |
(p < 0.05,
p < 0.01,
p < 0.001).
Average switch counts for each scoring metric according to age groups.
| English | all | 9.786 | 4.974 | 9.949 | 5.658 | 0.983 | 0.308 | 12.427 |
| younger | 10.949 | 5.256 | 10.513 | 6.051 | 0.846 | 0.385 | 13.564 | |
| older | 9.097 | 4.861 | 9.639 | 5.431 | 1.111 | 0.292 | 11.732 | |
| Korean | all | 10.657 | 5.152 | 10.381 | 10.657 | 10.552 | 4.219 | 7.543 |
| younger | 10.944 | 5.074 | 10.519 | 10.889 | 10.667 | 4.185 | 8.056 | |
| older | 9.927 | 6.091 | 10.000 | 9.636 | 10.000 | 5.273 | 5.000 | |
| Korean (KDiff) | all | 10.657 | 5.181 | 10.381 | 10.438 | 9.533 | 3.695 | 7.543 |
| younger | 10.944 | 5.204 | 10.278 | 11.093 | 9.519 | 3.741 | 8.056 | |
| older | 9.927 | 6.091 | 9.909 | 9.273 | 9.455 | 4.818 | 5.000 | |
Korean and Korean (KDiff) only differ by their VSM switch counts.
Correlation between age and switch counts produced by different models.
| English | Manual | −0.318 |
| WordNet | −0.353 | |
| VSM | −0.275 | |
| Backlink | −0.354 | |
| Korean | Manual | −0.119 |
| WordNet | 0.063 | |
| VSM | −0.104 | |
| VSM (KDiff) | −0.023 | |
| Backlink | −0.186 ( |
p < 0.05,
p < 0.01, and
p < 0.001.
Influence of gender and education on manual semantic fluency scores.
| English | Gender-Manual | |
| Gender-Backlink switch count | ||
| Gender-Backlink mean cluster size | ||
| Gender-Backlink median cluster size | ||
| Gender-Backlink max cluster size | ||
| Gender-VSM switch count | ||
| Gender-VSM mean cluster size | ||
| Gender-VSM median cluster size | ||
| Gender-VSM max cluster size | ||
| Korean | Gender-Manual | |
| Gender-Backlink switch count | ||
| Gender-Backlink mean cluster size | ||
| Gender-Backlink median cluster size | ||
| Gender-Backlink max cluster size | ||
| Gender-VSM switch count | ||
| Gender-VSM mean cluster size | ||
| Gender-VSM median cluster size | ||
| Gender-VSM max cluster size | ||
| Gender-VSM (KDiff) switch count | ||
| Gender-VSM (KDiff) mean cluster size | ||
| Gender-VSM (KDiff) median cluster size | ||
| Gender-VSM (KDiff) max cluster size | ||
| Education-Manual | ρ = −0.041 | |
| Education-Backlink switch count | ρ = 0.079 | |
| Education-Backlink mean cluster size | ρ = −0.103 | |
| Education-Backlink median cluster size | ρ = −0.121 | |
| Education-Backlink max cluster size | ρ = 0.056 | |
| Education-VSM switch count | ρ = 0.107 | |
| Education-VSM mean cluster size | ρ = −0.069 | |
| Education-VSM median cluster size | ρ = −0.118 | |
| Education-VSM max cluster size | ρ = 0.034 | |
| Education-VSM (KDiff) switch count | ρ = 0.147 | |
| Education-VSM (KDiff) mean cluster size | ρ = −0.105 | |
| Education-VSM (KDiff) median cluster size | ρ = −0.010 | |
| Education-VSM (KDiff) max cluster size | ρ = 0.002 |
p < 0.05,
p < 0.01, and
p < 0.001.
List of proposed predictors from the VSM and the Backlink model.
| Backlink | Switch count |
| Mean cluster size | |
| Median cluster size | |
| Max cluster size | |
| Vector Space | Switch count |
| Mean cluster size | |
| Median cluster size | |
| Max cluster size |
Linear regression using features from the VSM and the Backlink model as predictors of age.
| English (Backlink + VSM only) | Backlink switch count | −2.684 |
| Backlink mean cluster size | −21.508 | |
| Backlink median cluster size | −10.051 | |
| Backlink max cluster size | 2.289 | |
| VSM median cluster size | 8.357 | |
| VSM max cluster size | 1.706 | |
| English (Backlink + VSM + WC & Rep) | Backlink switch count | −7.287 |
| Backlink mean cluster size | −35.885 | |
| Backlink median cluster size | −5.935 | |
| Backlink max cluster size | −0.908 | |
| VSM median cluster size | 8.038 | |
| VSM max cluster size | 1.205 | |
| Unique word count | 3.179 | |
| Repetition | 10.058 | |
| Korean (Backlink + VSM only) | Backlink switch count | −0.471 |
| Backlink max cluster size | 0.640 | |
| VSM switch count | −0.241 | |
| VSM mean cluster size | 3.306 | |
| VSM median cluster size | −5.159 | |
| VSM max cluster size | −0.545 | |
| Korean (Backlink + VSM + WC & Rep) | Backlink switch count | −0.431 |
| Backlink max cluster size | 0.279 | |
| VSM switch count | −0.714 | |
| VSM mean cluster size | −2.176 | |
| VSM median cluster size | −2.201 | |
| VSM max cluster size | −0.284 | |
| Unique word count | 0.118 | |
| Repetition | 5.224 | |
| Korean (KDiff) (Backlink + VSM only) | Backlink switch count | −0.768 |
| Backlink max cluster size | 0.348 | |
| VSM switch count | 0.032 | |
| VSM mean cluster size | 5.648 | |
| VSM median cluster size | −7.451 | |
| VSM max cluster size | −0.642 | |
| Korean (KDiff) (Backlink + VSM + WC & Rep) | Backlink switch count | −0.491 |
| Backlink max cluster size | 0.207 | |
| VSM switch count | 0.028 | |
| VSM mean cluster size | 2.546 | |
| VSM median cluster size | −4.466 | |
| VSM max cluster size | −0.241 | |
| Unique word count | −0.278 | |
| Repetition | 4.449 |
p < 0.05,
p < 0.01, and
p < 0.001.
Unique word counts for English and Korean data grouped by age.
| English | all | 20.69 (4.69) | [11, 35] |
| younger | 21.71 (4.87) | [12, 30] | |
| older | 20.08 (4.56) | [11, 35] | |
| Korean | all | 21.03 (5.91) | [8, 42] |
| younger | 21.69 (6.05) | [8, 42] | |
| older | 16.55 (4.01) | [11, 24] | |
Performance of the integrated model for the younger-older group distinction using animal fluency.
| English | Manual | 15.068 | 71.1 | 71.7 | 91.7 |
| Majority class | - | 64.9 (39:72) | - | - | |
| WordNet | 7.977 | 68.5 | 71.3 | 86.1 | |
| VSM | 4.120 (4) | 65.8 | 65.5 | 100 | |
| Backlink | 9.769 | 70.3 | 72.4 | 87.5 | |
| Backlink + VSM | 12.413 | 72.1 | 74.7 | 86.1 | |
| Backlink + VSM + WC + Rep. | 25.097 | 73.9 | 72.6 | 95.8 | |
| Korean | Manual | 5.085 (4) | 84.6 | 100 | 9.1 |
| Majority class | - | 83.1 (54:11) | - | - | |
| WordNet | 13.078 | 86.2 | 62.5 | 45.5 | |
| VSM | 4.441 (3) | 86.2 | 100 | 18.2 | |
| VSM (KDiff) | 5.071 (3) | 86.2 | 100 | 18.2 | |
| Backlink | 9.583 | 84.6 | 66.7 | 18.2 | |
| Backlink + VSM | 9.784 | 87.7 | 100 | 27.3 | |
| Backlink + VSM (KDiff) | 10.560 | 87.7 | 100 | 27.3 | |
| Backlink + VSM + WC + Rep. | 26.796 | 90.8 | 85.7 | 54.5 | |
| Backlink + VSM (KDiff) + WC + Rep. | 27.102 | 90.8 | 85.7 | 54.5 |
(p < 0.05,
p < 0.01, and
p < 0.001).
Performance of the integrated model for the younger-older group distinction using fruit fluency.
| Korean | Majority class | - | 83.1 (54:11) | - | - |
| VSM | 7.121 (4) | 81.5 | 40.0 | 18.2 | |
| VSM (KDiff) | 5.832 (4) | 84.6 | 66.7 | 18.2 | |
| Backlink | 22.291 | 87.7 | 100 | 27.3 | |
| VSM + Backlink | 22.783 | 89.2 | 83.3 | 45.5 | |
| VSM (KDiff) + Backlink | 20.477 | 89.2 | 100 | 36.4 | |
| VSM + Backlink + WC + Rep. | 26.894 | 90.8 | 85.7 | 54.5 | |
| VSM (KDiff) + Backlink + WC + Rep. | 32.215 | 93.9 | 88.9 | 72.7 |
(p < 0.05,
p < 0.01, and
p < 0.001).