| Literature DB >> 35223052 |
Wei Wu1, Paul Hoffman1.
Abstract
Recent studies suggest that knowledge representations and control processes are the two key components underpinning semantic cognition, and are also crucial indicators of the shifting cognitive architecture of semantics in later life. Although there are many standardized assessments that provide measures of the quantity of semantic knowledge participants possess, normative data for tasks that probe semantic control processes are not yet available. Here, we present normative data from more than 200 young and older participants on a large set of stimuli in two semantic tasks, which probe controlled semantic processing (feature-matching task) and semantic knowledge (synonym judgement task). We verify the validity of our norms by replicating established age- and psycholinguistic-property-related effects on semantic cognition. Specifically, we find that older people have more detailed semantic knowledge than young people but have less effective semantic control processes. We also obtain expected effects of word frequency and inter-item competition on performance. Parametrically varied difficulty levels are defined for half of the stimuli based on participants' behavioural performance, allowing future studies to produce customized sets of experimental stimuli based on our norms. We provide all stimuli, data and code used for analysis, in the hope that they are useful to other researchers.Entities:
Keywords: cognitive ageing; executive function; knowledge; semantic cognition
Year: 2022 PMID: 35223052 PMCID: PMC8847894 DOI: 10.1098/rsos.211056
Source DB: PubMed Journal: R Soc Open Sci ISSN: 2054-5703 Impact factor: 2.963
Figure 1Example items from different tasks in the experiment. (a) Test of semantic knowledge, (b) test of non-semantic cognitive control and (c) test of semantic control.
Spelling rules for stimuli in the string-matching task. Note: there are 32 items under each spelling rule. (T) indicates the target in the item.
| rule | description | example | |
|---|---|---|---|
| 1 | Probe has five letters; target has five letters, four of the letters are from the probe (in the same order as the probe); distractor has five letters that are different from the probe. | xgbfj | |
| xgqfj(T) | oucnz | ||
| 2 | Probe has five letters; target has five letters, four of the letters are from the probe (in the same order as the probe); distractor has five letters, two of the letters are from the probe (in the same order as the probe). | gxrcm | |
| gvreo | gxrym(T) | ||
| 3 | Probe has five letters; target has five letters, three of the letters are from the probe (in the same order as the probe); distractor has five letters, two of the letters are from the probe (in the same order as the probe). | qlezj | |
| qymzj(T) | qbesp | ||
| 4 | Probe has five letters; target has five letters, three of the letters are from the probe (in the same order as the probe); distractor has five letters, two of the letters are from the probe (in the same order as the probe), one of the letters is from the probe but locates at a different position. | ivazl | |
| ipavb | ivuzm(T) | ||
| 5 | Probe has six letters; target has six letters, four of the letters are from the probe (in the same order as the probe); distractor has six letters, three of the letters are from the probe (in the same order as the probe), one of the letters is from the probe but locates at a different position. | btnfuk | |
| btnyur(T) | btnuic | ||
Mean psycholinguistic properties of task items and participants' performance. Note: word frequencies (on the log-transformed Zipf scale) were obtained from the SUBTLEX-UK database [32], and the lowest frequency in each word pair/triad was used as a measure of word frequency for this task item. Concreteness values were obtained from Brysbaert et al. [39] and the word-average concreteness was used as a measure of concreteness for each task item. TDS values were calculated in a similar fashion to a previous study [17]. Note that for the synonym task, the TDS was calculated for similar- (top row) and dissimilar-word-pair (bottom row) items separately, as the two words' word2vec distance. Word length was measured for each item by summing the number of characters over all words in the item. Mean RTs were group-average winsorized reaction times on all correct trials (except extremely long-time trials). Mean accuracy and RT were calculated by averaging participants' performance for each item and then averaging performance across items in each task. All s.d. and ranges were calculated over items instead of participants.
| semantic knowledge test (synonym judgement task) | semantic control test (feature-matching tasks) | non-semantic cognitive control test (string-matching task) | |||||||
|---|---|---|---|---|---|---|---|---|---|
| mean | s.d. | range | mean | s.d. | range | mean | s.d. | range | |
| frequency (Zipf) | 2.79 | 1.13 | 0.70–5.44 | 3.42 | 0.54 | 2.09–4.98 | |||
| TDS | 0.43 (similar) | 0.15 | 0.13–0.76 | 0.02 | 0.36 | −0.79–0.71 | |||
| 0.12 (dissimilar) | 0.10 | −0.05–0.45 | |||||||
| concreteness | 3.61 | 1.03 | 1.52–5 | 4.76 | 0.17 | 3.92–5 | |||
| word length | 13.19 | 3.33 | 6–23 | 17.05 | 3.05 | 11–23 | 15.60 | 1.20 | 15–18 |
| young accuracy | 0.80 | 0.22 | 0.16–1 | 0.87 | 0.12 | 0.33–1 | 0.83 | 0.17 | 0.31–1 |
| young RT (ms) | 1957 | 638 | 1000–4174 | 2951 | 456 | 1867–4064 | 3104 | 1024 | 1618–6038 |
| older accuracy | 0.90 | 0.16 | 0.13–1 | 0.90 | 0.14 | 0.13–1 | 0.90 | 0.11 | 0.51–1 |
| older RT (ms) | 2030 | 535 | 1215–3896 | 3656 | 814 | 2029–6487 | 6191 | 2046 | 3082–10406 |
Correlations of task performance (accuracy rates) in older and young groups. Note: p-values are uncorrected.
| older | young | |||
|---|---|---|---|---|
| semantic knowledge test – semantic control test | 0.19 | 0.05 | 0.33 | <10−3 |
| semantic knowledge test – non-semantic cognitive control test | 0.11 | 0.26 | 0.37 | <10−4 |
| semantic control test – non-semantic cognitive control test | 0.50 | <10−7 | 0.29 | <10−3 |
Mixed effects models predicting participant accuracy and RT in the semantic knowledge test from age group, frequency and item type.
| effect | accuracy | RT | ||||
|---|---|---|---|---|---|---|
| s.e. | s.e. | |||||
| group | 0.520 | 0.058 | <10−15 | 0.031 | 0.009 | <0.001 |
| frequency | 0.867 | 0.112 | <10−13 | −0.061 | 0.006 | <10−15 |
| item type | 0.239 | 0.118 | <0.05 | 0.028 | 0.006 | <10−5 |
| group * frequency | −0.038 | 0.054 | 0.618 | 0.003 | 0.003 | 0.315 |
| group * item type | 0.025 | 0.068 | 0.598 | −0.006 | 0.003 | 0.064 |
| frequency * item type | −0.381 | 0.113 | <0.001 | 0.019 | 0.005 | <0.001 |
| group * frequency * item type | 0.248 | 0.058 | <10−5 | −0.008 | 0.002 | <0.001 |
Figure 2Modelled effects of age group and frequency on accuracy and RT for similar- (a) and dissimilar-word-pair (b) items in the semantic knowledge test. Shadow areas indicate 95% confidence intervals.
Figure 3Modelled effects of age group and TDS on accuracy and RT in the semantic control test. Shadow areas indicate 95% confidence intervals.
Mixed effects models predicting participant accuracy and RT from difficulty level.
| difficulty effect on accuracy | difficulty effect on RT | |||||
|---|---|---|---|---|---|---|
| s.e. | s.e. | |||||
| colour feature-matching task | −0.983 | 0.081 | <10−15 | 0.069 | 0.003 | <10−15 |
| size feature-matching task | −1.394 | 0.096 | <10−15 | 0.059 | 0.003 | <10−15 |
| synonym judgement task | −1.401 | 0.084 | <10−15 | 0.064 | 0.003 | <10−15 |
| string-matching task | −1.139 | 0.123 | <10−14 | 0.086 | 0.003 | <10−15 |
| colour feature-matching task | −0.941 | 0.069 | <10−15 | 0.055 | 0.003 | <10−15 |
| size feature-matching task | −0.938 | 0.065 | <10−15 | 0.040 | 0.003 | <10−15 |
| synonym judgement task | −1.117 | 0.053 | <10−15 | 0.058 | 0.003 | <10−15 |
| string-matching task | −1.138 | 0.066 | <10−15 | 0.069 | 0.003 | <10−15 |
Figure 4Relationships between difficulty level and behavioural performance of older (a) and young (b) participants on subset stimuli in different tasks. Shadow areas indicate 95% confidence intervals.
Test performance of participants on subset items at each difficulty level in each task. Note: mean RTs were group-average winsorized reaction times on all correct trials (except extremely long-time trials). Mean accuracy and RT were calculated by averaging participants' performance for each item and then averaging performance across items at each difficulty level in each task. All s.d.s were calculated over items instead of participants. (T) indicates the target in the item. (S) and (D) indicate similar-word-pair and dissimilar-word-pair items.
| difficulty | example | young accuracy | young RT (ms) | older accuracy | older RT (ms) | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| mean | s.d. | mean | s.d. | mean | s.d. | mean | s.d. | |||
| semantic knowledge test (synonym judgement task) | 1 | cliche rug (D) | 0.97 | 0.03 | 1371 | 243 | 0.99 | 0.01 | 1514 | 135 |
| prawn shrimp (S) | ||||||||||
| 2 | hog moped (D) | 0.96 | 0.02 | 1550 | 246 | 0.98 | 0.01 | 1654 | 208 | |
| fastener zipper (S) | ||||||||||
| 3 | nemesis president (D) | 0.93 | 0.04 | 1754 | 318 | 0.97 | 0.03 | 1926 | 191 | |
| aspirin tablet (S) | ||||||||||
| 4 | gonorrhea headache (D) | 0.84 | 0.08 | 1886 | 432 | 0.93 | 0.07 | 2018 | 224 | |
| tendency trend (S) | ||||||||||
| 5 | felicitous faithful (D) | 0.72 | 0.07 | 2392 | 416 | 0.81 | 0.10 | 2530 | 519 | |
| denture implant (S) | ||||||||||
| semantic control test (colour feature-matching task) | 1 | walnut | 0.98 | 0.01 | 2412 | 115 | 0.98 | 0.03 | 2971 | 281 |
| breeze almond(T) | ||||||||||
| 2 | lemon | 0.94 | 0.03 | 2561 | 246 | 0.94 | 0.03 | 3272 | 464 | |
| highlighter(T) tree | ||||||||||
| 3 | lead | 0.92 | 0.02 | 3122 | 261 | 0.95 | 0.04 | 3772 | 235 | |
| pencil graphite(T) | ||||||||||
| 4 | moon | 0.91 | 0.03 | 3098 | 211 | 0.90 | 0.05 | 4034 | 611 | |
| earth tooth(T) | ||||||||||
| 5 | artery | 0.81 | 0.06 | 3548 | 350 | 0.79 | 0.07 | 4828 | 291 | |
| cranberry(T) vein | ||||||||||
| semantic control test (size feature-matching task) | 1 | nurse | 0.94 | 0.04 | 2580 | 157 | 0.99 | 0.01 | 2946 | 252 |
| hospital baboon(T) | ||||||||||
| 2 | basketball | 0.88 | 0.04 | 2817 | 132 | 0.97 | 0.01 | 3249 | 168 | |
| head(T) player | ||||||||||
| 3 | roof | 0.89 | 0.04 | 3141 | 257 | 0.96 | 0.04 | 3650 | 208 | |
| canopy(T) stool | ||||||||||
| 4 | boot | 0.84 | 0.06 | 2966 | 219 | 0.89 | 0.04 | 3648 | 307 | |
| pigeon(T) car | ||||||||||
| 5 | candle | 0.73 | 0.06 | 3555 | 267 | 0.77 | 0.05 | 4716 | 341 | |
| lantern cup(T) | ||||||||||
| non-semantic cognitive control test (string-matching task) | 1 | ksije | 0.99 | 0.01 | 1833 | 85 | 0.99 | 0.01 | 3531 | 265 |
| ksvje(T) tbdnx | ||||||||||
| 2 | ibavf | 0.98 | 0.01 | 2028 | 116 | 0.99 | 0.01 | 3833 | 208 | |
| imjuf ibayf(T) | ||||||||||
| 3 | xpstj | 0.95 | 0.02 | 2329 | 132 | 0.99 | 0.01 | 4236 | 384 | |
| xhqtn xpstk(T) | ||||||||||
| 4 | fyvjq | 0.92 | 0.05 | 2669 | 172 | 0.98 | 0.02 | 5240 | 519 | |
| fyejq(T) fyswx | ||||||||||
| 5 | xlmueg | 0.82 | 0.09 | 3250 | 373 | 0.90 | 0.04 | 6678 | 311 | |
| xlcaem xjmued(T) | ||||||||||
Figure 5Distribution of older (a) and young (b) participants' overall accuracy rates on subset items in different tasks and at different difficulty levels. The height of the bars indicates number count of participants whose performance fell within a specified range of accuracy values. Dash line indicates group-average accuracy.