| Literature DB >> 19933145 |
Matthew H Davis1, M Gareth Gaskell.
Abstract
In this paper we present a novel theory of the cognitive and neural processes by which adults learn new spoken words. This proposal builds on neurocomputational accounts of lexical processing and spoken word recognition and complementary learning systems (CLS) models of memory. We review evidence from behavioural studies of word learning that, consistent with the CLS account, show two stages of lexical acquisition: rapid initial familiarization followed by slow lexical consolidation. These stages map broadly onto two systems involved in different aspects of word learning: (i) rapid, initial acquisition supported by medial temporal and hippocampal learning, (ii) slower neocortical learning achieved by offline consolidation of previously acquired information. We review behavioural and neuroscientific evidence consistent with this account, including a meta-analysis of PET and functional Magnetic Resonance Imaging (fMRI) studies that contrast responses to spoken words and pseudowords. From this meta-analysis we derive predictions for the location and direction of cortical response changes following familiarization with pseudowords. This allows us to assess evidence for learning-induced changes that convert pseudoword responses into real word responses. Results provide unique support for the CLS account since hippocampal responses change during initial learning, whereas cortical responses to pseudowords only become word-like if overnight consolidation follows initial learning.Entities:
Mesh:
Year: 2009 PMID: 19933145 PMCID: PMC2846311 DOI: 10.1098/rstb.2009.0111
Source DB: PubMed Journal: Philos Trans R Soc Lond B Biol Sci ISSN: 0962-8436 Impact factor: 6.237
Figure 1.Neural and functional organization of systems involved in representing and learning spoken words. (a) Left temporal lobe regions involved in perceiving and comprehending spoken words (based on Hickok & Poeppel 2004; Davis & Johnsrude 2007) and their interactions with medial temporal systems for word learning. (b) Functional organization of the Distributed Cohort Model (Gaskell & Marslen-Wilson 1997, 1999; depicted within the grey box) with additional connections to hippocampal/episodic memory system for learning new words. In both diagrams, rapid cortico-cortico connections are shown with solid lines, and slower, cortico-hippocampal connections are shown with broken lines. Dotted lines with open arrow-heads show recurrent connections involved in maintaining acoustic-phonetic representations in echoic memory.
Figure 2.Impact of initial learning and sleep-associated consolidation on lexical representations and word recognition. (a) Speech waveforms for tokens of an existing word (cathedral) and a new word (cathedruke) with a marker showing the approximate time point at which the acoustic-phonetic input for cathedral diverges from all other known words (uniqueness point, cf. Marslen-Wilson 1984). The uniqueness point for cathedral is markedly later (orange line versus blue line) if the new word cathedruke must also be ruled out. (b) Lexical organization of these words after learning and before or after sleep-associated consolidation. Before sleep (blue box) the strongest lexical competitor for cathedral is cathartic, hence the uniqueness point is reached once this word can be ruled out (early uniqueness point shown in a). After sleep (orange box), the addition of a new lexical competitor is such that additional speech input is required to rule out cathedruke (late uniqueness point shown in a). (c) Pause detection response times showing the impact of learning and consolidation on lexical competition (data replotted from Dumay & Gaskell 2007). Two groups of participants were trained on novel words (e.g. cathedruke) at either 8.00 h or 20.00 h and tested on matched items with and without new lexical competitors 0, 12 or 24 h after training. Responses to existing words were significantly delayed by competition from newly learned words only for those conditions in which sleep intervened between training and testing (orange bars).
Studies included in the meta-analysis. Those labelled asterisk (*) report peak activations for the Talairach & Tournaux (1988) brain atlas and have been transformed into the MNI152 average brain for analysis and reporting.
| citation | imaging modality | number of participants | task | number of foci | |
|---|---|---|---|---|---|
| word > pseudo | pseudo > word | ||||
| fMRI | 28 | simple auditory detection (block onset) | 3 | 0 | |
| fMRI | 16 | pause-detection (non-target trials) | 0 | 7 | |
| fMRI | 18 | lexical decision (filtered) | 8 | 4 | |
| fMRI | 13 | lexical decision (paired priming) | 7 | 6 | |
| PET | 11 | repetition/passive listening | 7 | 1 | |
| PET | 12 | speeded repetition | 2 | 3 | |
| fMRI | 13 | lexical decision (priming) | 33 | 0 | |
| fMRI | 15 | lexical decision | 0 | 1 | |
| fMRI | 16 | lexical decision | 4 | 2 | |
| fMRI | 28 | one-back detection (attend/not) | 1 | 1 | |
| fMRI | 14 | lexical decision | 3 | 4 | |
| 68 | 29 | ||||
Figure 3.Activation Likelihood Estimation (ALE) map derived from 97 peak voxels from 11 functional imaging studies comparing neural responses to spoken words and pseudowords. ALE maps are thresholded at p < 0.05 FDR corrected, and only clusters larger than 100 mm3 are shown. Additional activation for pseudowords compared with words (red) and words compared with pseudowords (blue) is shown (a) rendered onto the left hemisphere, (b) displayed on an axial and (c) multiple sagittal slices of the MNI canonical brain (z and x coordinates as shown). Note inferior temporal and fusiform activation for word >pseudoword (circled in orange) is largely hidden in the rendering but apparent on the axial slice and sagittal slices x = −48, −40 and −32. (d,e) Response profiles showing predicted changes in neural responses owing to familarization with pseudowords: (d) Within regions that initially show an additional response to pseudowords (red in figure 3a–c, e.g. STG, posterior inferior frontal gyrus). For these regions we predict a diminished response to pseudowords after training. (d) Predicted response within regions that show an additional response to real words (anterior MTG, anterior fusiform, supramarginal gyrus, blue in figure 3a–c), we predict an increased response to pseudowords following training. (f) fMRI responses in a region of the STG overlapping with areas shown in red in figure 3a–c (replotted from Davis ). An equivalent, additional response to pseudowords was seen for items that were untrained at the time of scanning or trained but not consolidated (i.e. learned on the same day as scanning). However, there was a significant lexicality by consolidation interaction with a reduced response to pseudowords that were trained and consolidated (i.e. learned on the day before scanning).
Activation likelihood estimation results for 29 peak voxels in studies reporting a greater neural response to spoken pseudowords than words (shown in red in figure 2). Results thresholded at p < 0.05 FDR corrected, and with clusters greater than 100 mm3 reported. Entries shown in bold are cluster summary statistics (including centre of mass and volume), entries in plain type show local maxima.
| MNI coordinates | |||||
|---|---|---|---|---|---|
| location | volume (mm3) | ||||
| left superior temporal gyrus | 0.015 | −60 | −34 | 10 | |
| left superior temporal gyrus | 0.013 | −64 | −22 | 12 | |
| right superior temporal gyrus | 0.007 | 64 | −28 | 14 | |
| right superior temporal gyrus | 0.007 | 58 | −26 | 6 | |
| right middle temporal gyrus | 0.007 | 60 | −28 | −6 | |
| right middle temporal gyrus | 0.007 | 68 | −24 | −2 | |
| left superior temporal gyrus | 0.007 | −60 | −8 | −4 | |
| left superior temporal gyrus | 0.007 | −50 | −10 | 0 | |
| left superior temporal gyrus | 0.007 | −56 | 2 | 0 | |
| right cingulate gyrus | 0.007 | 12 | 22 | 34 | |
| right cingulate gyrus | 0.007 | 2 | 24 | 34 | |
| right insula | 0.006 | 50 | −32 | 18 | |
| left inferior frontal gyrus | 0.007 | −54 | 18 | 10 | |
| right middle frontal gyrus | 0.006 | 40 | 26 | 18 | |
| left middle temporal gyrus | 0.007 | −48 | −44 | 4 | |
| right medial frontal gyrus | 0.007 | 8 | 14 | 44 | |
| right cerebellum | 0.007 | 12 | −56 | −8 | |
| right insula | 0.006 | 26 | 26 | 8 | |
| left superior frontal gyrus | 0.007 | −4 | 18 | 50 | |
| left postcentral gyrus | 0.007 | −48 | −26 | 58 | |
| right inferior frontal gyrus | 0.007 | 50 | 20 | −10 | |
| left insula | 0.007 | −46 | −20 | 18 | |
| right middle temporal gyrus | 0.007 | 58 | −6 | −4 | |
| right inferior frontal gyrus | 0.007 | 28 | 6 | 26 | |
| left parahippocampal gyrus | 0.006 | −26 | −52 | −2 | |
| left cerebellum | 0.007 | −22 | −62 | −26 | |
| right inferior frontal gyrus | 0.007 | 54 | 22 | 12 | |
| left precentral gyrus | 0.006 | −34 | 4 | 28 | |
| left precentral gyrus | 0.006 | −34 | 4 | 30 | |
Activation likelihood estimation results for 68 peak voxels in studies reporting a greater neural response to spoken words than pseudowords (shown in blue in figure 2). Results thresholded at p < 0.05 FDR corrected, and with clusters greater than 100 mm3 reported. Entries shown in bold are cluster summary statistics (centre of mass and volume), entries in plain type show local maxima.
| MNI coordinates | |||||
|---|---|---|---|---|---|
| location | volume (mm3) | ||||
| 2488 | |||||
| left middle temporal gyrus | 0.009 | −40 | −70 | 24 | |
| left supramarginal gyrus | 0.008 | −54 | −52 | 30 | |
| left superior temporal gyrus | 0.007 | −50 | −58 | 22 | |
| left superior occipital gyrus | 0.007 | −44 | −78 | 28 | |
| 1320 | |||||
| left inferior temporal gyrus | 0.008 | −40 | −24 | −18 | |
| left fusiform gyrus | 0.008 | −36 | −34 | −16 | |
| 1152 | |||||
| right superior temporal gyrus | 0.009 | 54 | −48 | 14 | |
| right supramarginal gyrus | 0.007 | 54 | −50 | 30 | |
| left inferior frontal gyrus | 0.008 | −38 | 26 | −8 | |
| left inferior frontal gyrus | 0.007 | −48 | 24 | −12 | |
| 664 | |||||
| left cuneus | 0.01 | −6 | −66 | 32 | |
| 648 | |||||
| right precuneus | 0.007 | 14 | −66 | 40 | |
| right precuneus | 0.007 | 16 | −66 | 30 | |
| right precuneus | 0.007 | 12 | −56 | 38 | |
| 632 | |||||
| left middle frontal gyrus | 0.009 | −38 | 42 | −10 | |
| 440 | |||||
| left precuneus | 0.007 | −12 | −48 | 28 | |
| left cingulate gyrus | 0.007 | −10 | −44 | 30 | |
| 416 | |||||
| right precentral gyrus | 0.008 | 56 | −6 | 12 | |
| right precentral gyrus | 0.008 | 54 | −4 | 14 | |
| 328 | |||||
| left superior parietal lobule | 0.007 | −30 | −76 | 46 | |
| left superior parietal lobule | 0.007 | −36 | −68 | 48 | |
| 280 | |||||
| left putamen | 0.007 | −22 | 12 | 2 | |
| left putamen | 0.007 | −16 | 12 | 4 | |
| 144 | |||||
| left precuneus | 0.007 | −32 | −82 | 36 | |
| 136 | |||||
| left middle temporal gyrus | 0.007 | −56 | 0 | −20 | |
| 128 | |||||
| right middle temporal gyrus | 0.007 | 50 | −8 | −14 | |
| 128 | |||||
| left anterior cingulate | 0.007 | −8 | 38 | 0 | |
| 128 | |||||
| left inferior parietal lobule | 0.007 | −58 | −42 | 46 | |
| 120 | |||||
| left middle frontal gyrus | 0.007 | −22 | 26 | −16 | |
| 120 | |||||
| left middle temporal gyrus | 0.007 | −56 | −36 | −2 | |
| 104 | |||||
| left precuneus | 0.007 | −36 | −62 | 32 | |