| Literature DB >> 34248751 |
Jakub Dotlačil1, Puck de Haan2.
Abstract
This paper explores how the rational theory of memory summarized in Anderson (1991) can inform the computational psycholinguistic models of human parsing. It is shown that transition-based parsing is particularly suitable to be combined with Anderson's theory of memory systems. The combination of the rational theory of memory with the transition-based parsers results in a model of sentence processing that is data-driven and can be embedded in the cognitive architecture Adaptive Control of Thought-Rational (ACT-R). The predictions of the parser are tested against qualitative data (garden-path sentences) and a self-paced reading corpus (the Natural Stories corpus).Entities:
Keywords: cognitively constrained parsers; computational psycholinguistics; memory retrieval; modeling reading data; rational theory of memory
Year: 2021 PMID: 34248751 PMCID: PMC8261045 DOI: 10.3389/fpsyg.2021.657705
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Figure 1Phrase structure of a boy dances.
Figure 2Activations per word for sentence pairs (18)–(21). The yellow bars represent the activations in the sentences that disambiguate early. The blue bars are the activations of the garden-path sentences. The ellipses highlight the activations on the words that trigger the garden-path effect.
Figure 3Sequential model of reading on one word. Each box represents one process. Arrows show the order in which the processes fire. There are two arrows from retrieve parsing steps because retrieve wh-dependent is only triggered when a gap is postulated by the parser.
Figure 4Bayesian model for parameter estimation of Natural Stories Corpus.
Estimated parameter values.
| 0.0139 | 0.0139 | 0.001 | |
| 0.661 | 0.655 | 0.068 |
The linear model with Predictive RT as the only independent variable.
| Predictive RT | 0.993 | 0.0024 | 415.5 |
The linear model with Intercept and Predictive RT.
| Intercept | 248.4 | 12.7 | 19.57 | |
| Predicted RT | 0.220 | 0.040 | 5.55 |
A full linear model for RTs in the NSC.
| Intercept | 258.5 | 17.2 | 15 | |
| Story | 7.3 | 1.3 | 5.5 | |
| Zone | −3.9 | 0.87 | −4.5 | |
| Position | −2 | 0.7 | −3 | 0.003 |
| Story:Zone | −3.3 | 1.34 | −2.5 | 0.01 |
| Zone:Position | 1.65 | 0.73 | 2.25 | 0.02 |
| Nchar | 16.3 | 3.79 | 4.3 | |
| Log(Freq) | 0.21 | 0.52 | 0.4 | 0.7 |
| Nchar:log(Freq) | −0.68 | 0.22 | −3.1 | 0.002 |
| Log(Bigram) | 0.25 | 0.63 | 0.4 | 0.7 |
| Log(Trigram) | −0.88 | 0.48 | −1.82 | 0.07 |
| Predicted RT | 0.15 | 0.04 | 3.66 | 0.0003 |
Figure 5Mean and standard deviation summaries of model and data split per trigram, frequency and observed mean RT deciles. The x-axis label shows the upper cut-off point per decile (given in log in case of Frequency). In case of Frequency, only 9 deciles are present. This is because a single word (the) spans the top two deciles.