| Literature DB >> 34950869 |
Thomas Wilschut1,2, Florian Sense1, Maarten van der Velde1,2, Zafeirios Fountas3, Sarah C Maaß1,2,4, Hedderik van Rijn1,2.
Abstract
Memorising vocabulary is an important aspect of formal foreign-language learning. Advances in cognitive psychology have led to the development of adaptive learning systems that make vocabulary learning more efficient. One way these computer-based systems optimize learning is by measuring learning performance in real time to create optimal repetition schedules for individual learners. While such adaptive learning systems have been successfully applied to word learning using keyboard-based input, they have thus far seen little application in word learning where spoken instead of typed input is used. Here we present a framework for speech-based word learning using an adaptive model that was developed for and tested with typing-based word learning. We show that typing- and speech-based learning result in similar behavioral patterns that can be used to reliably estimate individual memory processes. We extend earlier findings demonstrating that a response-time based adaptive learning approach outperforms an accuracy-based, Leitner flashcard approach in learning efficiency (demonstrated by higher average accuracy and lower response times after a learning session). In short, we show that adaptive learning benefits transfer from typing-based learning, to speech based learning. Our work provides a basis for the development of language learning applications that use real-time pronunciation assessment software to score the accuracy of the learner's pronunciations. We discuss the implications for our approach for the development of educationally relevant, adaptive speech-based learning applications.Entities:
Keywords: ACT-R; adaptive learning; memory; pronunciation; reaction times (RT); speech
Year: 2021 PMID: 34950869 PMCID: PMC8689065 DOI: 10.3389/frai.2021.780131
Source DB: PubMed Journal: Front Artif Intell ISSN: 2624-8212
FIGURE 1Theoretical framework for typing- and speech based vocabulary learning. Thick solid lines represent strong, established connections between first language representations for a word in the mental lexicon. Arrows represent the newly learned connections between L1 orthographic and L2 orthographic (typing-based learning) or L2 phonological (speech-based learning) representations. Dotted lines represent connections that are learned implicitly.
FIGURE 2Experimental design.
FIGURE 3Visual comparison of reaction times for correct trials in the RT-adaptive, typing-based (RT-T) and RT-adaptive speaking-based (RT-S) learning condition over the time course of the experiment. Vertical lines represent median reaction times at each time point.
Predicting accuracy from reaction time-based memory activation.
| Model 1: Typing-based adaptive learning |
| SE |
|
|
|---|---|---|---|---|
| Intercept | 1.47 | 0.12 | 11.94 |
|
| Activation | 0.57 | 0.05 | 10.64 |
|
|
|
|
|
|
|
| Intercept | 2.54 | 0.19 | 13.15 |
|
| Activation | 1.83 | 0.01 | 19.09 |
|
***p 0.001; **p 0.01; *p 0.05.
FIGURE 4Predicting accuracy from normalized memory activations in the RT-adaptive typing condition (RT-T) and the RT-adaptive speaking condition (RT-S). The cloud of semi-transparent points shows the empirical accuracy. These values are either 1 (correct) or 0 (incorrect) but have been offset and jittered vertically to highlight the differences between the conditions and where on the x-axis the data are concentrated.
Predicting performance from learning condition and session.
| Model 3: Accuracy |
| SE |
|
| |
|---|---|---|---|---|---|
| Intercept | 1.94 | 0.16 | 12.12 |
| |
| Leitner learning | −0.71 | 0.08 | −9.25 |
| |
| Test | 0.29 | 0.19 | 1.02 | 0.309 | |
| Leitner learning × Test | 0.16 | 0.25 | 0.66 | 0.513 | |
|
|
|
|
|
|
|
| Intercept | 2,825.18 | 93.80 | 56.85 | 30.19 |
|
| Leitner learning | 554.33 | 49.82 | 5,397.75 | 11.13 |
|
| Test | −93.31 | 104.26 | 5,326.47 | 0.90 | 0.371 |
| Leitner learning × Test | −28.70 | 147.93 | 5,316.82 | −0.19 | 0.846 |
***p 0.001; **p 0.01; *p 0.05.
FIGURE 5Accuracy and median reaction times for RT-adaptive (RT-S) and Leitner-adaptive (L–S) speech-based learning. Error bars are standard errors.