| Literature DB >> 30984068 |
Jung Yeon Park1, Frederik Cornillie1, Han L J van der Maas2, Wim Van Den Noortgate1.
Abstract
Adaptive learning systems have received an increasing attention as they enable to provide personalized instructions tailored to the behaviors and needs of individual learners. In order to reach this goal, it is desired to have an assessment system, monitoring each learner's ability change in real time. The Elo Rating System (ERS), a popular scoring algorithm for paired competitions, has recently been considered as a fast and flexible method that can assess learning progress in online learning environments. However, it has been argued that a standard ERS may be problematic due to the multidimensional nature of the abilities embedded in learning materials. In order to handle this issue, we propose a system that incorporates a multidimensional item response theory model (MIRT) in the ERS. The basic idea is that instead of updating a single ability parameter from the Rasch model, our method allows a simultaneous update of multiple ability parameters based on a compensatory MIRT model, resulting in a multidimensional extension of the ERS ("M-ERS"). To evaluate the approach, three simulation studies were conducted. Results suggest that the ERS that incorrectly assumes unidimensionality has a seriously lower prediction accuracy compared to the M-ERS. Accounting for both speed and accuracy in M-ERS is shown to perform better than using accuracy data only. An application further illustrates the method using real-life data from a popular educational platform for exercising math skills.Entities:
Keywords: Elo rating system; adaptive practice; e-learning; multidimensional IRT; speed-accuracy trade-off
Year: 2019 PMID: 30984068 PMCID: PMC6450197 DOI: 10.3389/fpsyg.2019.00620
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Patterns of multidimensionality (a sample of 15 items from two item banks).
| 1 | −2.534 | 1 | . | . | 1 | . | . |
| 2 | −2.21 | 1 | . | . | 1 | . | . |
| 3 | 1.326 | 1 | . | . | 1 | . | . |
| 4 | 0.253 | 1 | . | . | 1 | . | . |
| 5 | 1.275 | 1 | . | . | 1 | . | . |
| 6 | 0.089 | 1 | . | . | 1 | . | . |
| 7 | −0.001 | 1 | 1 | . | . | 1 | . |
| 8 | −1.256 | 1 | 1 | . | . | 1 | . |
| 9 | 2.242 | 1 | 1 | . | . | 1 | . |
| 10 | −1.556 | 1 | 1 | . | . | 1 | . |
| 11 | 2.213 | 1 | 1 | . | . | 1 | . |
| 12 | −3.3 | 1 | . | 1 | . | . | 1 |
| 13 | 0.753 | 1 | . | 1 | . | . | 1 |
| 14 | −2.246 | 1 | . | 1 | . | . | 1 |
| 15 | −1.156 | 1 | . | 1 | . | . | 1 |
“1” indicates that the item loads on the dimension.
Figure 1Receiver operating characteristic (ROC) curves for a standard ERS and M-ERS. Note. Each panel includes six ROC curves representing a total of 6 simulation conditions (2 patterns of dimensionality × 3 correlations among ability parameters).
Area Under ROC curve (AUROC) for a standard ERS and M-ERS.
| ρ = 0.0 | 0.5197 | 0.8038 | 0.5216 | 0.7850 |
| ρ = 0.2 | 0.5204 | 0.8070 | 0.5324 | 0.7900 |
| ρ = 0.5 | 0.5496 | 0.8157 | 0.5494 | 0.7893 |
| Average | 0.5299 | 0.8088 | 0.5345 | 0.7881 |
Figure 2Result of 3-dimensional ability estimation across the number of items answered. Cor, correlations between dimensions; D1, 1st dimension; D2, 2nd dimension; D3, 3rd dimension.
Figure 3Relations between EAP and M-ERS estimates.
Comparing IRT-based ERS algorithms for correctness vs. trade-off score (correctness and speediness combined).
| Accuracy | 0.009 | 0.300 | 0.008 | 0.286 | 0.009 | 0.289 | |
| Speed-accuracy | 0.008 | 0.245 | 0.008 | 0.232 | 0.007 | 0.213 | |
| Accuracy | 0.010 | 0.364 | 0.009 | 0.363 | 0.010 | 0.382 | |
| Speed-accuracy | 0.009 | 0.306 | 0.009 | 0.298 | 0.009 | 0.307 | |
| Accuracy | 0.010 | 0.400 | 0.010 | 0.395 | 0.010 | 0.392 | |
| Speed-accuracy | 0.009 | 0.337 | 0.009 | 0.336 | 0.009 | 0.332 | |
| Accuracy | 0.008 | 0.291 | 0.008 | 0.294 | 0.008 | 0.290 | |
| Speed-accuracy | 0.008 | 0.245 | 0.008 | 0.251 | 0.008 | 0.252 | |
| Accuracy | 0.008 | 0.299 | 0.008 | 0.290 | 0.008 | 0.291 | |
| Speed-accuracy | 0.008 | 0.260 | 0.008 | 0.245 | 0.008 | 0.252 | |
| Accuracy | 0.009 | 0.337 | 0.009 | 0.353 | 0.009 | 0.315 | |
| Speed-accuracy | 0.008 | 0.299 | 0.009 | 0.310 | 0.008 | 0.278 | |
|Bias|, absolute value of the bias averaged over learners and items; MSE, a mean squared error averaged over learners and items.
Figure 4Example of ability estimates for a student by standard ERS and M-ERS.