| Literature DB >> 35324572 |
Boris Forthmann1, Natalie Förster1, Elmar Souvignier1.
Abstract
Monitoring the progress of student learning is an important part of teachers' data-based decision making. One such tool that can equip teachers with information about students' learning progress throughout the school year and thus facilitate monitoring and instructional decision making is learning progress assessments. In practical contexts and research, estimating learning progress has relied on approaches that seek to estimate progress either for each student separately or within overarching model frameworks, such as latent growth modeling. Two recently emerging lines of research for separately estimating student growth have examined robust estimation (to account for outliers) and Bayesian approaches (as opposed to commonly used frequentist methods). The aim of this work was to combine these approaches (i.e., robust Bayesian estimation) and extend these lines of research to the framework of linear latent growth models. In a sample of N = 4970 second-grade students who worked on the quop-L2 test battery (to assess reading comprehension) at eight measurement points, we compared three Bayesian linear latent growth models: (a) a Gaussian model, (b) a model based on Student's t-distribution (i.e., a robust model), and (c) an asymmetric Laplace model (i.e., Bayesian quantile regression and an alternative robust model). Based on leave-one-out cross-validation and posterior predictive model checking, we found that both robust models outperformed the Gaussian model, and both robust models performed comparably well. While the Student's t model performed statistically slightly better (yet not substantially so), the asymmetric Laplace model yielded somewhat more realistic posterior predictive samples and a higher degree of measurement precision (i.e., for those estimates that were either associated with the lowest or highest degree of measurement precision). The findings are discussed for the context of learning progress assessment.Entities:
Keywords: Bayesian analysis; growth; progress monitoring; robust estimation; slope
Year: 2022 PMID: 35324572 PMCID: PMC8949320 DOI: 10.3390/jintelligence10010016
Source DB: PubMed Journal: J Intell ISSN: 2079-3200
Linear Latent Growth Model Definitions and Used Prior Distributions.
| Model | Gaussian | Student’s | Asymmetric Laplace |
|---|---|---|---|
| Response Distribution |
|
|
|
| Linear Predictor |
|
|
|
| Latent Variable Distribution |
|
|
|
|
|
|
|
|
|
| Improper flat prior | Improper flat prior | Improper flat prior |
|
|
|
|
|
|
|
|
|
|
| Prior for correlation matrices |
|
|
|
| Prior for |
|
|
|
| Prior for | - |
| - |
= reading efficiency factor score for person p at timepoint t. = linear predictor for person p at timepoint t. = Residual variance. = degrees of freedom of Student’s t-distribution. = Intercept of person p (i.e., initial level of reading efficiency). = Slope of person p (i.e., learning progress in reading efficiency). = Coding variable of measurement timepoint t (. = Matrix of latent variables and . = Vector of latent variable means (i.e., the average intercept across all persons) and (i.e., the average slope across all persons). = Covariance matrix of latent variables and . N() = Normal distribution. t() = Student’s t distribution. ALD() = Asymmetric Laplace distribution. MVN() = Multivariate normal distribution. ht() = Half-t distribution. lkj() = Lewandowski-Kurowicka-Joe distribution. Γ() = Gamma distribution.
Model Estimates and Comparisons for the Latent Growth Curve Models.
| Model | Gaussian | Student’s | Asymmetric Laplace | |||
|---|---|---|---|---|---|---|
| Estimate | 95% CI | Estimate | 95% CI | Estimate | 95% CI | |
| Person Level (Latent Variables) | ||||||
|
| 34.10 | [33.32, 34.90] | 34.28 | [33.48, 35.06] | 34.17 | [33.38, 34.98] |
|
| 2.79 | [2.65, 2.94] | 2.41 | [2.28, 2.54] | 2.29 | [2.15, 2.42] |
| −0.55 | [−0.58, −0.52] | −0.63 | [−0.67, −0.60] | −0.63 | [−0.66, −0.59] | |
| Population Level | ||||||
|
| −13.98 | [−15.00, −12.96] | −12.79 | [−13.79, −11.75] | −12.70 | [−13.74, −11.68] |
|
| 4.52 | [4.40, 4.64] | 4.59 | [4.49, 4.70] | 4.57 | [4.46, 4.67] |
|
| 21.90 | [21.73, 22.08] | 14.55 | [14.31, 14.80] | 7.67 | [7.59, 7.76] |
|
| - | - | 3.22 | [3.08, 3.36] | - | - |
| Quantile | - | - | - | - | 0.50 | - |
| LOO Comparison | ||||||
| ELPD Difference | −2600.60 | 0.00 | −32.50 | |||
| ELPD Difference | 154.00 | 0.00 | 30.40 |
CI = credible interval. LOO = leave-one-out cross-validation. ELPD = expected log-pointwise predictive density. SE = standard error. Please see Table 1 for model definitions and equations.
Figure 1Graphical posterior predictive checking results. Top: Density overlay (based on ten posterior draws for each of the models) restricted to the range of −290 to 140 on the x axis to facilitate a comparison of model fit based on the main part of the empirical distribution (i.e., observed values of reading efficiency y ranged from −286.11 to 135.88). Bottom: Boxplots of the original data (dark blue) and ten draws of the posterior predictive distribution (boxes in light blue) to facilitate comparison of the sampled values between the three models.
Figure 2Distribution density plots for the following distributions (cf. Table 2): N(−13.98, 34.10), t (−12.79, 14.55, 3.22), and ALD (−12.70, 7.67, 0.50). These are the distributions based on the estimated models reported in Table 2 for reading efficiency at the first measurement point for the average student. Left: y-axis range from 0 to 0.035 and x-axis range from −200 to 200. Right: “zoom-in” depiction of the densities to better visualize differences at the tails, i.e., y-axis range from 0 to 0.00001 and x-axis range from −200 to 200.
Figure 3Bivariate scatter plots between initial level and learning progress estimates.
Figure 4Bivariate scatter plots of measurement precision estimates. Red line = LOESS curve.