| Literature DB >> 35687222 |
David Kaplan1, Jianshen Chen2, Sinan Yavuz3, Weicong Lyu3.
Abstract
The purpose of this paper is to demonstrate and evaluate the use of Bayesian dynamic borrowing (Viele et al, in Pharm Stat 13:41-54, 2014) as a means of systematically utilizing historical information with specific applications to large-scale educational assessments. Dynamic borrowing via Bayesian hierarchical models is a special case of a general framework of historical borrowing where the degree of borrowing depends on the heterogeneity among historical data and current data. A joint prior distribution over the historical and current data sets is specified with the degree of heterogeneity across the data sets controlled by the variance of the joint distribution. We apply Bayesian dynamic borrowing to both single-level and multilevel models and compare this approach to other historical borrowing methods such as complete pooling, Bayesian synthesis, and power priors. Two case studies using data from the Program for International Student Assessment reveal the utility of Bayesian dynamic borrowing in terms of predictive accuracy. This is followed by two simulation studies that reveal the utility of Bayesian dynamic borrowing over simple pooling and power priors in cases where the historical data is heterogeneous compared to the current data based on bias, mean squared error, and predictive accuracy. In cases of homogeneous historical data, Bayesian dynamic borrowing performs similarly to data pooling, Bayesian synthesis, and power priors. In contrast, for heterogeneous historical data, Bayesian dynamic borrowing performed at least as well, if not better, than other methods of borrowing with respect to mean squared error, percent bias, and leave-one-out cross-validation.Entities:
Keywords: Bayesian dynamic borrowing; large-scale assessments; multilevel modeling; power priors
Year: 2022 PMID: 35687222 PMCID: PMC9185721 DOI: 10.1007/s11336-022-09869-3
Source DB: PubMed Journal: Psychometrika ISSN: 0033-3123 Impact factor: 2.290
Descriptive statistics for all PISA cycles (full US sample).
| Statistics | Cycle | PV1MATH | Female | Lang | PARED | HOMEPOS | IMMIG | TCSHORT | STRATIO |
|---|---|---|---|---|---|---|---|---|---|
| Mean or Proportion | 2003 | 481.86 | 0.50 | 0.91 | 13.47 | 0.31 | 0.87 | 15.80 | |
| 2006 | 471.05 | 0.50 | 0.88 | 13.49 | 0.84 | 0 | 16.23 | ||
| 2009 | 482.94 | 0.50 | 0.86 | 13.49 | 0.04 | 0.81 | 16.32 | ||
| 2012 | 481.98 | 0.49 | 0.86 | 13.58 | 0.18 | 0.79 | 17.17 | ||
| 2015 | 468.74 | 0.50 | 0.82 | 13.54 | 0.19 | 0.77 | 16.32 | ||
| 2018 | 474.30 | 0.50 | 0.86 | 14.03 | 0.79 | 17.58 | |||
| SD | 2003 | 93.64 | 0.50 | 0.28 | 2.55 | 1.01 | 0.34 | 0.91 | 5.62 |
| 2006 | 87.64 | 0.50 | 0.32 | 2.48 | 0.96 | 0.37 | 0.96 | 4.73 | |
| 2009 | 89.35 | 0.50 | 0.34 | 2.56 | 0.95 | 0.40 | 0.82 | 5.24 | |
| 2012 | 89.84 | 0.50 | 0.35 | 2.66 | 1.11 | 0.41 | 0.93 | 10.26 | |
| 2015 | 88.74 | 0.50 | 0.39 | 2.81 | 1.11 | 0.42 | 1.08 | 4.87 | |
| 2018 | 91.67 | 0.50 | 0.35 | 2.49 | 1.15 | 0.40 | 1.01 | 10.08 | |
| Percent Missing | 2003 | 0 | 0 | 0.03 | 0.03 | 0.01 | 0.03 | 0.01 | 0.07 |
| 2006 | 0 | 0 | 0.03 | 0.01 | 0.01 | 0.03 | 0.01 | 0.17 | |
| 2009 | 0 | 0 | 0.02 | 0.02 | 0.01 | 0.02 | 0.01 | 0.12 | |
| 2012 | 0 | 0 | 0.02 | 0.02 | 0.01 | 0.03 | 0.01 | 0.04 | |
| 2015 | 0 | 0 | 0.01 | 0.02 | 0.01 | 0.04 | 0 | 0.11 | |
| 2018 | 0 | 0 | 0.01 | 0.02 | 0.01 | 0.03 | 0.02 | 0.10 |
Posterior Means and Standard Deviations (SD) of Coefficients for Case Study 1 (Single-Level Model).
| Cycle | Method | Intercept | FEMALE | PARED | HOMEPOS | IMMIG | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Mean | SD | Mean | SD | Mean | SD | Mean | SD | Mean | SD | ||
| 2003 | BLR non inf | 416.89 | 6.74 | 2.24 | 3.62 | 0.49 | 33.54 | 1.23 | 12.08 | 3.52 | |
| 2006 | BLR non inf | 387.85 | 6.88 | 2.20 | 6.61 | 0.49 | 25.91 | 1.22 | 8.04 | 3.15 | |
| 2009 | BLR non inf | 417.92 | 6.88 | 2.29 | 5.17 | 0.51 | 27.71 | 1.27 | 3.85 | 3.00 | |
| 2012 | BLR non inf | 408.43 | 6.90 | 2.34 | 5.62 | 0.52 | 21.17 | 1.17 | 3.12 | ||
| 2015 | BLR non inf | 405.80 | 5.93 | 2.19 | 4.72 | 0.46 | 19.61 | 1.07 | 2.78 | ||
| 2018 | BLR non inf | 426.29 | 7.64 | 2.42 | 4.68 | 0.54 | 25.02 | 1.14 | 3.15 | ||
| BLR inf | 419.55 | 5.13 | 1.68 | 4.66 | 0.35 | 25.14 | 0.79 | 2.12 | |||
| BLR pooling | 411.46 | 2.78 | 0.96 | 5.00 | 0.21 | 25.00 | 0.46 | 0.34 | 1.25 | ||
| BDB IG(1,1) | 426.52 | 7.40 | 2.36 | 4.66 | 0.53 | 25.03 | 1.11 | 3.07 | |||
| BDB IG(1,.1) | 426.23 | 7.24 | 2.39 | 4.67 | 0.51 | 25.01 | 1.11 | 3.05 | |||
| BDB IG(1,.001) | 425.41 | 6.58 | 2.15 | 4.68 | 0.46 | 25.03 | 1.08 | 3.04 | |||
| PP (.25) | 416.67 | 4.55 | 1.56 | 4.87 | 0.33 | 25.15 | 0.77 | 2.04 | |||
| PP (.50) | 413.72 | 3.71 | 1.25 | 4.94 | 0.27 | 25.07 | 0.63 | 1.63 | |||
| PP (.75) | 412.12 | 3.14 | 1.08 | 4.99 | 0.23 | 25.02 | 0.54 | 1.45 | |||
BLR non inf: Bayesian linear regression with non-informative prior; BLR inf: Bayesian linear regression with informative prior; BDB: Bayesian dynamic borrowing; IG: inverse-gamma prior for the variance of the joint prior distribution, which determines the degree of borrowing; PP: power prior.
Fig. 1Total effective sample size (upper panel), and LOOIC (lower panel) for single-level case study (full US sample). The horizontal axis represents the methods used under Bayesian linear regression (BLR), the priors used under Bayesian dynamic borrowing (BDB), and the a parameter used for power priors (PP).
Posterior means and standard deviations (SD) of individual-level coefficients for case study 2 (multilevel model).
| Cycle | Method | Intercept | FEMALE | PARED | HOMEPOS | IMMIG | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Mean | SD | Mean | SD | Mean | SD | Mean | SD | Mean | SD | ||
| 2003 | BLR non inf | 451.06 | 11.53 | 2.59 | 2.56 | 0.54 | 26.36 | 1.39 | 3.52 | 4.15 | |
| 2006 | BLR non inf | 421.04 | 13.19 | 2.15 | 4.53 | 0.49 | 18.35 | 1.26 | 2.48 | 3.50 | |
| 2009 | BLR non inf | 456.17 | 12.78 | 2.49 | 3.25 | 0.49 | 19.42 | 1.33 | 3.13 | 3.26 | |
| 2012 | BLR non inf | 433.41 | 9.61 | 2.54 | 3.95 | 0.53 | 15.17 | 1.21 | 3.46 | ||
| 2015 | BLR non inf | 422.29 | 12.09 | 2.28 | 3.32 | 0.47 | 15.00 | 1.11 | 0.67 | 3.10 | |
| 2018 | BLR non inf | 435.74 | 9.92 | 2.57 | 3.29 | 0.57 | 19.55 | 1.26 | 3.54 | ||
| BLR inf | 435.28 | 7.69 | 1.74 | 3.33 | 0.38 | 19.62 | 0.89 | 2.54 | |||
| BLR pooling | 434.32 | 4.33 | 0.96 | 3.49 | 0.21 | 18.54 | 0.51 | 1.41 | |||
| BDB IG(1,1) W2 | 435.93 | 9.72 | 2.44 | 3.27 | 0.55 | 19.48 | 1.19 | 3.46 | |||
| BDB IG(1,.1) W2 | 436.17 | 9.55 | 2.46 | 3.25 | 0.55 | 19.42 | 1.19 | 3.42 | |||
| BDB IG(1,.001) W2 | 437.26 | 8.53 | 2.21 | 3.28 | 0.48 | 19.36 | 1.13 | 3.26 | |||
| BDB IG(1,1) W20 | 435.90 | 9.92 | 2.46 | 3.26 | 0.55 | 19.42 | 1.21 | 3.47 | |||
| BDB IG(1,.1) W20 | 436.01 | 9.86 | 2.44 | 3.26 | 0.55 | 19.41 | 1.19 | 3.46 | |||
| BDB IG(1,.001) W20 | 437.39 | 8.50 | 2.16 | 3.28 | 0.47 | 19.34 | 1.14 | 3.26 | |||
| PP (.25) | 426.05 | 5.74 | 1.64 | 4.02 | 0.36 | 20.91 | 0.84 | 2.28 | |||
| PP (.5) | 430.96 | 4.98 | 1.28 | 3.71 | 0.28 | 19.58 | 0.66 | 1.87 | |||
| PP (.75) | 433.14 | 4.48 | 1.09 | 3.56 | 0.24 | 18.92 | 0.56 | 1.59 | |||
BLR non inf: Bayesian linear regression with non-informative prior; BLR inf: Bayesian linear regression with informative prior; BDB: Bayesian dynamic borrowing; IG: inverse-gamma prior for level-1 variance of the joint prior distribution, which determines the degree of level-1 borrowing; W2: Wishart prior with weak borrowing for level-2 precision matrix (results were converted back the covariance matrix); W20: Wishart prior with strong borrowing for level-2 precision matrix (results were converted back the covariance matrix); PP: power priors.
Posterior means and standard deviations (SD) of school-level coefficients for case study 2 (multilevel model).
| Cycle | Method | TCSHORT | STRATIO | FEMALE:TCSHORT | |||
|---|---|---|---|---|---|---|---|
| Mean | SD | Mean | SD | Mean | SD | ||
| 2003 | BLR non inf | 3.35 | 0.52 | 0.09 | 2.80 | ||
| 2006 | BLR non inf | 3.41 | 0.65 | 2.27 | |||
| 2009 | BLR non inf | 4.07 | 0.61 | 2.64 | |||
| 2012 | BLR non inf | 3.61 | 0.06 | 0.31 | 0.93 | 2.45 | |
| 2015 | BLR non inf | 2.94 | 0.03 | 0.60 | 2.09 | ||
| 2018 | BLR non inf | 3.18 | 0.33 | 0.29 | 1.85 | 2.52 | |
| BLR inf | 2.29 | 0.15 | 0.25 | 0.76 | 1.74 | ||
| BLR pooling | 1.39 | 0.18 | 0.96 | ||||
| BDB IG(1,1) W2 | 3.16 | 0.34 | 0.29 | 1.80 | 2.40 | ||
| BDB IG(1,.1) W2 | 3.49 | 0.32 | 0.28 | 1.79 | 2.36 | ||
| BDB IG(1,.001) W2 | 2.48 | 0.17 | 0.26 | 0.96 | 1.91 | ||
| BDB IG(1,1) W20 | 3.22 | 0.34 | 0.29 | 1.78 | 2.40 | ||
| BDB IG(1,.1) W20 | 3.14 | 0.34 | 0.29 | 1.81 | 2.37 | ||
| BDB IG(1,.001) W20 | 2.51 | 0.17 | 0.26 | 0.97 | 1.93 | ||
| PP (.25) | 1.49 | 0.02 | 0.16 | 0.81 | 1.65 | ||
| PP (.5) | 1.40 | 0.17 | 0.29 | 1.28 | |||
| PP (.75) | 1.38 | 0.17 | 0.06 | 1.09 | |||
BLR non inf: Bayesian linear regression with non-informative priors; BLR inf: Bayesian linear regression with informative priors; BDB: Bayesian dynamic borrowing; IG: inverse-gamma prior for level-1 variance of the joint prior distribution, which determines the degree of level-1 borrowing; W2: Wishart prior with weak borrowing for level-2 precision matrix (results were converted back the covariance matrix); W20: Wishart prior with strong borrowing for level-2 precision matrix (results were converted back the covariance matrix); PP: power priors.
Posterior means of variation parameters for case study 2 (multilevel model).
| Cycle | Method | Level-1 SD | Level-2 Var.-Intercept | Level-2 Covar. | Level-2 Var.-FEMALE |
|---|---|---|---|---|---|
| 2003 | BLR non inf | 77.12 | 1069.02 | 10.40 | |
| 2006 | BLR non inf | 76.42 | 1185.44 | 7.98 | |
| 2009 | BLR non inf | 73.83 | 1358.10 | 3.57 | 10.67 |
| 2012 | BLR non inf | 74.35 | 1221.41 | 4.43 | |
| 2015 | BLR non inf | 78.41 | 1152.04 | 18.66 | |
| 2018 | BLR non inf | 79.21 | 949.58 | 37.80 | |
| BLR inf | 79.22 | 951.49 | 34.35 | ||
| BLR pooling | 76.56 | 1255.06 | 16.36 | ||
| BDB IG(1,1) W2 | 76.48 | 972.51 | 10.16 | ||
| BDB IG(1,.1) W2 | 76.48 | 974.05 | 9.71 | ||
| BDB IG(1,.001) W2 | 76.48 | 971.30 | 9.82 | ||
| BDB IG(1,1) W20 | 76.47 | 1016.67 | 15.22 | ||
| BDB IG(1,.1) W20 | 76.47 | 1015.61 | 15.66 | ||
| BDB IG(1,.001) W20 | 76.46 | 1011.85 | 14.68 | ||
| PP (.25) | 79.88 | 618.45 | 6.33 | ||
| PP (.5) | 77.99 | 974.91 | 5.70 | ||
| PP (.75) | 77.08 | 1151.31 | 9.43 |
BLR non inf: Bayesian linear regression with non-informative priors; BLR inf: Bayesian linear regression with informative priors; BDB: Bayesian dynamic borrowing; IG: inverse-gamma prior for level-1 variance of the joint prior distribution, which determines the degree of level-1 borrowing; W2: Wishart prior with weak borrowing for level-2 precision matrix (results were converted back the covariance matrix); W20: Wishart prior with strong borrowing for level-2 precision matrix (results were converted back the covariance matrix); PP: power priors.
Fig. 2Total effective sample size (upper panel), and LOOIC (lower panel) for multilevel case study (full US sample). The horizontal axis represents the methods used under Bayesian linear regression (BLR), the priors used under Bayesian dynamic borrowing (BDB) under weak (Wishart Prior W2) or strong (Wishart Prior W20) borrowing at level-2, and the a parameter used for power priors (PP).
Fig. 3Log MSE (FIG. 3a, upper left), percent bias (FIG. 3b, upper right), total effective sample size (FIG. 3c, lower left), and LOOIC (FIG. 3d, lower right) for Simulation Study 1 (). The horizontal axis represents heterogeneity conditions. Each line within the figures represents methods examined under Bayesian linear regression (BLR), the priors used under Bayesian dynamic borrowing (BDB), and the a parameter used for power priors (PP).
Fig. 4Log MSE (FIG. 4a, upper left), Percent bias (FIG. 4b, upper right), Total Effective sample size (FIG. 4c, lower left), and LOOIC (FIG. 4d, lower right) for Simulation Study 1 (). The horizontal axis represents heterogeneity conditions. Each line within the figures represents methods examined under Bayesian linear regression (BLR), the priors used under Bayesian dynamic borrowing (BDB) for weak (W2) or strong (W20) borrowing, and the a parameter used for power priors (PP).
Fig. 5Log MSE (Figure. 5a, upper left), Percent bias (Figure. 5b, upper right), Total Effective sample size (Figure. 5c, lower left), and LOOIC (Figure. 5d, lower right) for Simulation Study 2 (; 30 Schools, 20 Students Each). The horizontal axis represents heterogeneity conditions. Each line within the figures represents methods examined under Bayesian linear regression (BLR), the priors used under Bayesian dynamic borrowing (BDB), and the a parameter used for power priors (PP).