Abstract
In educational large-scale assessment studies such as PISA, item response theory (IRT) models are used to summarize students' performance on cognitive test items across countries. This article investigates how the choice of the IRT model affects the distribution parameters of countries (i.e., mean, standard deviation, percentiles). Eleven different IRT models are compared using information criteria. Moreover, model uncertainty is quantified by estimating model error, which can be compared with the sampling error associated with the sampling of students. The PISA 2009 dataset for the cognitive domains mathematics, reading, and science is used to illustrate the consequences of the choice of the IRT model. The three-parameter logistic IRT model with residual heterogeneity and a three-parameter IRT model with a quadratic effect of the ability θ provided the best model fit. Furthermore, model uncertainty was relatively small compared to sampling error for country means in most cases but was substantial for country standard deviations and percentiles. Consequently, it can be argued that model error should be included in the statistical inference of educational large-scale assessment studies.
Keywords: PISA; item response model; model uncertainty; scaling
Year: 2022 PMID: 35741481 PMCID: PMC9223051 DOI: 10.3390/e24060760
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.738
Figure 1. Item response functions (left panel) and locally optimal weights (right panel) for the 1PL, 1PCL and 1PLL models.
Figure 2. Item response functions (left panel) and locally optimal weights (right panel) for the 4PL, 3PL and 2PL models.
Figure 3. Item response functions (left panel) and locally optimal weights (right panel) for different IRFs of the 3PLRH model.
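The logistic item response functions named in the captions of Figures 1 and 2 can be illustrated with a short sketch. It assumes the common parameterization with discrimination `a`, difficulty `b`, guessing `g`, and slipping `s`; these symbols and the function names are illustrative assumptions, not the article's notation, and the 1PCL, 1PLL and 3PLRH models are not covered.

```python
import math


def logistic(x: float) -> float:
    """Standard logistic function 1 / (1 + exp(-x))."""
    return 1.0 / (1.0 + math.exp(-x))


def irf_4pl(theta: float, a: float = 1.0, b: float = 0.0,
            g: float = 0.0, s: float = 0.0) -> float:
    """Probability of a correct response under an assumed 4PL form.

    g is the lower asymptote (guessing) and 1 - s the upper asymptote
    (s = slipping), so P = g + (1 - s - g) * logistic(a * (theta - b)).
    """
    return g + (1.0 - s - g) * logistic(a * (theta - b))


def irf_3pl(theta: float, a: float, b: float, g: float) -> float:
    """3PL as the special case of the 4PL with no slipping (s = 0)."""
    return irf_4pl(theta, a=a, b=b, g=g, s=0.0)


def irf_2pl(theta: float, a: float, b: float) -> float:
    """2PL as the special case with neither guessing nor slipping."""
    return irf_4pl(theta, a=a, b=b)


def irf_1pl(theta: float, b: float) -> float:
    """1PL as the special case with a common discrimination of 1."""
    return irf_4pl(theta, a=1.0, b=b)


if __name__ == "__main__":
    # Hypothetical item parameters, chosen only for illustration.
    for theta in (-2.0, 0.0, 2.0):
        print(theta,
              round(irf_1pl(theta, b=0.0), 3),
              round(irf_2pl(theta, a=1.5, b=0.0), 3),
              round(irf_3pl(theta, a=1.5, b=0.0, g=0.2), 3),
              round(irf_4pl(theta, a=1.5, b=0.0, g=0.2, s=0.1), 3))
```

Writing the simpler models as restrictions of the 4PL makes the nesting explicit: each additional parameter only changes an asymptote or the slope of the same logistic curve.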
Model comparisons based on information criteria for the three ability domains—mathematics, reading and science—in PISA 2009.
| | Mathematics | | | Reading | | | Science | | |
|---|---|---|---|---|---|---|---|---|---|
| Model | AIC | BIC | ΔGHP | AIC | BIC | ΔGHP | AIC | BIC | ΔGHP |
| 1PL | 217510 | 217779 | 0.0059 | 413555 | 414317 | 0.0055 | 347819 | 348222 | 0.0062 |
| 1PCL | 220022 | 220291 | 0.0122 | 414757 | 415519 | 0.0070 | 348756 | 349160 | 0.0077 |
| 1PLL | 216882 | 217151 | 0.0043 | 416988 | 417751 | 0.0098 | 348984 | 349388 | 0.0081 |
| 1PGL | 216784 | 217068 | 0.0041 | 413369 | 414146 | 0.0053 | 347804 | 348223 | 0.0062 |
| 2PL | 215621 | 216144 | 0.0012 | 410032 | | 0.0011 | 344597 | 345389 | 0.0009 |
| 4PGL | | 216188 | | | 412182 | | | 345648 | |
| 3PLQ | | | | 409327 | | | | | |
| 3PLRH | | | | 409275 | | | | | |
| 3PL | 215486 | 216099 | 0.0009 | 409767 | | 0.0008 | 344420 | | 0.0006 |
| 4PL | | | | 409296 | 411852 | | | | |
| 4PLQ | | 216102 | | | 411913 | | | 345464 | |
Note. AIC = Akaike information criterion; BIC = Bayesian information criterion; ΔGHP = difference in Gilula–Haberman penalty (GHP) between a particular model and the best-fitting model in terms of GHP. For model descriptions, see Section 2.1 and Equations (3) to (14). For AIC and BIC, the best-fitting model and models whose information criteria did not deviate from the minimum value by more than 100 are printed in bold. For ΔGHP, the model with the smallest value and models with ΔGHP values smaller than 0.0005 are printed in bold.
Figure 4. Dendrogram of cluster analysis using the Ward method for 11 different scaling models based on the distance matrix defined as average absolute differences between country means of models for PISA 2009 reading data.
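The distance underlying the dendrogram in Figure 4, the average absolute difference between the country means of two models, can be sketched as follows. The example uses only three models and three countries whose reading means appear in the detailed results table, so the resulting numbers are illustrative and do not reproduce the full 11-model distance matrix; the function and variable names are assumptions.

```python
from itertools import combinations

# Country means for PISA 2009 reading (subset of the detailed results table):
# three of the 11 scaling models and three of the countries.
country_means = {
    "1PL": {"AUS": 515.1, "CHE": 501.3, "CZE": 479.5},
    "2PL": {"AUS": 515.7, "CHE": 501.5, "CZE": 480.1},
    "3PL": {"AUS": 515.0, "CHE": 501.8, "CZE": 480.4},
}


def mean_abs_diff(means_a: dict, means_b: dict) -> float:
    """Average absolute difference in country means between two models."""
    countries = sorted(set(means_a) & set(means_b))
    return sum(abs(means_a[c] - means_b[c]) for c in countries) / len(countries)


# Pairwise distances between models; a full analysis would feed the complete
# matrix into a hierarchical clustering routine using the Ward method.
distance = {
    (m1, m2): mean_abs_diff(country_means[m1], country_means[m2])
    for m1, m2 in combinations(country_means, 2)
}

if __name__ == "__main__":
    for pair, d in distance.items():
        print(pair, round(d, 3))
```

Models whose country means nearly coincide end up with small distances and are merged early in the dendrogram.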
Detailed results for all 11 different scaling models for country means in PISA 2009 reading.
| CNT | M | rg | MEbc | 1PL | 1PCL | 1PLL | 1PGL | 2PL | 4PGL | 3PLQ | 3PLRH | 3PL | 4PL | 4PLQ |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AUS | 515.2 | 1.25 | 0.29 | 515.1 | 515.8 | 514.8 | 515.2 | 515.7 | 515.2 | 515.2 | 515.5 | 515.0 | 515.0 | 514.5 |
| AUT | 470.8 | 2.36 | 0.65 | 470.2 | | 470.6 | 470.1 | 470.9 | | 471.6 | 471.7 | 470.6 | 471.6 | |
| BEL | 509.5 | 2.91 | 0.78 | 508.9 | | 509.4 | 508.8 | 509.7 | | 510.4 | 510.5 | 509.4 | | |
| CAN | 525.0 | 1.79 | 0.43 | 525.1 | 525.6 | 525.2 | 525.1 | 525.4 | 524.3 | 524.5 | 524.8 | 524.9 | 524.0 | |
| CHE | 501.7 | 1.27 | 0.39 | 501.3 | 501.3 | 501.0 | 501.4 | 501.5 | 502.3 | 502.3 | 502.2 | 501.8 | 502.3 | 502.3 |
| CZE | 479.9 | 0.89 | 0.27 | 479.5 | 480.2 | 479.5 | 479.6 | 480.1 | 480.0 | 480.0 | 479.8 | 480.4 | 480.1 | 480.0 |
| DEU | 498.5 | 1.83 | 0.39 | 498.2 | 499.3 | | 498.5 | 498.4 | 499.0 | 498.9 | 498.9 | 498.7 | 498.8 | 499.1 |
| DNK | 493.7 | 5.46 | 1.58 | | | 492.9 | | | | | | 493.5 | | |
| ESP | 480.1 | 1.43 | 0.43 | 480.0 | 480.7 | 479.5 | 480.1 | 480.3 | 479.6 | 479.8 | 479.6 | 480.9 | 479.7 | 479.7 |
| EST | 501.5 | 2.43 | 0.75 | 501.2 | | | 501.4 | 502.0 | 500.9 | 501.0 | 501.0 | | 500.7 | 500.8 |
| FIN | 539.0 | 1.66 | 0.41 | 539.0 | 538.7 | 539.2 | 538.9 | 538.7 | 539.8 | 539.2 | 539.6 | 538.4 | 539.7 | |
| FRA | 498.0 | 4.54 | 1.13 | 497.4 | | | 497.0 | 497.7 | | | | 497.7 | | |
| GBR | 494.0 | 1.29 | 0.20 | 494.0 | 494.7 | 493.4 | 494.1 | 494.0 | 494.0 | 494.1 | 494.0 | 494.2 | 493.8 | 493.8 |
| GRC | 480.6 | 3.42 | 0.96 | | | | 481.1 | 480.3 | | 480.0 | 479.7 | 480.0 | | |
| HUN | 494.2 | 1.74 | 0.40 | 494.4 | 495.0 | 493.8 | 494.4 | 494.5 | 493.5 | 493.6 | 493.7 | 494.3 | 493.3 | 493.4 |
| IRL | 496.8 | 2.04 | 0.51 | 496.5 | 497.7 | | 496.8 | 497.4 | 496.4 | 496.6 | 496.6 | 497.5 | 496.5 | 496.4 |
| ISL | 501.2 | 0.78 | 0.15 | 501.3 | 501.6 | 501.5 | 501.2 | 501.3 | 501.1 | 500.8 | 501.0 | 500.8 | 501.3 | 501.2 |
| ITA | 486.5 | 1.37 | 0.32 | 486.3 | 485.6 | 486.6 | 486.2 | 486.8 | 486.7 | 487.0 | 486.9 | 486.6 | 486.8 | 486.9 |
| JPN | 521.3 | 7.70 | 1.60 | 522.3 | | | 521.4 | 520.4 | 521.6 | 521.0 | 520.7 | | 522.2 | 522.2 |
| KOR | 539.7 | 4.03 | 1.45 | | | | | 538.7 | | | | | | |
| LUX | 472.7 | 4.38 | 1.22 | 471.7 | | 473.0 | | 473.2 | | | | 472.5 | | |
| NLD | 509.0 | 1.57 | 0.28 | 509.1 | 509.8 | 508.2 | 509.4 | 508.6 | 508.9 | 509.1 | 508.7 | 508.8 | 509.2 | 509.1 |
| NOR | 503.3 | 0.89 | 0.14 | 503.3 | 503.6 | 503.7 | 503.1 | 503.2 | 503.3 | 503.2 | 503.0 | 503.3 | 503.7 | 503.9 |
| POL | 501.7 | 2.24 | 0.72 | 501.0 | 501.2 | | 501.3 | 502.2 | 502.0 | 502.5 | 502.2 | 502.7 | 502.2 | 502.1 |
| PRT | 489.2 | 2.79 | 0.70 | 489.4 | | | 489.8 | 489.3 | 488.3 | 488.5 | 488.4 | 489.9 | 488.3 | 488.3 |
| SWE | 497.0 | 0.34 | 0.00 | 496.9 | 497.0 | 497.0 | 496.9 | 496.9 | 497.2 | 497.0 | 497.1 | 496.9 | 497.1 | 497.2 |
Note. CNT = country label (see Appendix B); M = weighted mean across different scaling models; rg = range of estimates across models; MEbc = bias-corrected estimate of model error based on balanced half sampling (see Equation (23)). For model descriptions, see Section 2.1 and Equations (3) to (14). Country means that differ by more than 1 from the weighted mean of the country means of the 11 different models are printed in bold.
Detailed results for all 11 different scaling models for country means in PISA 2009 mathematics.
| CNT | M | rg | MEbc | 1PL | 1PCL | 1PLL | 1PGL | 2PL | 4PGL | 3PLQ | 3PLRH | 3PL | 4PL | 4PLQ |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AUS | 511.2 | 0.72 | 0.02 | 511.3 | 510.9 | 510.8 | 511.1 | 511.4 | 511.4 | 511.2 | 511.2 | 511.5 | 511.2 | 511.3 |
| AUT | 492.5 | 2.90 | 0.71 | 492.7 | | | | 492.7 | 492.4 | 492.3 | 492.9 | | 492.1 | 492.1 |
| BEL | 512.4 | 2.99 | 0.86 | 513.0 | | | | 511.6 | 512.2 | 512.4 | 512.1 | 511.5 | 512.3 | 512.2 |
| CAN | 523.0 | 2.17 | 0.62 | 522.5 | | 522.7 | 522.9 | 523.8 | 523.1 | 523.1 | 523.2 | 524.0 | 523.0 | 523.0 |
| CHE | 533.5 | 6.22 | 1.44 | 532.5 | | | | 533.9 | | 534.4 | 534.4 | 533.4 | | |
| CZE | 488.1 | 1.21 | 0.20 | 488.2 | 488.9 | 487.8 | 487.7 | 488.5 | 487.8 | 487.8 | 488.0 | 488.0 | 487.7 | 487.7 |
| DEU | 508.9 | 2.46 | 0.89 | 509.7 | 508.9 | | | 508.1 | 508.3 | 507.9 | 508.2 | 508.0 | 508.1 | 508.0 |
| DNK | 497.4 | 3.52 | 0.93 | 498.0 | | | 496.6 | 497.6 | | 496.4 | | 497.9 | | 496.4 |
| ESP | 478.9 | 0.53 | 0.06 | 479.1 | 479.0 | 478.8 | 478.8 | 478.6 | 479.1 | 478.9 | 479.0 | 478.6 | 478.9 | 478.9 |
| EST | 508.1 | 5.35 | 1.35 | 507.6 | | | | 508.9 | 507.8 | 507.9 | 507.7 | | 507.9 | 507.9 |
| FIN | 538.1 | 5.13 | 1.27 | | | 538.2 | 537.9 | 537.9 | | | | 538.2 | | |
| FRA | 490.8 | 1.79 | 0.50 | 491.3 | 490.0 | 491.6 | 491.8 | 490.0 | 490.4 | 490.7 | 490.6 | 490.4 | 490.5 | 490.5 |
| GBR | 486.9 | 2.30 | 0.53 | 486.6 | 486.9 | | | 487.1 | 487.1 | 487.3 | 487.1 | 487.6 | 487.3 | 487.3 |
| GRC | 458.0 | 3.95 | 0.97 | 458.6 | 457.6 | | | 457.3 | 458.3 | 458.0 | 458.2 | | 457.9 | 457.8 |
| HUN | 483.4 | 1.11 | 0.00 | 483.5 | 484.1 | 483.1 | 483.0 | 483.5 | 483.5 | 483.2 | 483.4 | 483.1 | 483.3 | 483.4 |
| IRL | 482.6 | 1.97 | 0.55 | 482.1 | 482.1 | | 482.0 | 483.1 | 483.0 | 483.0 | 482.7 | 483.6 | 483.2 | 483.2 |
| ISL | 501.0 | 3.02 | 0.74 | 501.5 | | 500.1 | 500.2 | 500.7 | 500.0 | 500.4 | 500.1 | 501.3 | 500.3 | 500.4 |
| ITA | 478.0 | 0.88 | 0.18 | 478.1 | 478.6 | 478.1 | 477.8 | 477.7 | 478.2 | 478.2 | 478.2 | 477.8 | 478.2 | 478.2 |
| JPN | 529.9 | 3.06 | 1.11 | | 529.1 | 529.1 | 528.9 | 530.5 | | | | 530.5 | | |
| KOR | 544.7 | 7.87 | 2.45 | | | | | 545.6 | | | | 545.6 | | |
| LUX | 483.4 | 1.55 | 0.46 | 483.8 | 482.8 | 484.1 | 484.0 | 482.7 | 483.7 | 483.3 | 483.7 | 482.5 | 483.4 | 483.5 |
| NLD | 521.5 | 1.98 | 0.51 | 522.0 | | 521.4 | 521.5 | 521.2 | 520.8 | 520.8 | 520.8 | 521.5 | 520.6 | 520.7 |
| NOR | 493.3 | 4.11 | 0.87 | 493.4 | | | | 493.5 | 492.9 | 493.0 | 492.8 | 493.9 | 493.0 | 493.0 |
| POL | 487.0 | 1.22 | 0.15 | 487.1 | 488.0 | 486.8 | 486.7 | 487.1 | 486.9 | 486.9 | 486.8 | 486.8 | 487.0 | 486.9 |
| PRT | 480.1 | 2.26 | 0.49 | 479.8 | | 479.7 | 480.0 | 480.3 | 480.7 | 480.8 | 481.0 | 480.2 | 480.8 | 480.7 |
| SWE | 487.4 | 1.44 | 0.47 | 488.1 | 488.3 | 487.4 | 487.6 | 486.8 | 487.2 | 487.0 | 487.0 | 486.9 | 487.0 | 487.1 |
Note. CNT = country label (see Appendix B); M = weighted mean across different scaling models; rg = range of estimates across models; MEbc = bias-corrected estimate of model error based on balanced half sampling (see Equation (23)). For model descriptions, see Section 2.1 and Equations (3) to (14). Country means that differ by more than 1 from the weighted mean of the country means of the 11 different models are printed in bold.
Detailed results for all 11 different scaling models for country means in PISA 2009 science.
| CNT | M | rg | MEbc | 1PL | 1PCL | 1PLL | 1PGL | 2PL | 4PGL | 3PLQ | 3PLRH | 3PL | 4PL | 4PLQ |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AUS | 517.6 | 2.73 | 0.83 | 518.4 | 518.1 | | 518.3 | 516.7 | 517.3 | 517.1 | 517.2 | | 517.2 | 517.1 |
| AUT | 488.1 | 1.11 | 0.18 | 487.9 | 488.6 | 487.6 | 488.0 | 488.4 | 488.3 | 488.4 | 488.7 | 487.9 | 488.3 | 488.2 |
| BEL | 498.1 | 2.37 | 0.55 | 497.8 | | 498.9 | 497.7 | 498.5 | 498.6 | 498.5 | 498.5 | 498.2 | 498.7 | 498.6 |
| CAN | 519.6 | 0.65 | 0.09 | 519.6 | 519.5 | 520.0 | 519.6 | 519.4 | 519.6 | 519.6 | 519.5 | 519.6 | 519.4 | 519.6 |
| CHE | 509.2 | 0.96 | 0.35 | 508.7 | 508.8 | 508.9 | 508.7 | 509.5 | 509.6 | 509.7 | 509.7 | 509.4 | 509.6 | 509.6 |
| CZE | 494.1 | 2.89 | 0.98 | | | 494.5 | | 493.5 | | | | 493.6 | | |
| DEU | 513.9 | 2.13 | 0.53 | 514.2 | | 514.7 | 514.2 | 514.0 | 513.3 | 513.1 | 513.5 | 513.7 | 513.1 | |
| DNK | 488.3 | 4.70 | 1.89 | | | | | | | | | | | |
| ESP | 478.2 | 2.07 | 0.42 | 478.2 | 479.0 | | 478.4 | 478.1 | 477.8 | 477.9 | 477.7 | 478.7 | 478.0 | 478.0 |
| EST | 517.5 | 1.00 | 0.23 | 517.4 | 517.2 | 517.9 | 517.3 | 517.4 | 517.6 | 517.2 | 517.4 | 518.2 | 517.4 | 517.2 |
| FIN | 546.5 | 3.54 | 0.79 | 547.1 | 546.3 | | 546.9 | 546.0 | 546.4 | 546.0 | 546.1 | | 546.3 | 546.1 |
| FRA | 488.2 | 3.74 | 1.02 | | | 488.3 | | 488.9 | | | | 488.8 | | |
| GBR | 505.0 | 1.12 | 0.28 | 504.7 | 504.8 | 505.2 | 504.7 | 504.9 | 505.8 | 505.4 | 505.4 | 504.7 | 505.5 | 505.6 |
| GRC | 461.4 | 4.51 | 1.26 | | | 461.6 | | 462.4 | | | | 462.1 | | |
| HUN | 494.6 | 5.05 | 1.36 | | | | | 493.9 | | | | 494.5 | | |
| IRL | 497.0 | 0.95 | 0.27 | 497.3 | 497.4 | 497.4 | 497.3 | 496.7 | 496.8 | 496.5 | 496.7 | 496.7 | 496.5 | 496.6 |
| ISL | 487.6 | 3.34 | 1.09 | | 487.4 | | 486.6 | | 488.4 | 488.2 | 488.4 | | 488.1 | 488.2 |
| ITA | 479.7 | 0.57 | 0.17 | 479.9 | 479.5 | 479.5 | 479.9 | 479.8 | 479.5 | 479.4 | 479.3 | 479.7 | 479.3 | 479.3 |
| JPN | 534.6 | 7.85 | 2.29 | | | 534.6 | | | | | | 535.0 | | |
| KOR | 530.6 | 3.57 | 1.42 | | | | | 531.0 | | | | 531.5 | | |
| LUX | 474.8 | 3.49 | 0.87 | 474.2 | | 475.1 | 474.0 | 475.3 | | | | 474.6 | 475.7 | 475.7 |
| NLD | 514.2 | 2.63 | 0.93 | | | 514.8 | | 513.6 | | | 513.2 | 513.4 | | |
| NOR | 491.0 | 3.24 | 1.10 | | | 491.2 | | 490.5 | | | | 490.6 | | |
| POL | 499.6 | 3.08 | 0.70 | 500.0 | | 498.6 | 500.2 | 499.3 | 499.0 | 498.9 | 499.0 | 499.7 | 498.7 | 498.9 |
| PRT | 483.4 | 4.41 | 0.88 | 483.2 | | | 483.5 | 483.8 | 483.0 | 483.1 | 482.9 | 484.4 | 483.1 | 483.1 |
| SWE | 487.3 | 1.54 | 0.34 | 487.1 | | 487.2 | 487.0 | 487.5 | 487.6 | 487.9 | 487.7 | 487.5 | 487.9 | 487.9 |
Note. CNT = country label (see Appendix B); M = weighted mean across different scaling models; rg = range of estimates across models; MEbc = bias-corrected estimate of model error based on balanced half sampling (see Equation (23)). For model descriptions, see Section 2.1 and Equations (3) to (14). Country means that differ by more than 1 from the weighted mean of the country means of the 11 different models are printed in bold.
Results and model uncertainty of 11 different scaling models for country means and country standard deviations in PISA 2009 reading.
| | | Country Mean | | | | | | | Country Standard Deviation | | | | | | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| CNT | N | M | rg | SE | ME | MEbc | ER | TE | M | rg | SE | ME | MEbc | ER | TE |
| AUS | 14,247 | 515.2 | 1.2 | 2.51 | 0.32 | 0.29 | 0.12 | 2.52 | 104.7 | 2.6 | 1.45 | 0.68 | 0.64 | 0.44 | 1.59 |
| AUT | 6585 | 470.8 | 2.4 | 3.34 | 0.69 | 0.65 | 0.19 | 3.40 | 104.6 | 6.8 | 2.16 | 1.66 | 1.64 | 0.76 | 2.71 |
| BEL | 8500 | 509.5 | 2.9 | 2.49 | 0.80 | 0.78 | 0.32 | 2.61 | 107.5 | 3.1 | 1.92 | 0.69 | 0.65 | 0.34 | 2.02 |
| CAN | 23,200 | 525.0 | 1.8 | 1.49 | 0.45 | 0.43 | 0.29 | 1.55 | 95.6 | 4.6 | 1.12 | 1.18 | 1.18 | 1.05 | 1.62 |
| CHE | 11,801 | 501.7 | 1.3 | 2.72 | 0.42 | 0.39 | 0.14 | 2.75 | 99.7 | 0.8 | 1.67 | 0.23 | 0.00 | 0.00 | 1.67 |
| CZE | 6059 | 479.9 | 0.9 | 3.17 | 0.32 | 0.27 | 0.09 | 3.18 | 95.2 | 1.3 | 1.86 | 0.39 | 0.20 | 0.11 | 1.87 |
| DEU | 4975 | 498.5 | 1.8 | 3.05 | 0.42 | 0.39 | 0.13 | 3.08 | 100.1 | 1.3 | 2.01 | 0.30 | 0.00 | 0.00 | 2.01 |
| DNK | 5920 | 493.7 | 5.5 | 2.10 | 1.58 | 1.58 | 0.75 | 2.63 | 88.0 | 3.5 | 1.31 | 0.70 | 0.68 | 0.52 | 1.48 |
| ESP | 25,828 | 480.1 | 1.4 | 2.12 | 0.44 | 0.43 | 0.20 | 2.17 | 91.9 | 4.6 | 1.18 | 1.16 | 1.13 | 0.96 | 1.64 |
| EST | 4726 | 501.5 | 2.4 | 2.70 | 0.77 | 0.75 | 0.28 | 2.80 | 85.5 | 3.8 | 1.71 | 0.85 | 0.82 | 0.48 | 1.89 |
| FIN | 5807 | 539.0 | 1.7 | 2.27 | 0.43 | 0.41 | 0.18 | 2.30 | 91.5 | 9.8 | 1.31 | 2.68 | 2.68 | 2.05 | 2.98 |
| FRA | 4280 | 498.0 | 4.5 | 3.92 | 1.16 | 1.13 | 0.29 | 4.08 | 112.2 | 1.8 | 2.92 | 0.55 | 0.41 | 0.14 | 2.95 |
| GBR | 12,172 | 494.0 | 1.3 | 2.47 | 0.25 | 0.20 | 0.08 | 2.47 | 99.6 | 2.8 | 1.34 | 0.77 | 0.73 | 0.55 | 1.53 |
| GRC | 4966 | 480.6 | 3.4 | 4.26 | 1.01 | 0.96 | 0.23 | 4.37 | 99.8 | 5.4 | 2.09 | 1.46 | 1.38 | 0.66 | 2.50 |
| HUN | 4604 | 494.2 | 1.7 | 3.62 | 0.46 | 0.40 | 0.11 | 3.64 | 94.8 | 2.7 | 2.78 | 0.67 | 0.58 | 0.21 | 2.84 |
| IRL | 3931 | 496.8 | 2.0 | 3.24 | 0.55 | 0.51 | 0.16 | 3.28 | 98.8 | 4.2 | 2.63 | 1.24 | 1.19 | 0.45 | 2.89 |
| ISL | 3628 | 501.2 | 0.8 | 1.67 | 0.23 | 0.15 | 0.09 | 1.68 | 102.0 | 3.5 | 1.40 | 1.03 | 0.96 | 0.68 | 1.69 |
| ITA | 30,905 | 486.5 | 1.4 | 1.61 | 0.33 | 0.32 | 0.20 | 1.64 | 101.4 | 3.7 | 1.35 | 0.81 | 0.77 | 0.57 | 1.55 |
| JPN | 6082 | 521.3 | 7.7 | 3.71 | 1.62 | 1.60 | 0.43 | 4.04 | 107.3 | 8.0 | 3.16 | 1.59 | 1.52 | 0.48 | 3.50 |
| KOR | 4989 | 539.7 | 4.0 | 3.10 | 1.51 | 1.45 | 0.47 | 3.42 | 84.2 | 8.4 | 1.76 | 2.23 | 2.02 | 1.15 | 2.68 |
| LUX | 4622 | 472.7 | 4.4 | 1.19 | 1.23 | 1.22 | 1.02 | 1.70 | 109.3 | 8.0 | 1.21 | 2.01 | 1.99 | 1.65 | 2.33 |
| NLD | 4760 | 509.0 | 1.6 | 5.58 | 0.35 | 0.28 | 0.05 | 5.59 | 95.1 | 4.1 | 1.89 | 1.12 | 1.01 | 0.54 | 2.14 |
| NOR | 4660 | 503.3 | 0.9 | 2.61 | 0.22 | 0.14 | 0.06 | 2.61 | 96.8 | 3.7 | 1.55 | 0.98 | 0.93 | 0.60 | 1.81 |
| POL | 4917 | 501.7 | 2.2 | 2.72 | 0.72 | 0.72 | 0.26 | 2.81 | 92.8 | 3.6 | 1.32 | 0.90 | 0.84 | 0.63 | 1.56 |
| PRT | 6298 | 489.2 | 2.8 | 3.17 | 0.71 | 0.70 | 0.22 | 3.25 | 91.8 | 3.2 | 1.75 | 0.74 | 0.71 | 0.40 | 1.89 |
| SWE | 4565 | 497.0 | 0.3 | 3.00 | 0.09 | 0.00 | 0.00 | 3.00 | 103.6 | 1.7 | 1.63 | 0.42 | 0.27 | 0.17 | 1.66 |
Note. CNT = country label (see Appendix B); N = sample size; M = weighted mean across different scaling models; rg = range of estimates across models; SE = standard error (computed with balanced half sampling); ME = estimated model error (see Equation (20)); MEbc = bias-corrected estimate of model error based on balanced half sampling (see Equation (23)); ER = error ratio defined as MEbc/SE; TE = total error, computed as $\sqrt{\mathrm{SE}^2 + \mathrm{ME}_{bc}^2}$ (see Equation (24)).
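The two derived columns in this table can be recomputed from SE and MEbc. The sketch below uses the definition of ER given in the note and assumes that the total error combines both error sources as the square root of the sum of their squares, which agrees with the tabulated values up to rounding; the function names are illustrative.

```python
import math


def error_ratio(se: float, me_bc: float) -> float:
    """ER = MEbc / SE: model error relative to sampling error."""
    return me_bc / se


def total_error(se: float, me_bc: float) -> float:
    """TE = sqrt(SE^2 + MEbc^2), the assumed form of the total error."""
    return math.sqrt(se ** 2 + me_bc ** 2)


# (SE, MEbc) for country means in PISA 2009 reading, taken from the table.
rows = {
    "AUT": (3.34, 0.65),
    "DNK": (2.10, 1.58),
    "LUX": (1.19, 1.22),
}

if __name__ == "__main__":
    for cnt, (se, me_bc) in rows.items():
        print(cnt,
              round(error_ratio(se, me_bc), 2),
              round(total_error(se, me_bc), 2))
```

Because the tabulated SE and MEbc are already rounded to two decimals, recomputed values can differ from the printed ER and TE in the last digit.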
Results and model uncertainty of 11 different scaling models for country 10th and 90th percentiles in PISA 2009 reading.
| | | Country 10th Percentile | | | | | | | Country 90th Percentile | | | | | | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| CNT | N | M | rg | SE | ME | MEbc | ER | TE | M | rg | SE | ME | MEbc | ER | TE |
| AUS | 14,247 | 379.5 | 5.5 | 2.98 | 1.52 | 1.49 | 0.50 | 3.33 | 646.8 | 11.2 | 3.33 | 3.10 | 3.04 | 0.91 | 4.51 |
| AUT | 6585 | 332.9 | 20.5 | 4.82 | 5.37 | 5.32 | 1.10 | 7.18 | 602.8 | 4.8 | 3.64 | 1.26 | 1.07 | 0.30 | 3.79 |
| BEL | 8500 | 369.0 | 7.7 | 4.09 | 2.15 | 2.08 | 0.51 | 4.59 | 644.7 | 16.8 | 2.78 | 4.24 | 4.24 | 1.52 | 5.07 |
| CAN | 23,200 | 400.8 | 4.9 | 2.40 | 1.42 | 1.41 | 0.59 | 2.78 | 646.7 | 11.9 | 1.92 | 3.00 | 3.00 | 1.56 | 3.56 |
| CHE | 11,801 | 370.5 | 7.5 | 3.68 | 1.83 | 1.77 | 0.48 | 4.09 | 627.7 | 10.9 | 3.36 | 3.11 | 3.09 | 0.92 | 4.56 |
| CZE | 6059 | 357.5 | 8.4 | 4.67 | 2.19 | 2.13 | 0.46 | 5.14 | 603.3 | 6.2 | 3.18 | 1.58 | 1.53 | 0.48 | 3.53 |
| DEU | 4975 | 366.0 | 7.5 | 4.79 | 1.95 | 1.81 | 0.38 | 5.12 | 624.4 | 9.2 | 2.73 | 2.64 | 2.58 | 0.95 | 3.76 |
| DNK | 5920 | 378.2 | 4.1 | 2.82 | 0.96 | 0.91 | 0.32 | 2.96 | 604.0 | 4.7 | 2.57 | 1.45 | 1.43 | 0.56 | 2.94 |
| ESP | 25,828 | 359.0 | 8.7 | 3.24 | 2.18 | 2.12 | 0.66 | 3.87 | 595.1 | 3.0 | 1.86 | 0.78 | 0.74 | 0.40 | 2.00 |
| EST | 4726 | 390.9 | 7.3 | 3.83 | 1.81 | 1.76 | 0.46 | 4.21 | 610.7 | 6.2 | 3.17 | 1.50 | 1.46 | 0.46 | 3.49 |
| FIN | 5807 | 419.2 | 10.0 | 2.90 | 2.45 | 2.45 | 0.85 | 3.80 | 653.3 | 21.6 | 2.66 | 5.75 | 5.75 | 2.16 | 6.34 |
| FRA | 4280 | 350.5 | 13.8 | 5.93 | 3.68 | 3.59 | 0.60 | 6.93 | 638.6 | 16.3 | 4.92 | 3.88 | 3.82 | 0.78 | 6.23 |
| GBR | 12,172 | 365.9 | 9.9 | 3.00 | 2.57 | 2.57 | 0.86 | 3.95 | 621.7 | 5.0 | 3.01 | 1.45 | 1.39 | 0.46 | 3.31 |
| GRC | 4966 | 350.5 | 16.2 | 6.24 | 3.51 | 3.29 | 0.53 | 7.05 | 607.5 | 3.6 | 3.06 | 1.03 | 0.97 | 0.32 | 3.21 |
| HUN | 4604 | 368.6 | 7.0 | 6.08 | 1.56 | 1.40 | 0.23 | 6.24 | 613.4 | 4.5 | 4.08 | 1.21 | 1.12 | 0.28 | 4.23 |
| IRL | 3931 | 370.0 | 9.6 | 5.61 | 2.45 | 2.38 | 0.43 | 6.09 | 619.7 | 5.7 | 2.84 | 1.31 | 1.24 | 0.44 | 3.10 |
| ISL | 3628 | 366.3 | 6.0 | 2.67 | 1.40 | 1.28 | 0.48 | 2.96 | 628.2 | 11.2 | 2.33 | 2.84 | 2.76 | 1.18 | 3.62 |
| ITA | 30,905 | 352.4 | 12.2 | 2.65 | 2.67 | 2.65 | 1.00 | 3.75 | 613.7 | 7.7 | 1.86 | 2.01 | 2.00 | 1.07 | 2.73 |
| JPN | 6082 | 381.0 | 4.8 | 7.46 | 1.17 | 1.01 | 0.14 | 7.52 | 652.9 | 25.9 | 3.39 | 5.73 | 5.67 | 1.68 | 6.60 |
| KOR | 4989 | 430.5 | 13.8 | 4.18 | 3.53 | 3.31 | 0.79 | 5.33 | 644.5 | 14.7 | 3.51 | 3.68 | 3.60 | 1.02 | 5.03 |
| LUX | 4622 | 328.3 | 24.5 | 2.42 | 6.36 | 6.31 | 2.61 | 6.76 | 609.8 | 5.9 | 1.83 | 1.63 | 1.55 | 0.85 | 2.40 |
| NLD | 4760 | 386.8 | 3.5 | 5.84 | 0.91 | 0.73 | 0.13 | 5.89 | 632.7 | 12.9 | 5.35 | 3.47 | 3.36 | 0.63 | 6.31 |
| NOR | 4660 | 377.1 | 3.5 | 3.47 | 0.85 | 0.77 | 0.22 | 3.55 | 625.7 | 13.7 | 3.28 | 3.45 | 3.45 | 1.05 | 4.76 |
| POL | 4917 | 381.9 | 5.0 | 3.25 | 1.25 | 1.24 | 0.38 | 3.48 | 620.5 | 12.8 | 3.18 | 3.46 | 3.43 | 1.08 | 4.68 |
| PRT | 6298 | 369.9 | 6.6 | 4.51 | 1.43 | 1.34 | 0.30 | 4.70 | 606.8 | 3.5 | 3.20 | 0.83 | 0.74 | 0.23 | 3.29 |
| SWE | 4565 | 363.1 | 8.8 | 3.97 | 2.19 | 2.13 | 0.54 | 4.51 | 627.6 | 7.9 | 3.60 | 2.13 | 2.06 | 0.57 | 4.15 |
Note. CNT = country label (see Appendix B); N = sample size; M = weighted mean across different scaling models; rg = range of estimates across models; SE = standard error (computed with balanced half sampling); ME = estimated model error (see Equation (20)); MEbc = bias-corrected estimate of model error based on balanced half sampling (see Equation (23)); ER = error ratio defined as MEbc/SE; TE = total error, computed as $\sqrt{\mathrm{SE}^2 + \mathrm{ME}_{bc}^2}$ (see Equation (24)).
Results and model uncertainty of 11 different scaling models for country means and country standard deviations in PISA 2009 mathematics.
| | | Country Mean | | | | | | | Country Standard Deviation | | | | | | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| CNT | N | M | rg | SE | ME | MEbc | ER | TE | M | rg | SE | ME | MEbc | ER | TE |
| AUS | 9889 | 511.2 | 0.7 | 2.75 | 0.19 | 0.02 | 0.01 | 2.75 | 101.5 | 2.7 | 1.82 | 0.89 | 0.83 | 0.45 | 2.00 |
| AUT | 4575 | 492.5 | 2.9 | 3.17 | 0.80 | 0.71 | 0.22 | 3.25 | 105.1 | 6.0 | 2.05 | 1.76 | 1.68 | 0.82 | 2.65 |
| BEL | 5978 | 512.4 | 3.0 | 2.39 | 0.88 | 0.86 | 0.36 | 2.54 | 111.5 | 4.2 | 2.20 | 1.36 | 1.32 | 0.60 | 2.56 |
| CAN | 16,040 | 523.0 | 2.2 | 1.70 | 0.62 | 0.62 | 0.37 | 1.81 | 93.5 | 5.5 | 1.28 | 1.73 | 1.73 | 1.35 | 2.16 |
| CHE | 8157 | 533.5 | 6.2 | 3.59 | 1.45 | 1.44 | 0.40 | 3.87 | 105.2 | 7.2 | 1.85 | 2.33 | 2.29 | 1.23 | 2.94 |
| CZE | 4223 | 488.1 | 1.2 | 3.16 | 0.32 | 0.20 | 0.06 | 3.16 | 98.9 | 2.8 | 2.10 | 0.93 | 0.86 | 0.41 | 2.27 |
| DEU | 3503 | 508.9 | 2.5 | 3.45 | 0.91 | 0.89 | 0.26 | 3.56 | 104.6 | 2.3 | 2.27 | 0.86 | 0.73 | 0.32 | 2.38 |
| DNK | 4088 | 497.4 | 3.5 | 2.86 | 0.95 | 0.93 | 0.33 | 3.01 | 91.9 | 1.8 | 1.78 | 0.36 | 0.08 | 0.05 | 1.79 |
| ESP | 17,920 | 478.9 | 0.5 | 2.21 | 0.20 | 0.06 | 0.03 | 2.21 | 95.4 | 6.1 | 1.64 | 1.63 | 1.60 | 0.98 | 2.29 |
| EST | 3279 | 508.1 | 5.3 | 2.82 | 1.37 | 1.35 | 0.48 | 3.13 | 83.5 | 5.9 | 1.96 | 1.60 | 1.56 | 0.80 | 2.50 |
| FIN | 4019 | 538.1 | 5.1 | 2.22 | 1.32 | 1.27 | 0.57 | 2.56 | 87.8 | 8.4 | 1.82 | 2.61 | 2.59 | 1.42 | 3.17 |
| FRA | 2965 | 490.8 | 1.8 | 3.67 | 0.59 | 0.50 | 0.14 | 3.71 | 104.7 | 4.6 | 2.77 | 1.34 | 1.26 | 0.45 | 3.05 |
| GBR | 8431 | 486.9 | 2.3 | 2.77 | 0.59 | 0.53 | 0.19 | 2.82 | 94.2 | 3.1 | 1.75 | 0.90 | 0.82 | 0.47 | 1.93 |
| GRC | 3445 | 458.0 | 3.9 | 4.13 | 1.03 | 0.97 | 0.23 | 4.24 | 97.6 | 9.6 | 2.38 | 2.88 | 2.82 | 1.18 | 3.69 |
| HUN | 3177 | 483.4 | 1.1 | 4.04 | 0.26 | 0.00 | 0.00 | 4.04 | 97.8 | 5.4 | 3.42 | 1.69 | 1.69 | 0.49 | 3.82 |
| IRL | 2745 | 482.6 | 2.0 | 2.89 | 0.61 | 0.55 | 0.19 | 2.94 | 88.3 | 5.0 | 2.02 | 1.41 | 1.36 | 0.67 | 2.44 |
| ISL | 2510 | 501.0 | 3.0 | 2.14 | 0.76 | 0.74 | 0.35 | 2.26 | 95.0 | 2.5 | 2.09 | 0.69 | 0.61 | 0.29 | 2.18 |
| ITA | 21,379 | 478.0 | 0.9 | 2.09 | 0.24 | 0.18 | 0.09 | 2.10 | 98.0 | 5.5 | 1.40 | 1.32 | 1.32 | 0.94 | 1.92 |
| JPN | 4207 | 529.9 | 3.1 | 3.77 | 1.15 | 1.11 | 0.29 | 3.93 | 101.7 | 7.9 | 2.61 | 2.61 | 2.54 | 0.97 | 3.64 |
| KOR | 3447 | 544.7 | 7.9 | 3.71 | 2.52 | 2.45 | 0.66 | 4.45 | 94.0 | 15.7 | 2.38 | 3.90 | 3.75 | 1.58 | 4.45 |
| LUX | 3197 | 483.4 | 1.6 | 1.88 | 0.53 | 0.46 | 0.24 | 1.94 | 103.6 | 5.1 | 1.78 | 1.36 | 1.30 | 0.73 | 2.21 |
| NLD | 3318 | 521.5 | 2.0 | 5.19 | 0.56 | 0.51 | 0.10 | 5.22 | 96.4 | 4.5 | 2.06 | 1.57 | 1.49 | 0.73 | 2.54 |
| NOR | 3230 | 493.3 | 4.1 | 2.76 | 0.88 | 0.87 | 0.32 | 2.89 | 92.6 | 2.8 | 1.47 | 0.85 | 0.74 | 0.50 | 1.65 |
| POL | 3401 | 487.0 | 1.2 | 2.99 | 0.28 | 0.15 | 0.05 | 2.99 | 95.4 | 5.9 | 1.90 | 2.46 | 2.44 | 1.28 | 3.10 |
| PRT | 4391 | 480.1 | 2.3 | 2.99 | 0.54 | 0.49 | 0.16 | 3.03 | 97.7 | 4.7 | 1.93 | 1.53 | 1.49 | 0.77 | 2.44 |
| SWE | 3139 | 487.4 | 1.4 | 3.02 | 0.53 | 0.47 | 0.15 | 3.06 | 99.3 | 3.5 | 1.91 | 1.14 | 1.08 | 0.57 | 2.19 |
Note. CNT = country label (see Appendix B); N = sample size; M = weighted mean across different scaling models; rg = range of estimates across models; SE = standard error (computed with balanced half sampling); ME = estimated model error (see Equation (20)); MEbc = bias-corrected estimate of model error based on balanced half sampling (see Equation (23)); ER = error ratio defined as MEbc/SE; TE = total error, computed as $\sqrt{\mathrm{SE}^2 + \mathrm{ME}_{bc}^2}$ (see Equation (24)).
Results and model uncertainty of 11 different scaling models for country 10th and 90th percentiles in PISA 2009 mathematics.
| | | Country 10th Percentile | | | | | | | Country 90th Percentile | | | | | | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| CNT | N | M | rg | SE | ME | MEbc | ER | TE | M | rg | SE | ME | MEbc | ER | TE |
| AUS | 9889 | 380.2 | 2.2 | 3.12 | 0.76 | 0.61 | 0.20 | 3.18 | 641.9 | 8.4 | 4.06 | 2.65 | 2.56 | 0.63 | 4.80 |
| AUT | 4575 | 355.5 | 16.2 | 4.22 | 4.74 | 4.60 | 1.09 | 6.25 | 627.1 | 7.3 | 3.86 | 2.24 | 2.09 | 0.54 | 4.39 |
| BEL | 5978 | 367.0 | 5.8 | 4.46 | 1.69 | 1.53 | 0.34 | 4.71 | 654.9 | 14.9 | 2.94 | 4.67 | 4.66 | 1.59 | 5.51 |
| CAN | 16,040 | 402.3 | 4.6 | 2.75 | 1.50 | 1.50 | 0.54 | 3.13 | 643.7 | 10.2 | 2.01 | 3.15 | 3.15 | 1.57 | 3.73 |
| CHE | 8157 | 393.9 | 3.6 | 4.29 | 1.14 | 0.97 | 0.23 | 4.40 | 666.6 | 20.3 | 4.16 | 5.76 | 5.71 | 1.37 | 7.07 |
| CZE | 4223 | 361.9 | 9.7 | 4.80 | 2.91 | 2.85 | 0.59 | 5.58 | 617.3 | 2.2 | 3.91 | 0.63 | 0.30 | 0.08 | 3.92 |
| DEU | 3503 | 371.8 | 6.7 | 5.12 | 1.89 | 1.85 | 0.36 | 5.45 | 642.6 | 11.6 | 3.75 | 3.83 | 3.75 | 1.00 | 5.30 |
| DNK | 4088 | 379.2 | 4.5 | 3.49 | 1.66 | 1.51 | 0.43 | 3.80 | 616.1 | 3.5 | 3.69 | 1.19 | 1.14 | 0.31 | 3.86 |
| ESP | 17,920 | 354.4 | 13.3 | 3.44 | 3.75 | 3.70 | 1.08 | 5.05 | 600.8 | 4.9 | 2.73 | 1.12 | 1.06 | 0.39 | 2.92 |
| EST | 3279 | 401.7 | 8.8 | 4.28 | 2.31 | 2.24 | 0.52 | 4.83 | 616.7 | 6.6 | 3.66 | 1.80 | 1.68 | 0.46 | 4.03 |
| FIN | 4019 | 425.0 | 8.7 | 3.39 | 2.66 | 2.61 | 0.77 | 4.28 | 650.9 | 13.8 | 3.12 | 4.60 | 4.57 | 1.47 | 5.54 |
| FRA | 2965 | 354.3 | 10.9 | 5.45 | 2.82 | 2.72 | 0.50 | 6.09 | 623.9 | 7.8 | 4.85 | 3.40 | 3.28 | 0.68 | 5.86 |
| GBR | 8431 | 366.8 | 6.3 | 3.32 | 2.16 | 2.09 | 0.63 | 3.92 | 609.5 | 2.0 | 3.93 | 0.58 | 0.26 | 0.07 | 3.94 |
| GRC | 3445 | 332.4 | 22.8 | 5.63 | 6.55 | 6.44 | 1.14 | 8.55 | 584.0 | 6.7 | 4.64 | 1.77 | 1.69 | 0.36 | 4.94 |
| HUN | 3177 | 356.7 | 12.3 | 6.07 | 3.57 | 3.57 | 0.59 | 7.04 | 608.5 | 6.4 | 6.03 | 1.63 | 1.51 | 0.25 | 6.21 |
| IRL | 2745 | 368.0 | 8.0 | 4.45 | 2.36 | 2.22 | 0.50 | 4.97 | 594.6 | 5.1 | 3.39 | 1.54 | 1.47 | 0.43 | 3.70 |
| ISL | 2510 | 378.3 | 3.4 | 3.70 | 1.25 | 1.05 | 0.28 | 3.84 | 622.2 | 4.7 | 3.46 | 1.70 | 1.60 | 0.46 | 3.82 |
| ITA | 21,379 | 351.5 | 11.3 | 2.47 | 3.41 | 3.41 | 1.38 | 4.21 | 604.3 | 4.4 | 2.89 | 0.91 | 0.83 | 0.29 | 3.01 |
| JPN | 4207 | 397.8 | 7.2 | 6.31 | 2.15 | 2.02 | 0.32 | 6.63 | 658.7 | 16.7 | 4.17 | 4.97 | 4.85 | 1.16 | 6.40 |
| KOR | 3447 | 424.9 | 10.6 | 4.52 | 3.22 | 2.92 | 0.65 | 5.38 | 666.7 | 32.2 | 5.09 | 8.04 | 7.88 | 1.55 | 9.38 |
| LUX | 3197 | 348.5 | 14.1 | 3.54 | 4.12 | 4.03 | 1.14 | 5.36 | 615.8 | 3.3 | 2.51 | 1.27 | 1.14 | 0.46 | 2.75 |
| NLD | 3318 | 396.9 | 4.6 | 5.92 | 1.18 | 0.92 | 0.16 | 6.00 | 645.8 | 10.1 | 5.02 | 3.31 | 3.22 | 0.64 | 5.96 |
| NOR | 3230 | 373.7 | 5.1 | 3.44 | 1.86 | 1.72 | 0.50 | 3.85 | 612.9 | 4.1 | 3.26 | 0.90 | 0.75 | 0.23 | 3.35 |
| POL | 3401 | 364.0 | 13.2 | 3.60 | 4.71 | 4.71 | 1.31 | 5.93 | 610.3 | 7.2 | 4.09 | 2.61 | 2.50 | 0.61 | 4.79 |
| PRT | 4391 | 354.5 | 12.0 | 3.47 | 3.68 | 3.64 | 1.05 | 5.02 | 607.0 | 2.6 | 4.22 | 0.74 | 0.49 | 0.12 | 4.25 |
| SWE | 3139 | 359.6 | 10.0 | 3.74 | 3.31 | 3.25 | 0.87 | 4.95 | 616.0 | 3.5 | 3.99 | 1.13 | 1.00 | 0.25 | 4.11 |
Note. CNT = country label (see Appendix B); N = sample size; M = weighted mean across different scaling models; rg = range of estimates across models; SE = standard error (computed with balanced half sampling); ME = estimated model error (see Equation (20)); MEbc = bias-corrected estimate of model error based on balanced half sampling (see Equation (23)); ER = error ratio defined as MEbc/SE; TE = total error, computed as $\sqrt{\mathrm{SE}^2 + \mathrm{ME}_{bc}^2}$ (see Equation (24)).
Results and model uncertainty of 11 different scaling models for country means and country standard deviations in PISA 2009 science.
| | | Country Mean | | | | | | | Country Standard Deviation | | | | | | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| CNT | N | M | rg | SE | ME | MEbc | ER | TE | M | rg | SE | ME | MEbc | ER | TE |
| AUS | 9864 | 517.6 | 2.7 | 2.72 | 0.84 | 0.83 | 0.30 | 2.84 | 104.9 | 3.4 | 1.75 | 0.65 | 0.58 | 0.33 | 1.84 |
| AUT | 4577 | 488.1 | 1.1 | 3.64 | 0.29 | 0.18 | 0.05 | 3.64 | 105.7 | 2.2 | 2.91 | 0.63 | 0.53 | 0.18 | 2.96 |
| BEL | 5938 | 498.1 | 2.4 | 2.51 | 0.55 | 0.55 | 0.22 | 2.57 | 106.7 | 2.4 | 1.98 | 0.61 | 0.57 | 0.29 | 2.06 |
| CAN | 16,075 | 519.6 | 0.7 | 1.81 | 0.15 | 0.09 | 0.05 | 1.81 | 93.8 | 3.6 | 1.24 | 0.94 | 0.91 | 0.74 | 1.54 |
| CHE | 8215 | 509.2 | 1.0 | 3.01 | 0.40 | 0.35 | 0.12 | 3.03 | 98.9 | 2.1 | 1.82 | 0.48 | 0.35 | 0.19 | 1.86 |
| CZE | 4252 | 494.1 | 2.9 | 3.43 | 1.00 | 0.98 | 0.29 | 3.57 | 99.1 | 1.1 | 2.66 | 0.30 | 0.00 | 0.00 | 2.66 |
| DEU | 3477 | 513.9 | 2.1 | 3.08 | 0.55 | 0.53 | 0.17 | 3.12 | 103.3 | 5.3 | 2.25 | 1.09 | 1.05 | 0.47 | 2.48 |
| DNK | 4101 | 488.3 | 4.7 | 2.62 | 1.92 | 1.89 | 0.72 | 3.23 | 95.2 | 3.6 | 1.98 | 1.11 | 1.09 | 0.55 | 2.26 |
| ESP | 17,876 | 478.2 | 2.1 | 2.18 | 0.46 | 0.42 | 0.19 | 2.22 | 87.9 | 4.0 | 1.64 | 1.00 | 0.97 | 0.59 | 1.90 |
| EST | 3272 | 517.5 | 1.0 | 2.75 | 0.31 | 0.23 | 0.08 | 2.76 | 87.3 | 4.1 | 1.91 | 1.09 | 1.06 | 0.56 | 2.18 |
| FIN | 4016 | 546.5 | 3.5 | 2.48 | 0.84 | 0.79 | 0.32 | 2.61 | 92.8 | 10.9 | 1.55 | 2.35 | 2.33 | 1.50 | 2.80 |
| FRA | 2960 | 488.2 | 3.7 | 3.91 | 1.10 | 1.02 | 0.26 | 4.04 | 105.3 | 4.1 | 3.09 | 1.27 | 1.15 | 0.37 | 3.29 |
| GBR | 8413 | 505.0 | 1.1 | 2.78 | 0.36 | 0.28 | 0.10 | 2.79 | 102.6 | 1.9 | 1.85 | 0.64 | 0.58 | 0.31 | 1.94 |
| GRC | 3452 | 461.4 | 4.5 | 4.10 | 1.26 | 1.26 | 0.31 | 4.29 | 96.8 | 8.8 | 2.22 | 2.05 | 2.00 | 0.90 | 2.99 |
| HUN | 3193 | 494.6 | 5.0 | 3.46 | 1.43 | 1.36 | 0.39 | 3.72 | 89.8 | 2.5 | 2.92 | 0.59 | 0.50 | 0.17 | 2.97 |
| IRL | 2738 | 497.0 | 1.0 | 3.31 | 0.36 | 0.27 | 0.08 | 3.32 | 99.4 | 1.7 | 2.81 | 0.50 | 0.33 | 0.12 | 2.83 |
| ISL | 2501 | 487.6 | 3.3 | 2.01 | 1.09 | 1.09 | 0.54 | 2.28 | 99.5 | 5.1 | 1.89 | 1.17 | 1.13 | 0.60 | 2.20 |
| ITA | 21,344 | 479.7 | 0.6 | 1.82 | 0.21 | 0.17 | 0.09 | 1.83 | 99.1 | 5.9 | 1.49 | 1.20 | 1.20 | 0.81 | 1.91 |
| JPN | 4222 | 534.6 | 7.8 | 3.76 | 2.29 | 2.29 | 0.61 | 4.40 | 106.7 | 10.3 | 3.15 | 2.72 | 2.69 | 0.85 | 4.14 |
| KOR | 3451 | 530.6 | 3.6 | 3.30 | 1.42 | 1.42 | 0.43 | 3.59 | 86.9 | 7.9 | 1.93 | 2.41 | 2.34 | 1.21 | 3.04 |
| LUX | 3195 | 474.8 | 3.5 | 1.94 | 0.91 | 0.87 | 0.45 | 2.12 | 107.9 | 6.5 | 1.53 | 1.63 | 1.58 | 1.03 | 2.20 |
| NLD | 3323 | 514.2 | 2.6 | 5.77 | 0.98 | 0.93 | 0.16 | 5.85 | 99.7 | 4.7 | 2.32 | 1.20 | 1.11 | 0.48 | 2.57 |
| NOR | 3204 | 491.0 | 3.2 | 2.67 | 1.15 | 1.10 | 0.41 | 2.88 | 93.2 | 3.2 | 1.65 | 0.81 | 0.74 | 0.45 | 1.81 |
| POL | 3397 | 499.6 | 3.1 | 2.72 | 0.73 | 0.70 | 0.26 | 2.81 | 92.7 | 2.3 | 1.93 | 0.58 | 0.52 | 0.27 | 2.00 |
| PRT | 4336 | 483.4 | 4.4 | 3.06 | 0.89 | 0.88 | 0.29 | 3.19 | 86.0 | 4.2 | 1.54 | 0.89 | 0.85 | 0.55 | 1.76 |
| SWE | 3157 | 487.3 | 1.5 | 2.85 | 0.39 | 0.34 | 0.12 | 2.87 | 102.4 | 2.2 | 1.58 | 0.50 | 0.38 | 0.24 | 1.63 |
Note. CNT = country label (see Appendix B); N = sample size; M = weighted mean across different scaling models; rg = range of estimates across models; SE = standard error (computed with balanced half sampling); ME = estimated model error (see Equation (20)); MEbc = bias-corrected estimate of model error based on balanced half sampling (see Equation (23)); ER = error ratio defined as MEbc/SE; TE = total error, computed as $\sqrt{\mathrm{SE}^2 + \mathrm{ME}_{bc}^2}$ (see Equation (24)).
Results and model uncertainty of 11 different scaling models for country 10th and 90th percentiles in PISA 2009 science.
| | | Country 10th Percentile | | | | | | | Country 90th Percentile | | | | | | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| CNT | N | M | rg | SE | ME | MEbc | ER | TE | M | rg | SE | ME | MEbc | ER | TE |
| AUS | 9864 | 383.3 | 3.8 | 3.19 | 1.09 | 1.01 | 0.32 | 3.34 | 650.3 | 12.7 | 4.07 | 2.85 | 2.76 | 0.68 | 4.92 |
| AUT | 4577 | 350.9 | 7.9 | 5.68 | 2.39 | 2.29 | 0.40 | 6.13 | 621.7 | 3.8 | 4.29 | 1.05 | 0.90 | 0.21 | 4.38 |
| BEL | 5938 | 358.7 | 7.1 | 4.18 | 1.96 | 1.96 | 0.47 | 4.62 | 632.9 | 11.5 | 2.89 | 2.28 | 2.25 | 0.78 | 3.66 |
| CAN | 16,075 | 398.5 | 1.4 | 2.59 | 0.51 | 0.36 | 0.14 | 2.62 | 638.7 | 10.2 | 2.25 | 2.43 | 2.39 | 1.06 | 3.29 |
| CHE | 8215 | 379.4 | 3.4 | 3.95 | 1.11 | 0.94 | 0.24 | 4.06 | 634.2 | 8.2 | 3.83 | 2.01 | 2.00 | 0.52 | 4.32 |
| CZE | 4252 | 366.8 | 6.1 | 5.49 | 1.49 | 1.36 | 0.25 | 5.66 | 621.4 | 4.0 | 4.26 | 1.03 | 0.93 | 0.22 | 4.36 |
| DEU | 3477 | 379.8 | 2.3 | 4.87 | 0.83 | 0.36 | 0.07 | 4.88 | 645.4 | 12.5 | 3.50 | 2.43 | 2.38 | 0.68 | 4.23 |
| DNK | 4101 | 366.8 | 6.5 | 3.64 | 1.98 | 1.92 | 0.53 | 4.12 | 610.9 | 6.4 | 3.59 | 2.48 | 2.45 | 0.68 | 4.34 |
| ESP | 17,876 | 365.2 | 7.1 | 3.45 | 1.68 | 1.65 | 0.48 | 3.82 | 590.3 | 3.8 | 2.49 | 0.98 | 0.90 | 0.36 | 2.65 |
| EST | 3272 | 404.3 | 3.1 | 4.05 | 1.06 | 0.99 | 0.24 | 4.16 | 629.1 | 9.5 | 3.32 | 2.06 | 2.02 | 0.61 | 3.89 |
| FIN | 4016 | 426.8 | 8.9 | 3.41 | 2.11 | 2.07 | 0.61 | 3.98 | 665.1 | 21.2 | 3.05 | 4.61 | 4.56 | 1.49 | 5.48 |
| FRA | 2960 | 349.7 | 14.1 | 6.26 | 3.65 | 3.43 | 0.55 | 7.14 | 619.2 | 6.0 | 4.70 | 1.39 | 1.26 | 0.27 | 4.87 |
| GBR | 8413 | 372.5 | 6.5 | 3.56 | 1.74 | 1.69 | 0.47 | 3.94 | 635.8 | 9.4 | 3.86 | 2.70 | 2.62 | 0.68 | 4.67 |
| GRC | 3452 | 336.7 | 20.4 | 6.02 | 4.52 | 4.43 | 0.74 | 7.47 | 584.8 | 5.3 | 4.01 | 1.43 | 1.31 | 0.33 | 4.22 |
| HUN | 3193 | 378.8 | 2.5 | 6.41 | 0.82 | 0.37 | 0.06 | 6.42 | 609.8 | 4.9 | 3.77 | 1.19 | 1.11 | 0.30 | 3.93 |
| IRL | 2738 | 370.3 | 7.3 | 5.60 | 2.08 | 1.93 | 0.34 | 5.93 | 623.2 | 4.3 | 4.01 | 1.09 | 1.01 | 0.25 | 4.13 |
| ISL | 2501 | 357.8 | 10.4 | 3.77 | 2.56 | 2.48 | 0.66 | 4.51 | 613.3 | 3.7 | 2.78 | 1.12 | 1.04 | 0.37 | 2.97 |
| ITA | 21,344 | 350.7 | 14.0 | 2.87 | 2.85 | 2.85 | 1.00 | 4.04 | 605.7 | 2.2 | 2.13 | 0.61 | 0.57 | 0.27 | 2.21 |
| JPN | 4222 | 390.5 | 5.6 | 7.55 | 1.48 | 1.26 | 0.17 | 7.66 | 663.4 | 27.8 | 3.40 | 6.97 | 6.94 | 2.04 | 7.72 |
| KOR | 3451 | 417.4 | 6.3 | 3.83 | 2.02 | 1.86 | 0.49 | 4.26 | 639.9 | 16.7 | 4.45 | 4.91 | 4.91 | 1.10 | 6.63 |
| LUX | 3195 | 334.6 | 18.6 | 2.98 | 4.62 | 4.55 | 1.53 | 5.44 | 612.2 | 1.6 | 2.67 | 0.49 | 0.18 | 0.07 | 2.68 |
| NLD | 3323 | 385.4 | 3.7 | 6.36 | 1.34 | 1.08 | 0.17 | 6.45 | 642.4 | 11.2 | 5.48 | 2.52 | 2.44 | 0.45 | 6.00 |
| NOR | 3204 | 371.1 | 5.8 | 3.31 | 1.42 | 1.32 | 0.40 | 3.56 | 611.3 | 3.5 | 3.58 | 1.23 | 1.07 | 0.30 | 3.74 |
| POL | 3397 | 380.6 | 3.7 | 3.81 | 0.93 | 0.86 | 0.22 | 3.90 | 619.7 | 3.6 | 3.51 | 1.06 | 0.93 | 0.26 | 3.63 |
| PRT | 4336 | 373.7 | 5.4 | 3.67 | 1.31 | 1.17 | 0.32 | 3.85 | 595.4 | 6.9 | 3.46 | 1.27 | 1.24 | 0.36 | 3.68 |
| SWE | 3157 | 355.5 | 10.4 | 3.42 | 2.27 | 2.20 | 0.64 | 4.07 | 617.5 | 7.3 | 3.62 | 1.81 | 1.75 | 0.48 | 4.02 |
Note. CNT = country label (see Appendix B); N = sample size; M = weighted mean across different scaling models; rg = range of estimates across models; SE = standard error (computed with balanced half sampling); ME = estimated model error (see Equation (20)); MEbc = bias-corrected estimate of model error based on balanced half sampling (see Equation (23)); ER = error ratio defined as MEbc/SE; TE = total error computed as √(SE² + MEbc²) (see Equation (24)).
Sensitivity analysis for country means and country standard deviations for original and uniform model weighting for PISA 2009 reading.
| | Country Mean | | | | | | Country Standard Deviation | | | | | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| | M | | SE | | MEbc | | M | | SE | | MEbc | |
| CNT | W1 | W2 | W1 | W2 | W1 | W2 | W1 | W2 | W1 | W2 | W1 | W2 |
| AUS | 515.2 | 515.2 | 2.51 | 2.51 | 0.29 | 0.33 | 104.7 | 104.7 | 1.45 | 1.46 | 0.64 | 0.74 |
| AUT | 470.8 | 471.0 | 3.34 | 3.33 | 0.65 | 0.74 | 104.6 | 104.3 | 2.16 | 2.18 | 1.64 | 1.90 |
| BEL | 509.5 | 509.7 | 2.49 | 2.49 | 0.78 | 0.90 | 107.5 | 107.6 | 1.92 | 1.91 | 0.65 | 0.74 |
| CAN | 525.0 | 524.8 | 1.49 | 1.49 | 0.43 | 0.53 | 95.6 | 95.8 | 1.12 | 1.13 | 1.18 | 1.34 |
| CHE | 501.7 | 501.8 | 2.72 | 2.73 | 0.39 | 0.43 | 99.7 | 99.7 | 1.67 | 1.68 | 0.00 | 0.00 |
| CZE | 479.9 | 479.9 | 3.17 | 3.16 | 0.27 | 0.20 | 95.2 | 95.2 | 1.86 | 1.86 | 0.20 | 0.15 |
| DEU | 498.5 | 498.7 | 3.05 | 3.04 | 0.39 | 0.44 | 100.1 | 100.1 | 2.01 | 2.00 | 0.00 | 0.03 |
| DNK | 493.7 | 493.4 | 2.10 | 2.10 | 1.58 | 1.75 | 88.0 | 87.8 | 1.31 | 1.33 | 0.68 | 0.84 |
| ESP | 480.1 | 480.0 | 2.12 | 2.11 | 0.43 | 0.44 | 91.9 | 91.5 | 1.18 | 1.16 | 1.13 | 1.34 |
| EST | 501.5 | 501.4 | 2.70 | 2.70 | 0.75 | 0.77 | 85.5 | 85.3 | 1.71 | 1.72 | 0.82 | 0.99 |
| FIN | 539.0 | 539.2 | 2.27 | 2.31 | 0.41 | 0.46 | 91.5 | 92.4 | 1.31 | 1.31 | 2.68 | 3.14 |
| FRA | 498.0 | 498.3 | 3.92 | 3.93 | 1.13 | 1.35 | 112.2 | 112.1 | 2.92 | 2.92 | 0.41 | 0.49 |
| GBR | 494.0 | 494.0 | 2.47 | 2.47 | 0.20 | 0.25 | 99.6 | 99.4 | 1.34 | 1.35 | 0.73 | 0.82 |
| GRC | 480.6 | 480.3 | 4.26 | 4.23 | 0.96 | 1.00 | 99.8 | 99.4 | 2.09 | 2.06 | 1.38 | 1.55 |
| HUN | 494.2 | 494.0 | 3.62 | 3.61 | 0.40 | 0.47 | 94.8 | 94.6 | 2.78 | 2.78 | 0.58 | 0.66 |
| IRL | 496.8 | 496.7 | 3.24 | 3.21 | 0.51 | 0.52 | 98.8 | 98.3 | 2.63 | 2.60 | 1.19 | 1.38 |
| ISL | 501.2 | 501.2 | 1.67 | 1.68 | 0.15 | 0.14 | 102.0 | 102.3 | 1.40 | 1.41 | 0.96 | 1.07 |
| ITA | 486.5 | 486.6 | 1.61 | 1.61 | 0.32 | 0.36 | 101.4 | 101.5 | 1.35 | 1.34 | 0.77 | 0.87 |
| JPN | 521.3 | 521.3 | 3.71 | 3.71 | 1.60 | 1.79 | 107.3 | 107.7 | 3.16 | 3.16 | 1.52 | 1.96 |
| KOR | 539.7 | 539.4 | 3.10 | 3.13 | 1.45 | 1.48 | 84.2 | 84.7 | 1.76 | 1.78 | 2.02 | 2.33 |
| LUX | 472.7 | 473.0 | 1.19 | 1.19 | 1.22 | 1.38 | 109.3 | 108.9 | 1.21 | 1.23 | 1.99 | 2.29 |
| NLD | 509.0 | 509.0 | 5.58 | 5.62 | 0.28 | 0.32 | 95.1 | 95.5 | 1.89 | 1.90 | 1.01 | 1.17 |
| NOR | 503.3 | 503.4 | 2.61 | 2.63 | 0.14 | 0.20 | 96.8 | 97.2 | 1.55 | 1.56 | 0.93 | 1.14 |
| POL | 501.7 | 501.8 | 2.72 | 2.73 | 0.72 | 0.67 | 92.8 | 93.0 | 1.32 | 1.34 | 0.84 | 0.96 |
| PRT | 489.2 | 489.0 | 3.17 | 3.16 | 0.70 | 0.83 | 91.8 | 91.5 | 1.75 | 1.74 | 0.71 | 0.88 |
| SWE | 497.0 | 497.0 | 3.00 | 3.00 | 0.00 | 0.00 | 103.6 | 103.4 | 1.63 | 1.64 | 0.27 | 0.32 |
Note. CNT = country label (see Appendix B); M = weighted mean across different scaling models; rg = range of estimates across models; SE = standard error (computed with balanced half sampling); MEbc = bias-corrected estimate of model error based on balanced half sampling (see Equation (23)); W1 = model weighting used in the main analysis (see Section 4.2 and results in other tables); W2 = uniform weighting of models.
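The comparison of the two weighting schemes can be illustrated with a small Python sketch. It rests on several assumptions that are not taken from the tables: the model error is taken to be the weighted standard deviation of the model-specific estimates around their weighted mean (without the bias correction of Equation (23)), and both the estimates and the W1-style weights below are hypothetical values chosen for illustration only. The function name `weighted_summary` is likewise illustrative.

```python
import math


def weighted_summary(estimates, weights):
    """Return (weighted mean, weighted SD) of model-specific estimates.

    ASSUMPTION: the (uncorrected) model error is the weighted standard
    deviation of the estimates around their weighted mean.
    """
    total = sum(weights)
    w = [x / total for x in weights]  # normalize weights to sum to 1
    mean = sum(wi * e for wi, e in zip(w, estimates))
    var = sum(wi * (e - mean) ** 2 for wi, e in zip(w, estimates))
    return mean, math.sqrt(var)


# HYPOTHETICAL country-mean estimates from four scaling models.
estimates = [500.0, 502.0, 498.0, 501.0]

# HYPOTHETICAL non-uniform weights (stand-in for W1) and uniform weights (W2).
w1 = [0.4, 0.3, 0.2, 0.1]
w2 = [1.0, 1.0, 1.0, 1.0]

m1, me1 = weighted_summary(estimates, w1)
m2, me2 = weighted_summary(estimates, w2)
print(f"W1: M = {m1:.2f}, ME = {me1:.2f}")
print(f"W2: M = {m2:.2f}, ME = {me2:.2f}")
```

In this toy example the mean barely changes between the two schemes while the model error shifts more noticeably, which mirrors the pattern in the table: M and SE are nearly identical under W1 and W2, whereas MEbc is slightly larger under uniform weighting for most countries.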