| Literature DB >> 33815190 |
Denise Reis Costa1, Maria Bolsinova2, Jesper Tijmstra2, Björn Andersson1.
Abstract
Log-file data from computer-based assessments can provide useful collateral information for estimating student abilities. In turn, this can improve traditional approaches that only consider response accuracy. Based on the amounts of time students spent on 10 mathematics items from the PISA 2012, this study evaluated the overall changes in and measurement precision of ability estimates and explored country-level heterogeneity when combining item responses and time-on-task measurements using a joint framework. Our findings suggest a notable increase in precision with the incorporation of response times and indicate differences between countries in how respondents approached items as well as in their response processes. Results also showed that additional information could be captured through differences in the modeling structure when response times were included. However, such information may not reflect the testing objective.Entities:
Keywords: PISA; computer-based assessment; log files; measurement invariance; measurement precision; time on task
Year: 2021 PMID: 33815190 PMCID: PMC8017127 DOI: 10.3389/fpsyg.2021.579128
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Sample size, mean score, and variation in student performance on all clusters, as well as sample size, percentage of female, average total time, and percentage of missing responses for the 10 released and valid log-file data from the PISA 2012 computer-based mathematics by country.
| SGP | 2,873 | 566.02 | 98.34 | 453 | 49.89 | 16.13 | 3.22 |
| QCN | 2,409 | 562.26 | 93.64 | 393 | 49.87 | 16.13 | 0.64 |
| KOR | 2,675 | 552.57 | 90.15 | 433 | 44.34 | 13.92 | 1.20 |
| HKG | 2,714 | 549.64 | 86.71 | 421 | 45.37 | 15.37 | 2.45 |
| MAC | 3,147 | 542.90 | 82.85 | 522 | 50.00 | 17.92 | 3.41 |
| JPN | 6,351 | 539.01 | 87.80 | 982 | 46.44 | 15.65 | 3.21 |
| TAP | 3,063 | 537.26 | 88.80 | 513 | 51.27 | 15.13 | 2.51 |
| CAN | 10,817 | 522.85 | 91.92 | 1,527 | 51.34 | 14.28 | 4.16 |
| EST | 2,837 | 516.09 | 82.13 | 460 | 50.00 | 14.54 | 2.37 |
| BEL | 4,617 | 512.15 | 98.60 | 707 | 49.50 | 14.19 | 4.82 |
| DEU | 2,881 | 509.37 | 95.50 | 441 | 51.02 | 13.75 | 2.43 |
| FRA | 3,012 | 508.06 | 91.95 | 440 | 53.18 | 15.43 | 4.41 |
| AUS | 11,834 | 507.70 | 90.94 | 1,833 | 48.88 | 13.55 | 1.99 |
| AUT | 2,731 | 507.34 | 88.74 | 436 | 50.92 | 13.42 | 1.28 |
| ITA | 3,089 | 498.76 | 83.14 | 440 | 45.68 | 16.54 | 6.86 |
| USA | 2,572 | 498.03 | 88.75 | 402 | 46.77 | 14.57 | 1.89 |
| NOR | 2,924 | 497.56 | 87.25 | 413 | 48.67 | 13.48 | 2.20 |
| SVK | 3,145 | 497.34 | 86.07 | 505 | 44.75 | 16.24 | 5.88 |
| DNK | 4,149 | 496.19 | 86.41 | 629 | 51.83 | 13.51 | 1.43 |
| IRL | 2,613 | 493.08 | 80.50 | 389 | 51.41 | 14.85 | 3.26 |
| SWE | 2,671 | 489.93 | 86.06 | 423 | 52.48 | 13.84 | 3.62 |
| RUS | 3,186 | 489.15 | 79.83 | 531 | 50.28 | 16.36 | 4.24 |
| POL | 2,567 | 489.04 | 86.01 | 428 | 52.10 | 13.09 | 1.64 |
| PRT | 3,272 | 489.03 | 85.09 | 487 | 48.05 | 15.52 | 3.29 |
| SVN | 4,385 | 486.94 | 87.83 | 678 | 45.87 | 10.95 | 0.65 |
| ESP | 5,751 | 475.08 | 81.99 | 933 | 50.38 | 14.44 | 3.63 |
| HUN | 2,746 | 469.84 | 92.58 | 445 | 52.81 | 12.79 | 1.82 |
| ISR | 2,677 | 446.61 | 111.28 | 387 | 54.78 | 14.65 | 2.48 |
| ARE | 6,732 | 434.06 | 84.28 | 1057 | 51.09 | 14.03 | 4.07 |
| BRA | 3,172 | 420.74 | 83.85 | 480 | 50.00 | 16.40 | 9.92 |
| COL | 5,173 | 396.84 | 73.33 | 782 | 53.58 | 16.48 | 8.09 |
| Overall mean | 3,961 | 612 | 49.77 | 14.62 | 3.40 | ||
(1) Countries are displayed by the ISO three-letter code. Their correspondence names are available at the .
Characteristics of the released PISA 2012 computer-based of mathematics items.
| I15Q1 | MC | 59.02 | 498.51 | 1.36 | 0.81 | |
| I15Q2 | CR | 8.43 | 685.84 | 700.72 | 1.87 | 0.93 |
| I15Q3 | CR | 29.02 | 577.18 | 658.58 | 1.98 | 3.25 |
| I20Q1 | CR | 29.58 | 562.07 | 690.91 | 2.18 | 1.69 |
| I20Q2 | MC | 47.42 | 549.29 | 0.96 | 1.93 | |
| I20Q3 | CR | 26.91 | 644.25 | 1.33 | 2.55 | |
| I20Q4 | MC | 44.12 | 565.73 | 0.84 | 3.30 | |
| I38Q3 | MC | 67.13 | 468.75 | 1.25 | 3.94 | |
| I38Q5 | CR | 27.75 | 641.05 | 1.82 | 6.60 | |
| I38Q6 | CR | 23.24 | 660.45 | 1.56 | 8.96 | |
(1) Item are displayed by the position within the cluster. (2) MC = Multiple Choice and CR= Constructed Response item type. (3) The international percent of correct responses, and thresholds values were retrieved by OECD (.
Figure 1(A) M1: response accuracy only (B) M2: simple-structure hierarchical model (C) M3: Extended hierarchical model with cross-loadings. The parameter's sub-indices are: p, person; I, item; c, country.
Framework for the estimation of international parameters for each analyzed model.
| M1 | 0 | 1 | - | - | Free | Free | - | - | - | - | - |
| M2 | Free | Free | 0 | 1 | Free | Free | Free | Free | - | ||
| M3 | Free | Free | 0 | 1 | Free | Free | Free | 0 | Free |
For M3, the second latent variable (τ.
Framework for the estimation of countries' parameters for each analyzed model.
| M1_Full | Free | Free | - | - | - | - | - | - | - | ||
| M2_Full | Free | Free | Free | Free | ξ | λ | Free | - | |||
| M2_Strong | Free | Free | Free | Free | ξ | λ | Free | Free | - | ||
| M2_Weak | Free | Free | 0 | Free | Free | λ | Free | Free | - | ||
| M2_Struct | Free | Free | 0 | 1 | Free | Free | Free | Free | - | ||
| M3_Full | Free | Free | Free | Free | ξ | λ | Free | ϕ | |||
| M3_Strong | Free | Free | Free | Free | ξ | λ | Free | Free | ϕ | ||
| M3_Weak | Free | Free | 0 | Free | Free | λ | Free | Free | ϕ | ||
| M3_Struct | Free | 0 | 1 | Free | Free | Free | 0 | Free |
(1) Since ϕ.
Estimated means and variances of students' abilities, EAP reliability and average of the standard errors for the three measurement models.
| M1 | 0.00 | 1.00 | 0.73 | 0.51 |
| M2 | −0.02 | 1.06 | 0.77 | 0.49 |
| M3 | −0.02 | 1.05 | 0.80 | 0.45 |
International results of the PISA 2012 digital math items.
Model fit statistics (BIC) by model and country.
| ARE | [29647.12–29774.20] | M2_Struct | [29537.07–29815.97] | M3_Weak |
| AUS | [54700.07–54766.34] | M2_Weak | [54328.41–54415.99] | M3_Full |
| AUT | [13147.99–13240.82] | M2_Full | [13049.54–13149.53] | M3_Full |
| BEL | [20621.68–20670.20] | M2_Full | [20487.24–20539.84] | M3_Weak |
| BRA | [12137.77–12214.90] | M2_Weak | [12091.05–12181.83] | M3_Weak |
| CAN | [43637.52–43803.38] | M2_Strong | [43440.33–43598.05] | M3_Weak |
| COL | [21367.18–21653.93] | M2_Weak | [21289.22–21653.15] | M3_Weak |
| DEU | [13002.93–13065.20] | M2_Full | [12919.52–13010.30] | M3_Full |
| DNK | [18226.95–18269.57] | M2_Full | [18090.74–18175.80] | M3_Full |
| ESP | [27485.45–27588.13] | M2_Full | [27287.14–27437.58] | M3_Full |
| EST | [12629.05–12761.87] | M2_Strong | [12482.02–12625.00] | M3_Strong |
| FRA | [12890.52–12913.41] | M2_Weak | [12772.27–12824.20] | M3_Weak |
| HKG | [13527.25–14034.09] | M2_Struct | [13479.78–13752.47] | M3_Struct |
| HUN | [12181.87–12197.26] | M2_Strong | [12110.29–12155.05] | M3_Strong |
| IRL | [10701.96–10754.46] | M2_Strong | [10602.46–10685.45] | M3_Strong |
| ISR | [12298.66–12436.20] | M2_Weak | [12229.09–12370.03] | M3_Weak |
| ITA | [12674.72–12748.87] | M2_Weak | [12559.46–12616.12] | M3_Full |
| JPN | [30542.91–31258.25] | M2_Struct | [30320.88–31046.07] | M3_Struct |
| KOR | [13060.99–13190.37] | M2_Struct | [12977.21–13138.38] | M3_Weak |
| MAC | [15390.05–15536.21] | M2_Weak | [15250.20–15396.36] | M3_Weak |
| NOR | [12867.52–12905.24] | M2_Strong | [12784.82–12845.24] | M3_Weak |
| POL | [12530.05–12563.72] | M2_Full | [12444.90–12511.19] | M3_Full |
| PRT | [14017.90–14079.03] | M2_Full | [13934.64–14045.98] | M3_Full |
| QCN | [11640.00–11840.43] | M2_Weak | [11579.94–11665.66] | M3_Struct |
| RUS | [15790.27–15843.04] | M2_Weak | [15683.03–15754.20] | M3_Weak |
| SGP | [13129.26–13240.15] | M2_Weak | [13016.65–13090.06] | M3_Strong |
| SVK | [13976.93–14004.75] | M2_Struct | [13877.89–13903.17] | M3_Full |
| SVN | [19994.64–20004.67] | M2_Weak | [19942.30–19986.59] | M3_Strong |
| SWE | [12449.46–12509.05] | M2_Full | [12325.18–12420.05] | M3_Full |
| TAP | [15235.19–15292.96] | M2_Struct | [15121.61–15158.37] | M3_Full |
| USA | [11209.98–11277.11] | M2_Weak | [11186.89–11239.37] | M3_Weak |
| Total BIC | [556031.15–557223.67] | M2_Weak | [552353.33–555491.33] | M3_Weak |
(1) The range indicates the minimum and maximum values of the BIC statistic by country. (2) The suffix “_Full” indicates full measurement invariance (fixing all item parameters to be equal to the international estimates), “_Strong” indicates strong measurement invariance (country-specific residual variances are allowed to be estimated, while time intensity parameters and factor loadings are fixed to be equal to the international estimates), “_Weak” means weak measurement invariance model (country-specific residual variances and country-time intensity parameters are freely estimated, while factor loadings are fixed to be equal to the international estimates), and “_Struct,” structural measurement invariance (wholly fitted time-related parameters, i.e., all time-related parameters are freely estimated in each country).
Figure 2Estimates of the countries' time intensity for model 2—Weak measurement invariance.
Figure 3Estimates of the countries' means and their respective confidence intervals for the different models.
Figure 4EAP reliabilities estimates per country and model.
Figure 5Average standard errors of abilities' estimates per country and model.
Figure 6Correlations between EAP estimates.