| Literature DB >> 27167518 |
Frederik Coomans1,2, Abe Hofman1, Matthieu Brinkhuis1, Han L J van der Maas1, Gunter Maris3,1.
Abstract
We investigate the relation between speed and accuracy within problem solving in its simplest non-trivial form. We consider tests with only two items and code the item responses in two binary variables: one indicating the response accuracy, and one indicating the response speed. Despite being a very basic setup, it enables us to study item pairs stemming from a broad range of domains such as basic arithmetic, first language learning, intelligence-related problems, and chess, with large numbers of observations for every pair of problems under consideration. We carry out a survey over a large number of such item pairs and compare three types of psychometric accuracy-response time models present in the literature: two 'one-process' models, the first of which models accuracy and response time as conditionally independent and the second of which models accuracy and response time as conditionally dependent, and a 'two-process' model which models accuracy contingent on response time. We find that the data clearly violates the restrictions imposed by both one-process models and requires additional complexity which is parsimoniously provided by the two-process model. We supplement our survey with an analysis of the erroneous responses for an example item pair and demonstrate that there are very significant differences between the types of errors in fast and slow responses.Entities:
Mesh:
Year: 2016 PMID: 27167518 PMCID: PMC4864306 DOI: 10.1371/journal.pone.0155149
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Item pair contingency table for items 1 (100 × 3000) and 2 (80 × 2) constructed from 18744 response pairs.
| ( | ( | ( | ( | |
| ( | 435 (1) | 245 (2) | 428 (3) | 1668 (4) |
| ( | 256 (5) | 229 (6) | 487 (7) | 1108 (8) |
| ( | 509 (9) | 586 (10) | 1382 (11) | 2245 (12) |
| ( | 1227 (13) | 786 (14) | 1624 (15) | 5529 (16) |
The numbers between parentheses indicate the enumeration that is used throughout the text to indicate the events in the item pair contingency table. The cells 1, 4, 13, and 16 constitute the events for which both responses on the item pair are fast (fast-fast responses). The cells 6, 7, 10, and 11 constitute the events for which both responses on the item pair are slow (slow-slow responses). All remaining cells constitute the events for which the speed of both responses on the item pair differs.
Estimated frequencies computed from CIM, CDM, 2P&3I and 3P&2I.
| ( | ( | ( | ( | |||||
| ( | 435.00 | 214.50 | 657.99 | 1571.62 | ||||
| 435.00 | 266.63 | 439.71 | 1191.26 | |||||
| 435.00 | 245.00 | 423.88 | 1672.12 | |||||
| 435.00 | 245.00 | 428.00 | 1572.26 | |||||
| ( | 286.50 | 229.00 | 582.50 | 878.89 | ||||
| 234.37 | 386.52 | 1047.15 | 1235.43 | |||||
| 256.00 | 229.00 | 491.12 | 1103.88 | |||||
| 256.00 | 229.00 | 582.74 | 1108.00 | |||||
| ( | 740.06 | 490.50 | 1382.00 | 2212.55 | ||||
| 339.76 | 920.47 | 1085.97 | 2059.04 | |||||
| 513.12 | 581.88 | 1382.00 | 2245.00 | |||||
| 509.00 | 490.26 | 1382.00 | 2245.00 | |||||
| ( | 1323.38 | 554.06 | 1656.45 | 5529.00 | ||||
| 809.12 | 954.60 | 1809.96 | 5529.00 | |||||
| 1222.88 | 790.12 | 1624.00 | 5529.00 | |||||
| 1322.741 | 786.00 | 1624.00 | 5529.00 | |||||
Estimates are obtained based on the data in Table 1. Every cell contains the observed frequency (in bold) together with 4 expected frequencies corresponding to, respectively from top to bottom, the CIM, the CDM, the 2P&3I truncation and the 3P&2I truncation. The corresponding Pearson goodness-of-fit χ2-statistics are (20.52), (26.12), (10.83) and (10.83), where the numbers in brackets denote the statistics’ bounds corresponding to p = 0.001 for the corresponding degrees of freedom.
Fig 1Schematic description of a two-level branching model.
The first level distinguishes fast (y = 1) and slow (y = 0) responses, whereas the second level distinguishes correct (x = 1) and incorrect (x = 0) responses. In the nodes, the person and item parameters of the corresponding Rasch models are displayed. The left branch of the first node corresponds to the probability of answering fast, the left branch of the second node corresponds to the probability of answering correctly given that the response is fast, and the left branch of the third node corresponds to the probability of answering correctly given that the response is slow.
Fig 2Number of parameters vs. number of items.
The left plot compares the log number of parameters of the two-level branching model (solid) and the saturated model (dashed) on an N-item contingency table as function of N. The right plot displays the ratio of the number of parameters of the two-level branching model over the number of parameters of the saturated model (both on an N-item contingency table) as function of N.
Survey of violations.
| domain | CIM | CDM | 2P&3I | 3P&2I |
|---|---|---|---|---|
| multiplication | ||||
| division | ||||
| addition | ||||
| subtraction |
For each Math Garden domain indicated in the left column, all 435 possible item pairs of the 30 most played items in the domain are considered. All item pairs with less than 500 administrations are discarded. For each remaining item pair, an item pair contingency table is constructed for which the CIM, CDM, 3P&2I and 2P&3I are estimated and the four corresponding χ2-test statistics are computed. Under a given model for a specific item pair, if one or more of the expected cell frequencies is below 1 or if more than 3 cell frequencies are below 5, the item pair is excluded from the analysis of that specific model. The total number of remaining item pairs per domain and per model is indicated in parentheses in the corresponding table entry. The boldface number in every table entry indicates what percentage of this number of remaining item pairs has a χ2-value that exceeds the p = 0.001 threshold for the corresponding number of degrees of freedom. This threshold is 20.52 for CIM (p = 0.001 for 15-10 degrees of freedom), 26.12 for CDM (p = 0.001 for 15-7 degrees of freedom), 10.83 for 2P&3I (p = 0.001 for 15-14 degrees of freedom) and 10.83 for 3P&2I (p = 0.001 for 15-14 degrees of freedom).
Survey of violations (2).
| domain | CIM | CDM | 2P&3I | 3P&2I |
|---|---|---|---|---|
| Set | ||||
| Letter Chaos | ||||
| Chess |
For Set and Letter Chaos, all 45 possible item pairs of the 10 most played items are considered. All item pairs with less than 500 administrations are discarded. The analysis of the remaining item pairs in these domains is equivalent to that described below Table 3. For chess, 780 item pairs of 40 items in the Amsterdam Chess Test I are considered. Each item pair has a fixed number of 259 observations and thus we do not employ the ‘500-observations-requirement’ here. The rest of the analysis is equivalent to that described below Table 3. However, it has to be noted that 310 of the 627 item pairs that were excluded from the analysis of the 2P&3I model have sparse contingency tables for which the corresponding set of 2P&3I maximum likelihood equations, given in Eq (16), has no solution.
Error contingency table.
| item 1 errors | fast | slow | item 2 errors | fast | slow |
|---|---|---|---|---|---|
| 1326 | 717 | 366 | 192 | ||
| 458 | 287 | 208 | 174 | ||
| 299 | 249 | 203 | 87 | ||
| 133 | 82 | 186 | 95 | ||
| 50 | 126 | 154 | 40 | ||
| 17 | 66 | 101 | 89 | ||
| 45 | 32 | 95 | 75 | ||
| 49 | 26 | 65 | 103 | ||
| 42 | 14 | 86 | 60 | ||
| 18 | 36 | 80 | 50 | ||
| residual | 339 | 445 | residual | 883 | 881 |
For items 100 × 3000 (item 1) and 80 × 2 (item 2) the 10 most common incorrect responses are displayed together with their corresponding frequencies and are split up based on the speed of the response. This data is obtained from the Math Garden framework for the period from 01-03-2012 to 01-07-2014. Note that we left out the response 0 from the analysis because it was set as the default answer in the Math Garden framework for some time. Note that the sum of the frequencies of the fast and slow errors of, respectively, the first (second) item equal the sum of the first and second rows (columns) in Table 1, as they should.
Two-process model probabilities conditional on the latent variables θ(1), θ(2) and θ(3).
| ( | ( | ( | ( | |
|
( | ||||
| ( | ||||
| ( | ||||
| ( |
The objects are defined in Eq (12).
Manifest 2P&3I probabilities.
| ( | ( | ( | ( | |
| ( | ||||
| ( | ||||
| ( | ||||
| ( |
Parametrization of the contingency table as implied by the the 2P&3I truncation of the two-level branching model. The parameters φ( are subject to the constraint Eq (14).
Manifest 3P&2I probabilities.
| ( | ( | ( | ( | |
| ( | ||||
| ( | ||||
| ( | ||||
| ( |
Parametrization of the contingency table as implied by the the 3P&2I truncation of the two-level branching model. The parameters φ( are subject to the constraint Eq (15).
Manifest CIM probabilities.
| ( | ( | ( | ( | |
| ( | ||||
| ( | ||||
| ( | ||||
| ( |
Parametrization of the contingency table as implied by the CIM Eq (21). The parameters φ( are subject to the constraint Eq (22).
Manifest CDM probabilities.
| ( | ( | ( | ( | |
|
( |
|
|
| |
|
( |
|
|
|
|
|
( |
|
|
|
|
|
( |
|
|
|
Discretized SRT model probabilities Eq (28) written out for all possible answer patterns. The score parameters φ( are subject to the constraint Eq (29).