| Literature DB >> 30936843 |
Yunxiao Chen1, Xiaoou Li2, Jingchen Liu3, Zhiliang Ying3.
Abstract
Complex problem-solving (CPS) ability has been recognized as a central 21st century skill. Individuals' processes of solving crucial complex problems may contain substantial information about their CPS ability. In this paper, we consider the prediction of duration and final outcome (i.e., success/failure) of solving a complex problem during task completion process, by making use of process data recorded in computer log files. Solving this problem may help answer questions like "how much information about an individual's CPS ability is contained in the process data?," "what CPS patterns will yield a higher chance of success?," and "what CPS patterns predict the remaining time for task completion?" We propose an event history analysis model for this prediction problem. The trained prediction model may provide us a better understanding of individuals' problem-solving patterns, which may eventually lead to a good design of automated interventions (e.g., providing hints) for the training of CPS ability. A real data example from the 2012 Programme for International Student Assessment (PISA) is provided for illustration.Entities:
Keywords: PISA data; complex problem solving; event history analysis; process data; response time
Year: 2019 PMID: 30936843 PMCID: PMC6431619 DOI: 10.3389/fpsyg.2019.00486
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Figure 1(A) Simulation environment of CC item. (B) Answer diagram of CC item.
An example of computer log file data from CC item in PISA 2012.
| 0 | Start. |
| 29.5 | Set top, central, and bottom controls at 2, 0, and 0, respectively, and click APPLY. |
| 32.4 | Set top, central, and bottom controls at 0, 0, and 0, respectively, and click APPLY. |
| 35.2 | Click RESET. |
| 36.2 | Set all three controls at 0, and click APPLY. |
| ⋮ | ⋮ |
| 58 | Connecting ”top control” with ”temperature.” |
| 59.1 | Connecting ”central control” with ”humidity.” |
| 59.6 | Connecting ”bottom control” with ”humidity.” |
| 61.5 | Success. |
Figure 2Visualization of the structure of process data from CC item in PISA 2012.
Figure 3(A) Histogram of problem-solving duration of the CC item. (B) Histogram of the number of actions for solving the CC item.
The list of candidate features to be incorporated into the model.
| 1. | Number of actions taken up to time | |
| 2. | Frequency of actions up to time | |
| 3. | 1{ | Indicator of whether an action has been taken before time |
| 4. | Number of simple actions (i.e., moving one control slider at a time) | |
| taken up to time | ||
| 5. | Frequency of simple actions up to time | |
| 6. | 1{ | Indicator of whether a simple action has been taken before time |
| 7. | An indicator function, | |
| have been explored via simple actions up to time | ||
| 8. | Number of RESET used up to time | |
| 9. | Frequency of RESET up to time | |
| 10. | 1{ | Indicator of whether RESET has been used before time |
| 11. | Number of times that previously taken actions (excluding RESET)are repeated. | |
| 12. | Frequency of repeating previously taken actions (excluding RESET). | |
| 13. | 1{ | Indicator of repeating previously taken actions (excluding RESET). |
Figure 4The increase in the cross-validated log-pseudo-likelihood based on a stepwise forward selection procedure. (A–C) plot the cross-validated log-pseudo-likelihood, corresponding to L(B, σ), L1(b1), L2(b2, σ), respectively.
Results on model selection based on a stepwise forward selection procedure.
| 0. | 1, | –72241.7 | –63867.9 | –8373.7 |
| 1. | –70663.0 | –62856.1 | –7806.9 | |
| 2. | 1{ | –70058.3 | –62617.0 | –7441.4 |
| 3. | 1{ | –69744.9 | –62315.2 | –7429.7 |
| 4. | –69672.7 | –62237.6 | –7435.1 | |
| 5. | 1{ | –69601.3 | –62239.9 | –7361.4 |
| 6. | -69547.6 | –62226.8 | –7320.8 | |
| 7. | –69522.5 | –62205.1 | –7317.4 | |
| 8. | 1{ | –69507.0 | -62190.0 | –7317.0 |
| 9. | –69500.8 | –62191.9 | –7308.9 | |
| 10. | –69499.4 | –62192.6 | –7306.8 | |
| 11. | –69498.5 | –62191.8 | –7306.7 |
The columns “Lik,” “Lik.out,” and “Lik.dur” give the value of the cross-validated log-pseudo-likelihood, corresponding to L(B, σ), L.
Figure 5A comparison of prediction accuracy between the model selected by cross validation and a baseline model without using individual specific event history.
Estimated regression coefficients for a model for which the event history process contains the initial features based on polynomials of t and the top six features selected by cross validation.
| 1. | 1 | 3.1 × 10−1 | 4.8 |
| 2. | −5.9 × 10−3 | −2.7 × 10−3 | |
| 3. | 3.1 × 10−6 | −4.5 × 10−7 | |
| 4. | 1.7 × 10−8 | 3.5 × 10−8 | |
| 5. | 5.2 × 10−1 | −8.4 × 10−1 | |
| 6. | 1{ | 6.8 × 10−1 | −2.1 × 10−1 |
| 7. | 1{ | −3.1 × 10−1 | −6.6 × 10−1 |
| 8. | −1.1 | −1.4 | |
| 9. | 1{ | 3.7 × 10−1 | 3.8 × 10−2 |
| 10. | 3.0 | 7.9 × 10−1 |