| Literature DB >> 31133910 |
Matthias Stadler1, Frank Fischer1, Samuel Greiff2.
Abstract
Influencing students' educational achievements first requires understanding the underlying processes that lead to variation in students' performance. Researchers are therefore increasingly interested in analyzing the differences in behavior displayed in educational assessments rather than merely assessing their outcomes. Such analyses provide valuable information on the differences between successful and unsuccessful students and help to design appropriate interventions. Complex problem-solving (CPS) tasks have proven to provide particularly rich process data as they allow for a multitude of behaviors several of which can lead to a successful performance. So far, this data has often been analyzed on a rather aggregated level looking at an average number of actions or predefined strategies with only a few articles investigating the specific actions performed. In this paper, we report the results of an exploratory analysis of CPS log-files that is aimed at distinguishing between students that applied the correct strategy to a problem but failed to solve it and those applying the strategy successfully. In that, the sequence of behavior displayed is reduced to interpretable parts (n-grams) that allow searching for meaningful differences between the two groups of students. This level of analysis allows finding previously undefined or unknown patterns within the data and increases our understanding of the processes underlying successful problem-solving behavior even further.Entities:
Keywords: educational assessment; log-file; n-grams; problem-solving; process data
Year: 2019 PMID: 31133910 PMCID: PMC6514184 DOI: 10.3389/fpsyg.2019.00777
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Example of n-grams of different length with respective frequencies.
| Sequence | Frequency | Sequence | Frequency | Sequence | Frequency |
|---|---|---|---|---|---|
| AA | 2 | AAA | 1 | AAAA | 0 |
| AB | 6 | AAB | 2 | AAAB | 1 |
| BA | 6 | ABB | 5 | AABA | 0 |
| BB | 5 | ABA | 1 | AABB | 2 |
| BAA | 1 | ABAA | 0 | ||
| BAB | 3 | ABAB | 1 | ||
| BBA | 5 | ABBA | 1 | ||
| BBB | 4 | ABBB | 4 | ||
| BAAA | 1 | ||||
| BAAB | 0 | ||||
| BABA | 1 | ||||
| BABB | 3 | ||||
| BBAA | 1 | ||||
| BBAB | 3 | ||||
| BBBA | 4 | ||||
| BBBB | 0 | ||||
FIGURE 1Scenario (top) and model (bottom) of the “Handball training” task.
Example data from a log file of the “Handball training” task (adapted from Greiff et al., 2013).
| General information | Input variables | Output variables | |||||
|---|---|---|---|---|---|---|---|
| Timestamp | Button pressed | Training A | Training B | Training C | Motivation | Power | Exhaustion |
| 15:13:21 | Apply | 0 | 1 | 0 | 17 | 15 | 15 |
| 15:13:23 | Apply | 0 | 0 | 1 | 17 | 17 | 15 |
| 15:13:26 | Apply | 1 | 0 | 0 | 17 | 19 | 17 |
The raw and weighted frequency for all sequences of behavior.
| Behavior sequence | Frequency of sequences | Frequency of actions | Weight | Freq. in correct | Freq. in incorrect | ||
|---|---|---|---|---|---|---|---|
| Raw | Wgt | Raw | Wgt | ||||
| Bigrams | |||||||
| MM | 663 | 5063 | 0.01 | 4073 | 37.57 | 990 | 9.13 |
| MS | 159 | 221 | 2.08 | 166 | 345.36 | 55 | 114.43 |
| SM | 224 | 327 | 1.66 | 249 | 414.13 | 78 | 129.73 |
| SS | 81 | 226 | 3.07 | 156 | 478.76 | 70 | 214.83 |
| Trigrams | |||||||
| MMM | 628 | 4275 | 0.12 | 3454 | 408.11 | 821 | 97.01 |
| MMS | 125 | 159 | 2.33 | 120 | 279.12 | 39 | 90.71 |
| MSM | 118 | 147 | 2.38 | 113 | 269.00 | 34 | 80.94 |
| MSS | 54 | 66 | 3.08 | 47 | 144.59 | 19 | 58.45 |
| SMM | 206 | 267 | 1.75 | 208 | 363.20 | 59 | 103.02 |
| SMS | 28 | 33 | 3.47 | 25 | 86.66 | 8 | 27.73 |
| SSM | 79 | 96 | 2.76 | 72 | 198.80 | 24 | 66.27 |
| SSS | 50 | 128 | 3.49 | 82 | 286.51 | 46 | 160.73 |
| Four-grams | |||||||
| MMMM | 495 | 3574 | 0.59 | 2895 | 1698.67 | 679 | 398.41 |
| MMMS | 80 | 101 | 2.77 | 79 | 218.45 | 22 | 60.83 |
| MMSM | 90 | 103 | 2.62 | 80 | 209.51 | 23 | 60.23 |
| MMSS | 41 | 48 | 3.25 | 34 | 110.37 | 14 | 45.45 |
| MSMM | 97 | 114 | 2.56 | 90 | 230.19 | 24 | 61.39 |
| MSMS | 14 | 15 | 3.65 | 11 | 40.15 | 4 | 14.60 |
| MSSM | 22 | 23 | 3.50 | 17 | 59.46 | 6 | 20.99 |
| MSSS | 36 | 41 | 3.31 | 28 | 92.70 | 13 | 43.04 |
| SMMM | 171 | 210 | 1.96 | 167 | 327.60 | 43 | 84.35 |
| SMMS | 24 | 28 | 3.53 | 22 | 77.70 | 6 | 21.19 |
| SMSM | 20 | 21 | 3.54 | 16 | 56.57 | 5 | 17.68 |
| SMSS | 12 | 12 | 3.63 | 9 | 32.64 | 3 | 10.88 |
| SSMM | 67 | 76 | 2.87 | 60 | 172.40 | 16 | 45.97 |
| SSMS | 11 | 11 | 3.64 | 8 | 29.10 | 3 | 10.91 |
| SSSM | 50 | 57 | 3.10 | 40 | 123.96 | 17 | 52.68 |
| SSSS | 25 | 71 | 4.06 | 42 | 170.71 | 29 | 117.87 |
Distribution of students based on whether they solved the problem and applied the VOTAT strategy.
| Applied the VOTATstrategy | ||||
|---|---|---|---|---|
| No | Yes | Total | ||
| Solved the problem | No | 712 | 143 | 855 |
| Yes | 21 | 523 | 544 | |
| Total | 733 | 666 | 1399 | |
Summary of the chi-square feature selection model for bigrams, trigrams, and four-grams.
| Sequence | χ2 | Dir. | Sequence | χ2 | Dir. | Sequence | χ2 | Dir. | |||
|---|---|---|---|---|---|---|---|---|---|---|---|
| SS | 31.98 | <0.001 | - | SSS | 59.98 | - | <0.001 | SSSS | 76.34 | - | <0.001 |
| MM | 0.92 | 0.337 | + | MMM | 12.16 | + | <0.001 | MMMM | 67.09 | + | <0.001 |
| MS | 0.23 | 0.632 | - | MSS | 4.08 | - | 0.043 | MSSS | 7.21 | - | 0.007 |
| SM | 0.05 | 0.823 | + | SMM | 1.95 | + | 0.163 | SMMM | 5.43 | + | 0.020 |
| MSM | 0.37 | + | 0.543 | SSSM | 5.30 | - | 0.021 | ||||
| SSM | 0.17 | - | 0.680 | MMSS | 3.64 | - | 0.056 | ||||
| MMS | 0.04 | - | 0.841 | MSMM | 2.70 | + | 0.100 | ||||
| SMS | 0.00 | - | 1.00 | SSMM | 2.01 | + | 0.156 | ||||
| MMMS | 1.52 | + | 0.218 | ||||||||
| MMSM | 0.87 | + | 0.351 | ||||||||
| SMMS | 0.69 | + | 0.406 | ||||||||
| SSMS | 0.36 | - | 0.549 | ||||||||
| MSMS | 0.32 | - | 0.572 | ||||||||
| MSSM | 0.27 | - | 0.603 | ||||||||
| SMSS | 0.03 | - | 0.862 | ||||||||
| SMSM | 0.01 | + | 0.920 | ||||||||