| Literature DB >> 35097228 |
Gyöngyvér Molnár1, Saleh Ahmad Alrababah2, Samuel Greiff3.
Abstract
Complex problem solving (CPS) is considered an important educational outcome in the 21st century. Despite its importance, we have only little only knowledge of its measurability, development, or comparability in some countries, in particular in those with a short history of computer-based assessment. The results of the current study provide insights into the validity of CPS measurements and shed light on the different behavioral patterns and test-taking behavior in two convenience samples with different sample characteristics of Jordanian (N = 431) and Hungarian (N = 1844) students as they solve complex problems. CPS proved to be measurement-invariant in Jordan and Hungary among university students. Analyzing log data, we identified differences in students' test-taking behavior in terms of the effectiveness of their exploration strategy, time-on-task, and number of trials. Based on the students' exploration strategy behavior, we identified four latent classes in both samples. The tested process indicators proved to be non-invariant over the different latent profiles; that is, there are differences in the role of the number of manipulations executed, time-on-task, and type of strategy used in actual problem-solving achievement between students that fall within different profiles. This study contributes to our understanding of how students from different educational contexts behave while solving complex problems.Entities:
Keywords: Complex problem solving; Exploration strategies; International validity; Latent class analysis; Process indicators; Test-taking behavior
Year: 2022 PMID: 35097228 PMCID: PMC8783123 DOI: 10.1016/j.heliyon.2022.e08775
Source DB: PubMed Journal: Heliyon ISSN: 2405-8440
Comparison of the Jordanian and Hungarian samples along the same variables.
| Demographic data | Jordan | Hungary | t test/Welch test | |||
|---|---|---|---|---|---|---|
| Mean | SD | Mean | SD | |||
| Age | 20.6 | 3.11 | = | 19.99 | 1.82 | n.s. |
| Gender (1: male; 2: female) | 1.53 | .50 | < | 1.59 | .49 | t = -2.5 p < .05 |
| Year of Matura examination | 2015 | 5 | < | 2019 | 1.7 | t = -+6.9 p < .01 |
| Average result of Matura examination – compulsory parts∗ | 85.81 | 9.04 | 76.22 | 15.03 | not comparable | |
| Study goal∗∗ | 1.67 | 1.17 | = | 1.66 | 1.03 | n.s. |
| Parental education∗∗∗ | 4.26 | 2.47 | < | 5.44 | 1.26 | t = -8.5 p < .01 |
| Number of books∗∗∗∗ | 2.94 | 1.98 | < | 4.41 | 1.68 | t = -12.8 p < .01 |
| ICT infrastructure∗∗∗∗∗ | 3.16 | 1.3 | < | 4.19 | 0.967 | t = 17.11 p < .001 |
Note. ∗The compulsory subjects are different in the two countries. In Jordan, they are Arabic, English, History of Jordan, and Islamic Education. In Hungary, they are Hungarian, Mathematics, History, and a foreign language.
∗∗Study goal is measured on a 3-point scale: 1: BA; 2: MA; 3: PhD – the level of education he or she ultimately wishes to complete.
∗∗∗Parental education was measured on a 7-point scale: 1: below primary…7: university degree equivalent to MA or MSc.
∗∗∗∗Number of books: 7-point scale: 1: less than 1 bookshelf…7: more than 1000 books.
∗∗∗∗∗ICT infrastructure at home: 1: none at all….5: a great deal.
Figure 1Screenshot of the MicroDYN task “Game Night.” See the original version of the task in Greiff et al., (2013b). The controllers of the input variables range from “- -” (value = -2) to “++“ (value = +2). They are presented on the left side of the problem environment in the Hungarian-language version and on the right side in the Arabic one. The model is shown at the bottom of the figure. (The English-language version is presented in Figure 2).
Figure 2Demonstrating the meaning of a trial within the “Game Night” problem (English-language version of the task presented in Figure 1). The instruction for the task: Your friends invite you to a game night. They show you an interesting game you do not know the rules to. Find out how the blue, green, and red gambling chips affect the number of cards, the number of pawns, and your score.
Reliabilities of the CPS test in the Jordanian and Hungarian-language contexts with and without the use of log data.
| Type of data | Jordanian | Hungarian |
|---|---|---|
| Reliabilities of the test with traditional scoring (knowledge acquisition phase) | .842 | .858 |
| Reliabilities of the test with traditional scoring (knowledge application phase) | .719 | .750 |
| Reliabilities of the test with traditional scoring (phases 1 and 2) | .872 | .882 |
| Reliabilities of the test (knowledge acquisition phase) consisting of the new dichotomously scored variables in terms of the effectiveness of strategy usage at the beginning of the problem-solving process (ten items) | .921 | .944 |
| Reliabilities of the test (knowledge acquisition phase) consisting of the new categorically scored variables describing the level of isolated variation strategy usage (ten items) | .950 | .946 |
Goodness of fit indices for testing invariance of CPS across the two samples.
| Model | χ2 | df | CFI | TLI | RMSEA | ΔCFI | ΔRMSEA |
|---|---|---|---|---|---|---|---|
| Configural invariance | 944.33 | 334 | .979 | .982 | .040 | - | - |
| Strong factorial invariance | 1062.87 | 350 | .978 | .977 | .042 | .001 | .002 |
| Strict factorial invariance | 1205.66 | 370 | .975 | .974 | .045 | .003 | .003 |
Cross-sample achievement differences in CPS: Problem complexity and problem phase-level differences.
| Complexity of problem (Number of input and output variables and number of connections) | Jordanian | Hungarian | t | p | d | ||
|---|---|---|---|---|---|---|---|
| Mean | SD | Mean | SD | ||||
| 2-2 (2) | 0.59 | 0.492 | 0.77 | 0.422 | 7.48 | <.001 | -0.39 |
| 3-3 (3 or 4) | 0.46 | 0.493 | 0.76 | 0.424 | 12.96 | <.001 | -0.66 |
| 3-3 (2 + 1 or 3 + 1) | 0.13 | 0.319 | 0.28 | 0.447 | 6.72 | <.001 | -0.40 |
| Sum | 0.36 | 0.422 | 0.57 | 0.44 | 13.21 | <.001 | -0.49 |
| 2-2 (2) | 0.56 | 0.498 | 0.72 | 0.450 | 6.62 | <.001 | -0,33 |
| 3-3 (3 or 4) | 0.05 | 0.233 | 0.37 | 0.472 | 13.14 | <.001 | -0,82 |
| 3-3 (2 + 1 or 3 + 1) | 0.02 | 0.126 | 0.17 | 0.348 | 7.93 | <.001 | -0,51 |
| Sum | 0.15 | 0.258 | 0.35 | 0.416 | 16.88 | <.001 | -0.58 |
Note. The ‘+’ sign by the number of connections denotes the presence of internal dynamics (associated with a higher level of complexity) in the problem environment.
Percentage of theoretically effective and non-effective strategy use and traditional CPS scoring.
| Complexity of problem (Number of input and output variables and connections) | Frequency (%) | |||||
|---|---|---|---|---|---|---|
| Theoretically effective strategy use | Theoretically non-effective strategy use | |||||
| Low achievement (%; in proportion to whole sample) | High achievement (%; in proportion to whole sample) | Independent of final score, in proportion to whole sample | Low achievement (%; in proportion to whole sample) | High achievement (%; in proportion to whole sample) | Independent of final score, in proportion to whole sample | |
| 2-2 (2) | 30.2 (13.5) | 69.8 (31.4) | 44.9 | 38.5 (21.2) | 61.4 (33.8) | 55.1 |
| 3-3 (3 or 4) | 40.3 (18.3) | 59.6 (27.1) | 45.5 | 61.8 (33.6) | 38.1 (20.8) | 54.5 |
| 3-3 (2 + 1 or 3 + 1) | 83.3 (34.7) | 16.6 (6.9) | 41.6 | 87.8 (51.1) | 12.1 (7.1) | 58.3 |
| Test | 55.5 (23.9) | 44.4 (19.9) | 43.9 | 67.5 (38.1) | 32.4 (17.9) | 56.1 |
| 2-2 (2) | 21.8 (20.8) | 78.2 (74.4) | 95.25 | 75.6 (4.9) | 24.4 (1.3) | 6.2 |
| 3-3 (3 or 4) | 18.1 (16.7) | 81.9 (75.9) | 92.6 | 94.4 (6.9) | 8.5 (0.7) | 7.4 |
| 3-3 (2 + 1 or 3 + 1) | 69.4 (63.7) | 30.6 (28.1) | 91.8 | 76.7 (6.6) | 1.1 (0.1) | 8.4 |
| Test | 39.4 (36.4) | 60.6 (56.4) | 92.8 | 81.9 (6.3) | 11 (0.8).4 | 7.6 |
Note. Students' achievement was considered high if they managed to achieve a score of 1 based on the traditional scoring method.
Cross-sample differences in students’ test-taking behavior: time-on-task and number of trials.
| Complexity of problem | Jordanian sample | Hungarian sample | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Low achievement | High achievement | Mean | Low achievement | High achievement | Mean | t | p | d | |
| 2-2 (2) | 49.5 | 26.2 | 33.8 | 74.9 | 59.0 | 63.1 | 14.0 | <.001 | -0.67 |
| 3-3 (3 or 4) | 38.5 | 37.0 | 37.6 | 55.9 | 47.2 | 49.2 | 6.0 | <.001 | -0.31 |
| 3-3 (2 or 3 + 1) | 35.2 | 39.8 | 35.5 | 56.0 | 70.9 | 60.1 | 13.6 | <.001 | -0.76 |
| Sum | 39.4 | 35.9 | 36.0 | 59.7 | 59.1 | 56.4 | 18.4 | <.001 | -.57 |
| 2-2 (2) | 1.9 | 1.6 | 1.7 | 5.8 | 6.3 | 6.1 | 21.2 | <.001 | -1.3 |
| 3-3 (3 or 4) | 1.8 | 2.3 | 2.0 | 3.8 | 4.4 | 4.2 | 15.9 | <.001 | -0.9 |
| 3-3 (2 or 3 + 1) | 1.9 | 3.4 | 2.0 | 4.9 | 7.7 | 5.6 | 19.1 | <.001 | -1.19 |
| Sum | 1.9 | 2.6 | 1.9 | 4.8 | 6.2 | 5.3 | 26.2 | <.001 | -1.15 |
Information theory, likelihood ratio, and entropy-based fit indices for latent class analyses.
| Number of latent classes | AIC | BIC | aBIC | Entropy | L–M–R test | P |
|---|---|---|---|---|---|---|
| 2 | 5266 | 5433 | 5303 | .979 | 2797 | .000 |
| 3 | 5008 | 5260 | 5063 | .949 | 298 | .000 |
| 4 | 4948 | 5286 | 5022 | .948 | 100 | .006 |
| 5 | 4935 | 5358 | 5028 | .934 | 54 | .838 |
| 2 | 10376 | 10602 | 10471 | .990 | 6089 | .000 |
| 3 | 9683 | 10025 | 9828 | .958 | 729 | .000 |
| 4 | 9513 | 9970 | 9707 | .959 | 210 | .001 |
| 5 | 9479 | 10052 | 9721 | .949 | 75 | .169 |
Note. AIC = Akaike information criterion; BIC = Bayesian information criterion; aBIC = adjusted Bayesian information criterion; L–M–R test = Lo–Mendell–Rubin adjusted likelihood ratio test. The best fitting model solution is in italics.
Figure 3Four qualitatively different class profiles in the Jordanian sample.
Relative frequencies and average latent class probabilities in the Jordanian and Hungarian-language samples.
| Profiles | Jordanian | Hungarian | ||
|---|---|---|---|---|
| Frequency | Average Latent Class Probabilities | Frequency | Average Latent Class Probabilities | |
| Non-performing explorers | 39.7 | 0.987 | 7.4 | .985 |
| Non-persistent explorers | 6.6 | 0.937 | - | - |
| Restarting slow learners | 15.3 | 0.958 | 3.2 | .934 |
| Rapid learners | - | - | 7.0 | .906 |
| Almost proficient explorers | 38.4 | 0.970 | - | - |
| Proficient explorers | - | - | 82.4 | .989 |
Note. Latent classes are ordered along their levels of isolated variation strategy.
Figure 4Four qualitatively different class profiles in the Hungarian sample.
Figure 5Performance and test-taking behavior among students with different latent class profiles in the two samples. (We have connected the data points to visualize the tendencies.)