| Literature DB >> 30863344 |
Jianlin Yuan1, Yue Xiao2, Hongyun Liu2,3.
Abstract
As one of the important 21st-century skills, collaborative problem solving (CPS) has aroused widespread concern in assessment. To measure this skill, two initiative approaches have been created: the human-to-human and human-to-agent modes. Between them, the human-to-human interaction is much closer to the real-world situation and its process stream data can reveal more details about the cognitive processes. The challenge for fully tapping into the information obtained from this mode is how to extract and model indicators from the data. However, the existing approaches have their limitations. In the present study, we proposed a new paradigm for extracting indicators and modeling the dyad data in the human-to-human mode. Specifically, both individual and group indicators were extracted from the data stream as evidence for demonstrating CPS skills. Afterward, a within-item multidimensional Rasch model was used to fit the dyad data. To validate the paradigm, we developed five online tasks following the asymmetric mechanism, one for practice and four for formal testing. Four hundred thirty-four Chinese students participated in the assessment and the online platform recorded their crucial actions with time stamps. The generated process stream data was handled with the proposed paradigm. Results showed that the model fitted well. The indicator parameter estimates and fitting indexes were acceptable, and students were well differentiated. In general, the new paradigm of extracting indicators and modeling the dyad data is feasible and valid in the human-to-human assessment of CPS. Finally, the limitations of the current study and further research directions are discussed.Entities:
Keywords: collaborative problem solving; dyad data; indicator extracting; multidimensional model; process stream data
Year: 2019 PMID: 30863344 PMCID: PMC6399305 DOI: 10.3389/fpsyg.2019.00369
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Examples of events defined in the task of Exploring Air Conditioner.
| Event type | Event name | Role | Record format | Explanation of capturing an action |
|---|---|---|---|---|
| Common events | Task start | A, B | task start | Record the start of a task |
| Task end | A, B | task end | Record the end of a task | |
| Chat | A, B | free-form chat messages | Record the content of chat messages | |
| Unique events | Control A | A | controlA: | Record the action of changing the position of the slider in the control A |
| Control B | A | controlB: | Record the action of changing the position of the slider in the control B | |
| Control C | B | controlC: | Record the action of changing the position of the slider in the control C | |
| Control D | B | controlD: | Record the action of changing the position of the slider in the control D | |
| Apply | A, B | apply: | Record the action of clicking the button of Apply | |
FIGURE 1A part of process stream data from Exploring Air Conditioner.
Examples of indicator specifications.
| Indicator name | Mapping element | Definition of the indicator | Algorithm | Output |
|---|---|---|---|---|
| T1A01 | Action | The number of messages and actions generated by student A, reflecting his/her activeness in collaboration. | In the process stream data of student A, count all the events that he/she generated. | The count value. |
| T1G02 | Interaction | The number of interactive chat blocks (A, B) between two students, reflecting their interaction. Consecutive chats without interrupted actions from the same student are counted as one. (e.g., A,B,A,B = 2 chat blocks; AA,B,A,B = 2 chat blocks) | Step 1: Find all sequences of consecutive chat messages without any interrupted actions in the process stream data of Student A and B. | The count value. |
| Step 2: Count the number of chat blocks in each chat sequence. Add one to the value of the indicator if one chat block is found. | ||||
FIGURE 2The frequency distribution of T1A01.
FIGURE 3A diagram of the within-item Rasch model for the dyad data.
Kappa consistency of indicators between the scoring program and the human rater.
| Indicators | Kappa coefficient | Indicators | Kappa coefficient | Indicators | Kappa coefficient |
|---|---|---|---|---|---|
| T1A03 | 0.659 | T4A02 | 0.605 | T4B03 | 0.852 |
| T1A07 | 0.595 | T1B03 | 1.000 | T4B01 | 0.474 |
| T1A09 | 0.857 | T1B07 | 0.842 | T1G01 | 0.857 |
| T2A01 | 0.857 | T1B09 | 0.471 | T2G01 | 1.000 |
| T3A04 | 0.587 | T2B01 | 1.000 | T3G01 | 0.789 |
| T3A06 | 0.706 | T3B04 | 1.000 | T4G01 | 1.000 |
| T4A03 | 0.700 | T3B06 | 0.400 | ||
Model fit of the two-dimensional Rasch model.
| Sample size | Final deviance | Separation reliability | Reliability of dimension 1 | Reliability of dimension 2 | Correlation of dimension 1 and 2 |
|---|---|---|---|---|---|
| 217 | 12869.646 | 0.981 | 0.886 | 0.891 | 0.561 |
Results of indicator parameter estimates and fit.
| Indicator | Discrimination | Difficulty | Error | MNSQ | Confidence interval | T | |
|---|---|---|---|---|---|---|---|
| T1A01 | 0.24 | -1.554 | 0.180 | 1.01 | 0.79 | 1.21 | 0.1 |
| T1A04 | 0.34 | -0.309 | 0.083 | 1.09 | 0.87 | 1.13 | 1.3 |
| T1A07 | 0.42 | 0.139 | 0.142 | 0.95 | 0.93 | 1.07 | -1.5 |
| T1A09 | 0.26 | 1.189 | 0.163 | 0.99 | 0.84 | 1.16 | -0.1 |
| T2A01 | 0.46 | -1.860 | 0.196 | 0.93 | 0.74 | 1.26 | -0.5 |
| T2A03 | 0.28 | 1.516 | 0.176 | 0.97 | 0.80 | 1.20 | -0.3 |
| T2A01 | 0.28 | -0.723 | 0.149 | 0.99 | 0.89 | 1.11 | -0.2 |
| T3A01 | 0.39 | -1.943 | 0.201 | 0.94 | 0.73 | 1.27 | -0.4 |
| T3A04 | 0.35 | 0.935 | 0.154 | 0.94 | 0.87 | 1.13 | -0.9 |
| T3A06 | 0.22 | 0.835 | 0.151 | 1.01 | 0.88 | 1.12 | 0.1 |
| T3A01 | 0.15 | -1.243 | 0.120 | 1.09 | 0.84 | 1.16 | 1.1 |
| T3A02 | 0.15 | 0.048 | 0.084 | 1.30 | 0.88 | 1.12 | 4.5 |
| T3A03 | 0.41 | 1.027 | 0.090 | 0.95 | 0.79 | 1.21 | -0.4 |
| T4A01 | 0.30 | -1.681 | 0.190 | 0.98 | 0.76 | 1.24 | -0.1 |
| T4A03 | 0.50 | 0.655 | 0.151 | 0.88 | 0.90 | 1.10 | -2.3 |
| T4A04 | 0.38 | 0.794 | 0.154 | 0.92 | 0.88 | 1.12 | -1.3 |
| T4A01 | 0.44 | 0.408 | 0.116 | 0.94 | 0.83 | 1.17 | -0.7 |
| T4A02 | 0.39 | 0.412 | 0.147 | 0.94 | 0.91 | 1.09 | -1.3 |
| T4A03 | 0.49 | -0.179 | 0.121 | 0.89 | 0.82 | 1.18 | -1.3 |
| T1B01 | 0.33 | -1.781 | 0.193 | 0.95 | 0.75 | 1.25 | -0.4 |
| T1B04 | 0.25 | -0.067 | 0.079 | 1.18 | 0.88 | 1.12 | 2.9 |
| T1B07 | 0.31 | 0.077 | 0.141 | 0.99 | 0.93 | 1.07 | -0.3 |
| T1B09 | 0.36 | 1.025 | 0.157 | 0.96 | 0.86 | 1.14 | -0.6 |
| T2B01 | 0.39 | -1.566 | 0.179 | 0.95 | 0.78 | 1.22 | -0.4 |
| T2B03 | 0.31 | 1.038 | 0.157 | 0.97 | 0.86 | 1.14 | -0.4 |
| T2B01 | 0.32 | -0.896 | 0.152 | 0.96 | 0.87 | 1.13 | -0.6 |
| T3B01 | 0.36 | -2.009 | 0.206 | 0.95 | 0.71 | 1.29 | -0.3 |
| T3B04 | 0.29 | 1.172 | 0.161 | 0.96 | 0.85 | 1.15 | -0.5 |
| T3B06 | 0.30 | 0.636 | 0.146 | 1.00 | 0.90 | 1.10 | 0.0 |
| T3B01 | 0.22 | -0.708 | 0.101 | 1.12 | 0.85 | 1.15 | 1.5 |
| T3B02 | 0.04 | 0.036 | 0.085 | 1.34 | 0.88 | 1.12 | 5.0 |
| T3B03 | 0.42 | 1.113 | 0.087 | 0.97 | 0.79 | 1.21 | -0.3 |
| T4B01 | 0.43 | -1.748 | 0.193 | 0.96 | 0.75 | 1.25 | -0.3 |
| T4B03 | 0.37 | 0.329 | 0.145 | 0.96 | 0.92 | 1.08 | -1.0 |
| T4B01 | 0.22 | -0.147 | 0.144 | 0.87 | 0.93 | 1.07 | -3.8 |
| T4B03 | 0.04 | 0.679 | 0.092 | 0.88 | 0.84 | 1.16 | -1.5 |
| T1G01 | 0.42 | -0.277 | 0.080 | 1.18 | 0.87 | 1.13 | 2.6 |
| T1G02 | 0.43 | -0.035 | 0.076 | 1.18 | 0.88 | 1.12 | 2.9 |
| T2G01 | 0.37 | -0.439 | 0.043 | 1.10 | 0.82 | 1.18 | 1.1 |
| T2G02 | 0.22 | 0.322 | 0.083 | 0.90 | 0.87 | 1.13 | -1.4 |
| T3G01 | 0.04 | 0.015 | 0.064 | 1.05 | 0.82 | 1.18 | 0.6 |
| T3G02 | 0.50 | 0.163 | 0.061 | 1.07 | 0.83 | 1.17 | 0.8 |
| T4G01 | 0.51 | -0.157 | 0.070 | 1.01 | 0.81 | 1.19 | 0.1 |
| T4G02 | 0.47 | 0.055 | 0.067 | 1.12 | 0.82 | 1.18 | 1.2 |
FIGURE 4The indicator and latent distribution map of two-dimensional Rasch model. Each ‘X’ represents 1.4 cases.
Descriptive statistics of students’ ability of successful and failure group in each task.
| N | Minimum | Maximum | Mean | Std. deviation | |
|---|---|---|---|---|---|
| Failure group in task 1 | 156 | -2.169 | 1.183 | -0.237 | 0.626 |
| Successful group in task 1 | 254 | -2.029 | 2.149 | 0.184 | 0.647 |
| Failure group in task 2 | 210 | -2.169 | 1.521 | -0.205 | 0.629 |
| Successful group in task 2 | 222 | -1.975 | 2.149 | 0.181 | 0.676 |
| Failure group in task 3 | 352 | -2.170 | 1.900 | -0.139 | 0.636 |
| Successful group in task 3 | 82 | -0.784 | 2.149 | 0.572 | 0.549 |
| Failure group in task 4 | 226 | -2.169 | 1.267 | -0.210 | 0.585 |
| Successful group in task 4 | 148 | -1.150 | 2.150 | 0.485 | 0.552 |