| Literature DB >> 30631711 |
Jayson M Nissen1, Manher Jariwala2, Eleanor W Close3, Ben Van Dusen1.
Abstract
BACKGROUND: High-stakes assessments, such the Graduate Records Examination, have transitioned from paper to computer administration. Low-stakes research-based assessments (RBAs), such as the Force Concept Inventory, have only recently begun this transition to computer administration with online services. These online services can simplify administering, scoring, and interpreting assessments, thereby reducing barriers to instructors' use of RBAs. By supporting instructors' objective assessment of the efficacy of their courses, these services can stimulate instructors to transform their courses to improve student outcomes. We investigate the extent to which RBAs administered outside of class with the online Learning About STEM Student Outcomes (LASSO) platform provide equivalent data to tests administered on paper in class, in terms of both student participation and performance. We use an experimental design to investigate the differences between these two assessment conditions with 1310 students in 25 sections of 3 college physics courses spanning 2 semesters.Entities:
Keywords: Computer-based test; Hierarchical linear models; Participation; Performance; Research-based assessments
Year: 2018 PMID: 30631711 PMCID: PMC6310409 DOI: 10.1186/s40594-018-0117-4
Source DB: PubMed Journal: Int J STEM Educ ISSN: 2196-7822
Participation and GPA for students in previous studies
| Source | Gender | Participant grades | Non-participant grades | Participant |
| Odds | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| Mean |
| SD | Mean |
| SD | Rate | F/M | |||
| M | 2.69 | 90 | 1.28 | 2.1 | 92 | 1.28 | 0.49 | 0.59 | 1.37 | |
| F | 2.78 | 27 | 1.26 | 2.05 | 13 | 1.16 | 0.68 | 0.73 | ||
| Kost et al. 2010 | M | 2.85 | 1257 | 0.8 | 1.93 | 500 | 1.1 | 0.72 | 0.92 | 1.11 |
| F | 2.8 | 447 | 0.8 | 1.96 | 114 | 1.2 | 0.80 | 0.84 | ||
| Kost et al. 2009 | M | 2.82 | 1563 | 0.8 | 2.14 | 1152 | 1.2 | 0.58 | 0.68 | 1.09 |
| F | 2.74 | 533 | 0.8 | 1.89 | 315 | 1.1 | 0.63 | 0.85 | ||
| Cahill et al. 2014 | All | – | 366 | – | – | 314 | – | 0.54 | – | – |
| All | – | 773 | – | – | 448 | – | 0.63 | – | – | |
| All | – | 360 | – | – | 219 | – | 0.62 | – | – | |
| All | – | 738 | – | – | 384 | – | 0.66 | – | – | |
| Brewe et al. 2010 | All | – | 258 | – | – | 65 | – | 0.80 | – | – |
| All | – | 758 | – | – | 1743 | – | 0.30 | – | – | |
Course demographic data and instruments used
| Semester 1 (Spring 2016) | Semester 2 (Fall 2016) | Instruments | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Sections | Students | Male | URM | Sections | Students | Male | URM | CI/AS | |
| A Mech | 2 | 194 | 58% | 46% | 6 | 490 | 50% | 52% | FCI/CLASS |
| C Mech | 5 | 188 | 74% | 45% | 4 | 175 | 67% | 60% | FCI/CLASS |
| C E&M | 4 | 117 | 70% | 52% | 4 | 146 | 74% | 47% | CSEM/CLASS |
| Total | 11 | 499 | 67% | 47% | 14 | 811 | 58% | 53% | - |
Fig. 1Student groupings for RBA assignments using stratified random sampling. Each student takes one assessment online using LASSO and one in-class on paper at the beginning and again at the end of the semester
Fig. 2The structure of the data is hierarchical with measures (either participation or scores) nested in students nested in course sections
Variables in the final participation and performance models with outcome variables in italics
| Model | Structure | Variables | |
|---|---|---|---|
| Level | Participation | Performance | |
| 1 | Assessment |
|
|
| Assessment condition | Assessment | ||
| and timing | timing | ||
| 2 | Students | Course grade | Assessment |
| condition | |||
| Gender | |||
| 3 | Sections | Recommended practices (CBT only) | Course type |
Participation rates for pre and post CBT and PPT administered RBAs
| Gender differences | Mean course grade | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Cond. | Time | AS | CI | Male | Female | Odds | Part. | Non-part. |
|
| (F/M) | |||||||||
| CBT | Pre | 71% | 67% | 66% | 76% | 0.99 | 2.86 | 2.13 | 0.73 |
| Post | 59% | 54% | 54% | 61% | 1.13 | 2.97 | 2.20 | 0.77 | |
| PPT | Pre | 94% | 94% | 94% | 95% | 1 | 2.68 | 1.95 | 0.73 |
| Post | 75% | 74% | 74% | 75% | 1 | 2.87 | 1.95 | 0.92 | |
HGLMs comparing student participation on the CBT and PPT pretest and posttests
| Final estimation of fixed effects with robust standard errors | ||||||
|---|---|---|---|---|---|---|
| Model 1 | Model 2 | Model 3 | ||||
| Coef. |
| Coef. |
| Coef. |
| |
| CBT pre | ||||||
| For intercept 2 | ||||||
| For intercept 3 | 0.679 | < 0.001 | 0.256 | 0.434 | − 0.671 | 0.081 |
| Practices | – | – | 0.214 | 0.116 | 0.182 | 0.097 |
| Gender | – | – | – | – | 0.269 | 0.086 |
| D grade | – | – | – | – | 0.142 | 0.699 |
| C grade | – | – | – | – | 0.558 | 0.077 |
| B grade | – | – | – | – | 1.102 | 0.002 |
| A grade | – | – | – | – | 1.526 | < 0.001 |
| CBT post | ||||||
| For intercept 2 | ||||||
| For intercept 3 | 0.296 | 0.118 | −0.767 | 0.008 | − 2.678 | < 0.001 |
| Practices | – | – | 0.534 | < 0.001 | 0.573 | < 0.001 |
| Gender | – | – | – | – | 0.207 | 0.281 |
| D grade | – | – | – | – | 0.946 | 0.013 |
| C grade | – | – | – | – | 1.395 | < 0.001 |
| B grade | – | – | – | – | 2.057 | < 0.001 |
| A grade | – | – | – | – | 2.390 | < 0.001 |
| PPT pre | ||||||
| For intercept 2 | ||||||
| For intercept 3 | 2.290 | < 0.001 | 2.266 | < 0.001 | 1.361 | < 0.001 |
| Gender | – | – | – | – | 0.130 | 0.619 |
| D grade | – | – | – | – | 0.496 | 0.281 |
| C grade | – | – | – | – | 0.675 | 0.062 |
| B grade | – | – | – | – | 0.835 | 0.042 |
| A grade | – | – | – | – | 0.909 | 0.049 |
| PPT post | ||||||
| For intercept 2 | ||||||
| For intercept 3 | 1.235 | < 0.001 | 1.235 | < 0.001 | −0.706 | 0.047 |
| Gender | – | – | – | – | 0.224 | 0.180 |
| D grade | – | – | – | – | 1.522 | 0.001 |
| C grade | – | – | – | – | 1.514 | < 0.001 |
| B grade | – | – | – | – | 2.166 | < 0.001 |
| A grade | – | – | – | – | 2.493 | < 0.001 |
| Level 1 and level 2 variance | ||||||
| CBT pre | 1.080 | 1.080 | 0.805 | |||
| CBT post | 1.170 | 1.390 | 1.077 | |||
| PPT pre | 1.130 | 1.440 | 1.156 | |||
| PPT post | 1.100 | 1.200 | 0.889 | |||
| Level 3 variance | ||||||
| CBT pre | 0.820 | 0.700 | 0.740 | |||
| CBT post | 1.220 | 0.670 | 0.830 | |||
| PPT pre | 0.560 | 0.485 | 0.690 | |||
| PPT post | 1.340 | 1.370 | 1.180 | |||
Fig. 3Student predicted participation rates (+/ − 1 standard error) by assessment condition and use of CBT recommended practices from model 2. When instructors used all four recommended practices, participation rates on CBT and PPT were similar
Fig. 4Participation by course grade and recommended practices based on model 3. Gender was entered into the linear equation as the mean value (0.39) to simplify the figure. PPT pretest is not shown because its value varies from 96 to 100% across the course grades. The results indicate that there were large differences in participation across the different course grades and that when instructors used all four recommended practices rates on the CBT posttests were similar to participation rates on the PPT posttests across the different courses grades
HLMs comparing performance between PPT and CBT administered assessments for the CI and AS
| Final estimation of fixed effects with robust standard errors | ||||||
|---|---|---|---|---|---|---|
| CI models | AS models | |||||
| CI 1 | CI 2 | CI 3 | AS 1 | AS 2 | AS 3 | |
| Coef. | Coef. | Coef. | Coef. | Coef. | Coef. | |
| For intercept 1 | ||||||
| For intercept 2 | ||||||
| Intercept 3 | 30.99*** | 31.20*** | – | 43.97*** | 44.11*** | – |
| Alg. mech. | – | – | 26.93*** | – | – | 42.87*** |
| Calc. mech. | – | – | 36.36*** | – | – | 43.77*** |
| Calc. E & M | – | – | 29.68*** | – | – | 47.19*** |
| For condition (CBT) | ||||||
| Intercept 3 | – | − 0.42 | – | – | −0.25 | – |
| Alg. mech. | – | – | 0.12 | – | – | −0.61 |
| Calc. mech. | – | – | − 0.36 | – | – | 1.63 |
| Calc. E & M | – | – | −1.18 | – | – | −2.21 |
| For post | ||||||
| For intercept 2 | ||||||
| Intercept 3 | 13.04*** | 12.84*** | – | 1.76** | 1.33* | – |
| Alg. mech. | – | – | 7.45*** | – | – | 1.66 |
| Calc. mech. | – | – | 18.61*** | – | – | 2.98* |
| Calc. E & M | – | – | 11.59*** | – | – | −1.70 |
| For condition (CBT) | ||||||
| Intercept 3 | – | 0.42 | – | – | 0.84 | – |
| Alg. mech. | – | – | −0.32 | – | – | 0.56 |
| Calc. mech. | – | – | −0.80 | – | – | −0.27 |
| Calc. E & M | – | – | 3.27 | – | – | 2.40 |
| Level 1 and level 2 variance | ||||||
| Intercept 1 | 135.46 | 135.22 | 135.61 | 95.12 | 93.56 | 93.53 |
| Level 1 | 125.66 | 125.05 | 124.65 | 98.58 | 98.08 | 97.24 |
| Level 3 variance | ||||||
| Int.1/int.2 | 4.16 | 4.35 | 0.74 | 1.01 | 1.4 | 0.6 |
| Int.1/cond. (CBT) | – | 0.89 | 0.89 | – | 4.81 | 3.45 |
| Post/int.2 | 5.51 | 5.25 | 2.43 | 0.81 | 0.44 | 0.23 |
| Post/cond. (CBT) | – | 1.94 | 1.92 | – | 2.63 | 3.12 |
| Total level 3 | 9.67 | 12.43 | 5.98 | 1.82 | 9.28 | 7.4 |
| Total variance | 270.79 | 272.7 | 266.24 | 195.52 | 200.92 | 198.17 |
***p < 0.001. **p < 0.01. *p < 0.05
Fig. 5A comparison between CBT and PPT administered concept inventories and attitudinal surveys based on AS model 3 and CI model 3. Error bars are 1 standard error. All of the differences between CBTs and PPTs were small, none of the differences were statistically significant, and neither assessment condition was consistently higher than the other. These results indicate that there is no difference in performance between assessments administered as CBTs and those administered as PPTs