| Literature DB >> 26865642 |
Meghan Rector Federer1, Ross H Nehm2, Dennis K Pearl3.
Abstract
Understanding sources of performance bias in science assessment provides important insights into whether science curricula and/or assessments are valid representations of student abilities. Research investigating assessment bias due to factors such as instrument structure, participant characteristics, and item types are well documented across a variety of disciplines. However, the relationships among these factors are unclear for tasks evaluating understanding through performance on scientific practices, such as explanation. Using item-response theory (Rasch analysis), we evaluated differences in performance by gender on a constructed-response (CR) assessment about natural selection (ACORNS). Three isomorphic item strands of the instrument were administered to a sample of undergraduate biology majors and nonmajors (Group 1: n = 662 [female = 51.6%]; G2: n = 184 [female = 55.9%]; G3: n = 642 [female = 55.1%]). Overall, our results identify relationships between item features and performance by gender; however, the effect is small in the majority of cases, suggesting that males and females tend to incorporate similar concepts into their CR explanations. These results highlight the importance of examining gender effects on performance in written assessment tasks in biology.Entities:
Mesh:
Year: 2016 PMID: 26865642 PMCID: PMC4803091 DOI: 10.1187/cbe.14-01-0018
Source DB: PubMed Journal: CBE Life Sci Educ ISSN: 1931-7913 Impact factor: 3.325
Description of item strands and sample populationsa
| Item features | ||||||||
|---|---|---|---|---|---|---|---|---|
| Item strand | Item organism (trait) | Polarity | Taxon | Familiarity | Item number | Sequences | Course | |
| ACORNS 1 (ELSP) | Snail (teeth) | G | A | F | 1 | 1/2/3/4 | Intro Bio 1 (majors) | 342 (48.8) |
| Prosimian (tarsi) | G | A | U | 2 | 2/1/4/3 | |||
| Elm (winged seeds) | G | P | F | 3 | 3/4/1/2 | Intro Bio 2 (majors) | 318 (55.5) | |
| Labiatae (pulegone) | G | P | U | 4 | 4/3/2/1 | |||
| ACORNS 2 (MSLG) | Snail (teeth) | G | A | F | 1 | 1/2/3/4 | Intro Bio 2 (Majors) | 156b (55.9) |
| Grape (tendrils) | G | P | F | 2 | 2/1/4/3 | |||
| Mouse (claws) | L | A | F | 3 | 3/4/1/2 | |||
| Lily (petals) | L | P | F | 4 | 4/3/2/1 | |||
| ACORNS 3 (SRPE) | Snail (poison) | G | A | F | 1 | 1/2/3/4 | Intro Bio | 273 (52.7) |
| Penguin (flight) | L | A | F | 2 | 2/1/4/3 | Intro Bio 2 (Majors) | 244 (57.3) | |
| Elm (winged seeds) | G | P | F | 3 | 3/4/1/2 | |||
| Rose (thorns) | L | P | F | 4 | 4/3/2/1 | Evolution | 123 (58.9) | |
| ACORNS 4 (BRLS) | Bacteria (resistance) | G | – | F | 1 | 1/2/3/4 | Evolution | 126b (59.5) |
| Rose (thorns) | L | P | F | 2 | ||||
| Lily (petals) | L | P | F | 3 | 4/3/2/1 | |||
| G | A | U | 4 | |||||
aItem features include the trait polarity (G = gain of trait; L = loss of trait), the type of taxa/trait (A = animal; P = plant), and the familiarity of the taxa/trait (F = familiar; U = unfamiliar). Please note that the specified courses for each item strand comprised the sample from which participants were selected for different item sequences. For full instrument details, please see Nehm et al. (2012) or visit www.evograder.org.
bParticipants also completed the MC CINS.
Sample scoring of ACORNS responsesa
| Causal elements | Sample responses | KC score | NI score | |
|---|---|---|---|---|
| KC1 | Causes of variation | Winged seeds probably gave the trees some sort of biological advantage that enabled them to | 1 | 1 |
| KC2 | Heritability of variation | |||
| KC3 | Competition | |||
| KC4 | Biotic potential | |||
| KC5 | Limited resources | |||
| KC6 | Differential survival | |||
| KC7 | Change over time | |||
| KC8 | Nonadaptive reasoning | |||
| NI1 | Need as a goal (teleology) | |||
| NI2 | Use and disuse | Natural selection, | 0 | 1 |
| NI2 | Use and disuse | |||
| NI3 | Intentionality | |||
| NI4 | Adapt (acquired traits) | |||
| NI5 | Energy reallocation | |||
| NI6 | Pressure | |||
aResponses were scored for the total number of normative causal (i.e., KCs) and nonnormative causal (i.e., NIs) elements.
Summary statistics for raw KC and NI scores for ACORNS itemsa
| ELSP strand ( | MSLG strand ( | SRPE strand ( | |
|---|---|---|---|
| Females | |||
| KC | 2.24 ± 1.36 | 2.66 ± 1.16 | 1.91 ± 1.33 |
| NI | 0.87 ± 0.90 | 1.30 ± 0.99 | 1.11 ± 0.95 |
| | –0.05 | 0.16 | 1.23 |
| ES | 0.00 | 0.02 | 0.10 |
| Males | |||
| KC | 2.24 ± 1.30 | 2.63 ± 1.27 | 1.78 ± 1.23 |
| NI | 0.90 ± 0.90 | 2.01 ± 1.40 | 1.11 ± 0.91 |
| | –0.39 | –3.86*** | –0.23 |
| ES | –0.03 | –0.58 | 0.00 |
aTotal scores represent the average diversity of KCs and NIs for responses to that item strand. t = independent t test of the group differences between male and female students; ES = effect size as calculated by Cohen’s d.
***p < 0.001.
Differential test functioning for male and female performance on CR item sequences
| KC measures (±SE) | NI measures (±SE) | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Strand | Item | Females | Males | Cohen’s | Females | Males | Cohen’s | ||
| ELSP | E | 3.51 ± 0.07 | 3.68 ± 0.08 | 1.59 | – | −3.19 ± 0.1 | −3.63 ± 0.12 | −2.81** | 0.22 |
| L | 3.67 ± 0.07 | 3.95 ± 0.08 | 2.63** | 0.53 | −3.62 ± 0.12 | −3.83 ± 0.12 | −1.23 | – | |
| S | 2.74 ± 0.06 | 2.75 ± 0.07 | 0.1 | – | −2.85 ± 0.09 | −3.15 ± 0.1 | −2.22* | 0.17 | |
| P | 3.42 ± 0.07 | 3.86 ± 0.08 | 4.13** | 0.33 | −3.67 ± 0.12 | −3.63 ± 0.12 | 0.23 | – | |
| SRPE | S | 2.52 ± 0.07 | 3.14 ± 0.07 | −6.26** | 0.50 | −3.34 ± 0.12 | −3.69 ± 0.1 | 2.24* | 0.17 |
| P | 3.82 ± 0.1 | 4.07 ± 0.08 | −1.95 | – | −2.52 ± 0.1 | −2.87 ± 0.09 | 2.60** | 0.20 | |
| E | 3.23 ± 0.09 | 3.68 ± 0.08 | −3.73** | 0.29 | −3.91 ± 0.14 | −4.49 ± 0.13 | 3.03** | 0.24 | |
| R | 3.15 ± 0.08 | 3.7 ± 0.08 | −4.86** | 0.38 | −2.95 ± 0.11 | −3.54 ± 0.1 | 3.96** | 0.31 | |
t = independent t test of the group differences between male and female students; ES = effect size as calculated by Cohen’s d.
*p < 0.05.
**p < 0.01.
Differential item functioning for measures of male and female performance
| Mean item difficulties (±SE) | ||||||
|---|---|---|---|---|---|---|
| Item strand | Item | Measure | Females | Males | ES | |
| ELSP | Prosimian | KC | 3.53 ± 0.07 | 3.78 ± 0.08 | −2.31* | 0.18 |
| NI | −3.78 ± 0.12 | −3.56 ± 0.12 | −1.34 | – | ||
| SRPE | Penguin | KC | 3.86 ± 0.08 | 4.14 ± 0.10 | −2.16* | 0.15 |
| NI | −2.69 ± 0.10 | −2.82 ± 0.10 | 0.97 | – | ||
t = independent t test of the group differences between male and female students; ES = effect size as calculated by Cohen’s d.
*p < 0.05.
Significance test for gender achievement differences for MC and CR assessmentsa
| Instrument/sample | Female mean | Score | Male mean | Score | Score | |
|---|---|---|---|---|---|---|
| ACORNS | KCD | NID | KCD | NID | KCD | NID |
| MSLG strand | 38.0 | 21.6 | 27.5 | 33.5 | 0.169 | −3.863*** |
| BRLS strand | 47.4 | 11.8 | 47.0 | 11.5 | 0.113 | 0.130 |
| CINS | Total score | Total score | Total score | |||
| MSLG strand | 75.8 | 71.8 | 1.474 | |||
| BRLS strand | 67.5 | 66.1 | 0.340 | |||
| Correlation coefficients | ||||||
| MSLG strand | 0.295** | −0.026 | 0.337** | −0.179 | ||
| BRLS strand | 0.342** | 0.004 | 0.573** | −0.118 | ||
aTo allow comparison across the three measures of student performance, scores are presented as a percent of total possible. In addition, KCD and NID response scores are used here (as opposed to total KCs or total NIs) to provide a more equivalent measure of comparison with the CINS total scores. t = independent t test of the group differences between male and female students.
**p < 0.01.
***p < 0.001.
Pairwise comparisons of male and female performance on CR and MC assessmentsa
aComparisons were made between all possible pairs of male and female performances on the ACORNS and CINS. Comparisons were made using KCD and NID scores for the ACORNS and total score for the CINS. The number of cases represents the number of pairs represented by each category: type 1: FCINS < MCINS but FACORNS > MACORNS; type 2: FCINS < MCINS but FACORNS = MACORNS; type 3: FCINS = MCINS but FACORNS > MACORNS; type 4: FCINS > MCINS but FACORNS < MACORNS; type 5: FCINS > MCINS but FACORNS = MACORNS; type 6: FCINS = MCINS but FACORNS < MACORNS. Shaded cells represents categories in which FACORNS ≥ MACORNS and FCINS ≤ MCINS.
Summary statistics for raw KC scores and NI for ACORNS itemsa
| Written: MSLG | Oral: OSLG | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Totalb | Gc | Lc | Ac | Pc | Totalb | Gc | Lc | Ac | Pc | |
| KCs | ||||||||||
| Female | 2.61 ± 1.38 | 3.08 ± 2.06 | 2.50 ± 1.79 | 2.50 ± 1.99 | 3.08 ± 2.17 | 4.74 ± 1.03 | 10.6 ± 2.48 | 6.21 ± 2.29 | 6.44 ± 2.38 | 6.94 ± 2.13 |
| Male | 2.67 ± 1.33 | 3.10 ± 2.13 | 3.05 ± 1.91 | 2.83 ± 1.98 | 3.31 ± 2.10 | 5.07 ± 1.17 | 10.10 ± 3.52 | 7.07 ± 2.18 | 7.38 ± 2.25 | 7.14 ± 2.36 |
| | −0.19 | −0.35 | −1.48 | −0.83 | −0.53 | −1.50 | 0.989 | −1.91 | −2.02* | −0.46 |
| ES | −0.04 | 0.00 | −0.29 | −0.16 | −0.10 | −0.29 | 0.16 | −0.38 | −0.40 | −0.08 |
| NIs | ||||||||||
| Female | 3.10 ± 0.39 | 5.35 ± 1.14 | 5.35 ± 1.05 | 5.55 ± 1.03 | 5.16 ± 1.24 | 3.35 ± 0.72 | 7.55 ± 1.77 | 5.26 ± 1.53 | 5.15 ± 1.32 | 5.52 ± 1.25 |
| Male | 3.14 ± 0.35 | 5.55 ± 0.70 | 5.52 ± 0.98 | 5.57 ± 0.73 | 5.50 ± 0.80 | 3.52 ± 0.77 | 7.67 ± 2.12 | 5.76 ± 1.24 | 5.55 ± 1.04 | 5.76 ± 1.28 |
| | −0.61 | −0.97 | −0.85 | −0.12 | −1.68 | −1.13 | −0.30 | −1.76 | −1.64 | −0.97 |
| ES | −0.10 | −0.21 | −0.16 | −0.02 | −0.32 | −0.22 | −0.06 | −0.35 | −0.33 | −0.18 |
aTotal scores represent the average diversity of KCs and NIs for responses to that item strand. Scores reported for gain (G), loss (L), animal (A), and plant (I) items represent the average total KCs and NIs used in students’ explanations for these items. t = independent t test of the group differences between male and female students; ES = effect size as calculated by Cohen’s d.
bFour items.
cTwo items.
*p < 0.05.
Differential item functioning for oral and written responses to ACORNS itemsa
| DIF measures (±SE) | |||||
|---|---|---|---|---|---|
| Item strand | Item | Measure | Females | Males | |
| Written: MSLG | Mouse | KC | 1.29 ± 0.16 | 1.11 ± 0.18 | 0.74 |
| NI | −1.41 ± 0.21 | −1.01 ± 0.24 | −1.27 | ||
| Snail | KC | 1.21 ± 0.16 | 1.21 ± 0.18 | 0.00 | |
| NI | −1.10 ± 0.19 | −1.01 ± 0.24 | −0.29 | ||
| Lily | KC | 1.21 ± 0.16 | 0.91 ± 01.8 | 1.26 | |
| NI | −0.70 ± 0.18 | −0.90 ± 0.24 | 0.67 | ||
| Grape | KC | 0.43 ± 0.15 | 0.75 ± 0.18 | −1.35 | |
| NI | −0.96 ± 0.19 | −0.96 ± 0.24 | 0.00 | ||
| Oral: OSLG | Opossum | KC | 0.20 ± 0.15 | −0.56 ± 0.17 | 3.29*** |
| NI | 0.56 ± 0.16 | 0.56 ± 0.19 | 0.00 | ||
| Snail | KC | −0.79 ± 014 | −0.68 ± 0.17 | −0.51 | |
| NI | 0.68 ± 0.16 | 0.68 ± 0.19 | 0.00 | ||
| Lily | KC | −0.50 ± 0.15 | −0.29 ± 0.18 | −0.92 | |
| NI | 0.50 ± 0.15 | 0.36 ± 0.19 | 0.59 | ||
| Grape | KC | −0.65 ± 0.17 | −0.79 ± 0.14 | −0.64 | |
| NI | 0.17 ± 0.15 | 0.57 ± 0.19 | −1.65 | ||
aResponse measures are reported as total KCs and total (NIs) for each item. t = independent t test of the group differences between male and female students.
***p < 0.001.