| Literature DB >> 22949426 |
Abstract
Learning science requires higher-level (critical) thinking skills that need to be practiced in science classes. This study tested the effect of exam format on critical-thinking skills. Multiple-choice (MC) testing is common in introductory science courses, and students in these classes tend to associate memorization with MC questions and may not see the need to modify their study strategies for critical thinking, because the MC exam format has not changed. To test the effect of exam format, I used two sections of an introductory biology class. One section was assessed with exams in the traditional MC format, the other section was assessed with both MC and constructed-response (CR) questions. The mixed exam format was correlated with significantly more cognitively active study behaviors and a significantly better performance on the cumulative final exam (after accounting for grade point average and gender). There was also less gender-bias in the CR answers. This suggests that the MC-only exam format indeed hinders critical thinking in introductory science classes. Introducing CR questions encouraged students to learn more and to be better critical thinkers and reduced gender bias. However, student resistance increased as students adjusted their perceptions of their own critical-thinking abilities.Entities:
Mesh:
Year: 2012 PMID: 22949426 PMCID: PMC3433302 DOI: 10.1187/cbe.11-11-0100
Source DB: PubMed Journal: CBE Life Sci Educ ISSN: 1931-7913 Impact factor: 3.325
List of cognitively passive and active learning behaviors that students reported in their study surveys
| Cognitively passive learning behaviors | Cognitively active learning behaviors |
|---|---|
| I previewed the reading before class. | I asked myself: “How does it work?” and “Why does it work this way?” |
| I came to class. | I drew my own flowcharts or diagrams. |
| I read the assigned text. | I broke down complex processes step-by-step. |
| I reviewed my class notes. | I wrote my own study questions. |
| I rewrote my notes. | I reorganized the class information. |
| I made index cards. | I compared and contrasted. |
| I highlighted the text. | I fit all the facts into a bigger picture. |
| I looked up information. | I tried to figure out the answer before looking it up. |
| I asked a classmate or tutor to explain the material to me. | I closed my notes and tested how much I remembered. |
| I asked myself: “How are individual steps connected?” and “Why are they connected?” | |
| I drew and labeled diagrams from memory and figured out missing pieces. | |
| I asked myself: “How does this impact my life?” and “What does it tell me about my body?” | |
| I used Bloom's taxonomy to write my own study questions |
Figure 1.Study times and behaviors for introductory biology. (A) Reported study times (mean h/wk ± 2 SEM) during non–exam weeks for the introductory biology class in this study. (B) Reported number of study behaviors (mean ± 2 SEM) during non–exam weeks: cognitively passive (surface) and cognitively active (deep) learning behaviors are shown separately.
Reported study time and study activities through the semestera
| Change over time | Time 1 | Time 2 | Time 3 | Time 4 | Compare 1–2 | Compare 2–3 | Compare 3–4 | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Section | Mean ± SEM | Mean ± SEM | Mean ± SEM | Mean ± SEM | ||||||||
| MC | 172 | Time | 3.31 ± 0.18 | 3.59 ± 0.19 | 4.33 ± 0.23 | 3.74 ± 0.22 | 4,374 | >0.05 | 5,388.5 | <0.05* | 2,045.5 | <0.05* |
| Passive | 3.93 ± 0.13 | 3.66 ± 0.27 | 3.77 ± 0.14 | 3.58 ± 0.15 | 4,861 | >0.05 | 3,699 | >0.05 | 3,778.5 | >0.05 | ||
| Active | 1.91 ± 0.17 | 2.31 ± 0.18 | 3.20 ± 0.23 | 2.55 ± 0.22 | 3,915.5 | <0.05* | 5,966 | <0.05* | 2,519.5 | <0.05* | ||
| MC+SA | 121 | Time | 3.31 ± 0.22 | 3.33 ± 0.20 | 4.60 ± 0.24 | 3.27 ± 0.22 | 1,840 | >0.05 | 3,403 | <0.05* | 1,444 | <0.05* |
| Passive | 3.91 ± 0.17 | 3.62 ± 0.17 | 3.82 ± 0.16 | 3.85 ± 0.17 | 1,600 | >0.05 | 2,644 | >0.05 | 2,049.5 | >0.05 | ||
| Active | 2.54 ± 0.21 | 3.17 ± 0.28 | 3.87 ± 0.28 | 3.42 ± 0.29 | 3,159 | <0.05* | 3,043.5 | <0.05* | 1,707 | <0.05* | ||
*Statistically significant at p < 0.05.
aStudy time is reported in hours per week (mean ± SEM), study activities (passive and active) are reported as number of activities used in that week (mean ± SEM). Time 1: before exam 1; time 2: before exam 2; time 3: before exam 4; time 4: before final exam. Sample sizes (MC: 172; MC + SA: 121) are based on students that answered all four surveys. Pairwise comparisons: related samples Wilcoxon signed-rank test (Z values). p values are reported after Benjamini-Hochberg correction for multiple comparisons.
Section differences in study time and study activities through the semestera
| Study behavior comparison | Before exam 1 | Before exam 2 | Before exam 4 | Before final exam | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| MC ( | MC+SA ( | Mann-Whitney | Mann-Whitney | Mann-Whitney | Mann-Whitney | |||||
| Time | 172 | 121 | 10,381 | 0.973 | 9,949.5 | 0.518 | 11,121 | 0.314 | 10,639.5 | 0.677 |
| Passive | 172 | 121 | 10,249 | 0.824 | 10,101 | 0.665 | 10,708.5 | 0.668 | 11,264.5 | 0.224 |
| Active | 172 | 121 | 11,264.5 | <0.05* | 11,883 | <0.05* | 11,752 | 0.057 | 12,001 | <0.05* |
*Significant at the p < 0.05 level.
aAn independent sample Mann-Whitney U-test was used. p values are reported after Benjamini-Hochberg correction for multiple comparisons.
Figure 2.Student performance on the final exam (mean ± 2 SEM) by (A) exam format and (B) gender. CR all (3 CR questions), MC all (all 90 MC questions), MC low (61 lower-level MC questions), MC high (29 higher-level MC questions).
Summary statistics of GPA and final exam performance
| GPAa | Final exam question category | |||||
|---|---|---|---|---|---|---|
| Exam format and gender | Mean ± SEM | FE total (%) | CR total (%) | MC total (%) | MC lower (%) | MC higher (%) |
| MC (242) | 3.30 ± 0.025 | 63.82 ± 0.68 | 61.97 ± 0.85 | 64.23 ± 0.71 | 66.46 ± 0.74 | 59.54 ± 0.82 |
| MC+SA (164) | 3.20 ± 0.035 | 67.34 ± 0.13 | 67.27 ± 1.00 | 67.35 ± 0.87 | 68.76 ± 0.89 | 64.40 ± 1.05 |
| Male (141) | 3.25 ± 0.039 | 67.10 ± 0.39 | 63.95 ± 1.13 | 67.80 ± 0.93 | 69.19 ± 0.95 | 64.88 ± 1.14 |
| Female (182) | 3.29 ± 0.029 | 64.18 ± 0.78 | 64.34 ± 0.97 | 64.14 ± 0.82 | 66.05 ± 0.86 | 60.14 ± 0.93 |
aGPA (mean ± SEM) is based on smaller sample sizes (MC: 231; MC + SA: 155; male: 129; female: 176) of students who reported their start-of-semester GPA. Final exam performance (mean ± SEM) is reported as percent of total for the different performance categories: FE total (total final exam score), CR total (3 CR questions), MC total (all 90 MC questions), MC lower (61 lower-level MC questions), MC higher (29 higher-level MC questions).
Exam format and gender differences in GPA and final exam performancea
| Final exam question category | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| GPAb | FE total | CR total | MC total | MC lower | MC higher | |||||||
| Exam format and gender | Mann-Whitney | P | Mann-Whitney | Mann-Whitney | Mann-Whitney | Mann-Whitney | Mann-Whitney | |||||
| Exam format | 15,829 | 0.053 | 23,622 | 0.001* | 24,541 | <0.001* | 23,096 | 0.005* | 22,114 | 0.050* | 24,035 | <0.001* |
| Gender | 11,836 | 0.524 | 11,047 | 0.032* | 12,943 | 0.893 | 10,694 | 0.010* | 10,989 | 0.027* | 10,207 | 0.002* |
*Significant at the p < 0.05 level, (*) marginally significant at p = 0.05.
ap values and associated Mann-Whitney U statistics are shown.
bGPA (mean ± SEM) is based on smaller sample sizes (MC: 231; MC+SA: 155; male: 129; female: 176) of students who reported their start-of-semester GPA. An independent sample Mann-Whitney U-test was used (Mann-Whitney U decimals were rounded up from 0.5 to 1.0). p values are reported after Benjamini-Hochberg correction for multiple comparisons.
Summary statistics of GPA and final exam performance by exam format and gendera
| GPA | Final exam performance category | ||||
|---|---|---|---|---|---|
| Exam format by gender | Mean ± SEM | FE total (%) | CR total (%) | MC total (%) | MC/CR ratio |
| MC male (78) | 3.31 ± 0.017 | 65.05 ± 1.16 | 60.54 ± 1.54 | 66.05 ± 1.16 | 1.129 ± 0.02 |
| MC female (117) | 3.34 ± 0.034 | 62.56 ± 1.0 | 62.18 ± 1.22 | 62.64 ± 1.04 | 1.036 ± 0.21 |
| MC + SA male (63) | 3.16 ± 0.066 | 69.63 ± 1.40 | 68.17 ± 1.51 | 69.94 ± 1.4 | 1.037 ± 0.17 |
| MC + SA female (65) | 3.32 ± 0.053 | 64.18 ± 0.78 | 68.23 ± 1.5 | 66.83 ± 1.22 | 1.003 ± 0.025 |
aGPA (mean ± SEM) is based on smaller sample sizes of students who reported their start-of-semester GPA. Final exam performance (mean ± SEM) is reported as percent of total for the different performance categories: FE total (final exam score), CR total (3 CR questions), MC total (all 90 MC questions), MC/CR ratio (MC%/CR% = 1.0 if students do equally well on MC and CR).
Differences in GPA and final exam performance by exam format and gendera
| Final exam performance category | ||||||||
|---|---|---|---|---|---|---|---|---|
| GPA | FE total | CR total | MC total | |||||
| Exam format by gender | Mann-Whitney | Mann-Whitney | Mann-Whitney | Mann-Whitney | ||||
| MC by gender | 4,001 | 0.751 | 4,030 | 0.167 | 48,203 | 0.437 | 3,325 | 0.001* |
| MC + SA by gender | 1,867 | 0.655 | 1,750 | 0.156 | 2,025 | 0.914 | 1,558 | 0.020* |
*Significant at the p < 0.05 level.
aAn independent sample Mann-Whitney U-test was used (Mann-Whitney U decimals were rounded up from 0.5 to 1.0).
Figure 3.Student attitudes toward higher-level thinking skills. Students were asked to rate the statement “I see the value of learning on all learning levels” on a 5-point Likert scale before the first exam and before the final exam. Yes, agree or strongly agree; No, disagree or strongly disagree.
Figure 4.Student evaluations at the end of the semester. The average student evaluation scores from the MC + SA class are shown relative to the MC class (baseline).