| Literature DB >> 29196427 |
Liz Stanhope1, Laura Ziegler2, Tabassum Haque3, Laura Le4, Marcelo Vinces3, Gregory K Davis5, Andrew Zieffler4, Peter Brodfuehrer5, Marion Preest6, Jason M Belitsky7, Charles Umbanhowar8, Paul J Overvoorde9.
Abstract
Multiple reports highlight the increasingly quantitative nature of biological research and the need to innovate means to ensure that students acquire quantitative skills. We present a tool to support such innovation. The Biological Science Quantitative Reasoning Exam (BioSQuaRE) is an assessment instrument designed to measure the quantitative skills of undergraduate students within a biological context. The instrument was developed by an interdisciplinary team of educators and aligns with skills included in national reports such as BIO2010, Scientific Foundations for Future Physicians, and Vision and Change Undergraduate biology educators also confirmed the importance of items included in the instrument. The current version of the BioSQuaRE was developed through an iterative process using data from students at 12 postsecondary institutions. A psychometric analysis of these data provides multiple lines of evidence for the validity of inferences made using the instrument. Our results suggest that the BioSQuaRE will prove useful to faculty and departments interested in helping students acquire the quantitative competencies they need to successfully pursue biology, and useful to biology students by communicating the importance of quantitative skills. We invite educators to use the BioSQuaRE at their own institutions.Entities:
Mesh:
Year: 2017 PMID: 29196427 PMCID: PMC5749968 DOI: 10.1187/cbe.16-10-0301
Source DB: PubMed Journal: CBE Life Sci Educ ISSN: 1931-7913 Impact factor: 3.325
Instrument blueprint for constructing the Biology Science Quantitative Reasoning Exama
| Content | Students should be able to: | AP Bio | ||||
|---|---|---|---|---|---|---|
| Algebra, functions, and modeling | Carry out basic mathematical computations. (e.g. proportional reasoning, unit conversion, center, and variation) | X | X | X | X | X |
| Recognize and use logarithmic or exponential relationships | X | X | ||||
| Fit a model such as population growth | X | X | ||||
| Use a representation or a model to make predictions | X | X | X | X | X | |
| Describe/infer relationships between variables (scatter plots, regression, network diagrams, maps) | X | X | ||||
| Perform logical/algorithmic reasoning | X | X | X | X | ||
| Statistics and probability | Calculate or use the concept of the likelihood of an event | X | X | X | ||
| Calculate or use conditional probability | X | X | X | |||
| Recognize and interpret what summary statistics represent | X | X | X | |||
| Identify different types of error | X | |||||
| Recognize that biological systems are inherently variable (e.g., stochastic vs. deterministic) | X | X | ||||
| Formulate hypothesis statements | X | X | X | X | ||
| Understand what a | X | X | X | X | ||
| Understand when causal claims can be made (e.g., correlation vs. causation) | X | X | X | X | ||
| Visualization | Choose the appropriate type of graph | X | X | X | X | X |
| Interpret a graph (e.g., functional relationships, logarithmic relationships) | X | X | X | X | X | |
| Be able to use a table | X | X | X | |||
| Use spatial reasoning to interpret multidimensional numerical and visual data (geographic information) | X |
a“X” indicates the content and competencies recommended for biology students in the reports used to guide development of the BioSQuaRE: BIO2010 (NRC, 2003), Vision and Change (AAAS, 2011), SFFP, Scientific Foundations for Future Physicians (AAMC-HHMI, 2009); AP Bio, AP Biology Quantitative Skills (College Board, 2012); NGSS S&E, Next Generation Science Standards Science & Engineering Practices (NGSS Lead States, 2013).
FIGURE 1.Example of changes in a BioSQuaRE item through different administrations. The free-response question in the first administration (version 1) led to the change in coloring and the creation of the selected-response question used in the second administration (version 2). The item stem and number of response choices were further modified in the third and fourth (versions 3 and 4) administrations. This item showed similar psychometric properties in the third, fourth, and fifth administrations.
Item difficulty estimates (B) and standard errors (SE) for the 29 BioSQuaRE items with items grouped by content area and then arranged from easiest to most difficult
| SE | Content | Item | |
|---|---|---|---|
| Algebra, functions, and modeling | |||
| −1.73 | 0.118 | Compute probability from a two-way table | 1 |
| −0.59 | 0.096 | Predicting from a genetic model | 24 |
| −0.47 | 0.095 | Understanding variation in log-transformed measurements | 3 |
| 0.48 | 0.095 | Translating content to tabular summaries | 10 |
| 0.80 | 0.098 | Translating between two graphs of data | 13 |
| 0.84 | 0.098 | Interpreting plots of logarithms | 14 |
| 0.92 | 0.100 | Predicting from a recursive model of population growth | 16 |
| 1.30 | 0.106 | Interpreting plots of logarithms | 15 |
| 1.69 | 0.116 | Graphing a nonlinear function | 25 |
| Statistics and probability | |||
| −1.77 | 0.120 | Understanding variation in measurements | 2 |
| −1.38 | 0.109 | Translating summary statistics to a distribution | 5 |
| −1.35 | 0.108 | Relating sample size to uncertainty | 4 |
| −0.62 | 0.096 | Understanding | 8 |
| −0.47 | 0.095 | Relationship between summary statistics and statistical significance | 23 |
| −0.15 | 0.094 | Translating content to a statistical hypothesis | 6 |
| −0.04 | 0.093 | Understanding relationship between | 9 |
| 1.10 | 0.102 | Understanding | 7 |
| Visualization | |||
| −1.77 | 0.120 | Interpreting relationships between variables from a line plot | 20 |
| −1.05 | 0.102 | Interpreting variation in a heat map | 11 |
| −0.86 | 0.099 | Interpreting relationships between variables from a line plot | 19 |
| −0.61 | 0.096 | Interpreting interaction effects from a plot | 18 |
| −0.55 | 0.096 | Interpreting trend in a heat map | 12 |
| −0.47 | 0.095 | Understanding relationship between data, RQ, and plot | 28 |
| −0.20 | 0.094 | Interpreting variation in a choropleth map | 22 |
| −0.16 | 0.094 | Interpreting interaction effects from a plot | 17 |
| 0.06 | 0.093 | Interpreting trend in a choropleth map | 21 |
| 0.52 | 0.095 | Understanding relationship between data, RQ, and plot | 26 |
| 0.55 | 0.096 | Understanding relationship between data, RQ, and plot | 27 |
| 1.32 | 0.107 | Understanding relationship between data, RQ, and plot | 29 |
Model-level fit of data to the Rasch modela
| Fit measure | Value | Criteria for “good” model fit |
|---|---|---|
| RMSEA [95% CI] | 0.041 [0.034, 0.047] | According to RMSEA ≤ 0.01 indicates excellent fit RMSEA ≤ 0.05 indicates good fit RMSEA ≤ 0.08 indicates mediocre fit |
| SRMR | 0.058 | According to SRMR ≤ 0.05 indicates good fit SRMR ≤ 0.08 indicates acceptable fit |
| SRMSR | 0.075 | According to SRMSR ≤ 0.05 indicates good fit SRMSR ≤ 0.08 indicates acceptable fit |
aRMSEA, root-mean-square error approximation; SRMR and SRMSR, standardized root-mean-square residuals.
Results of the item-level fit analyses with items grouped by contenta
| Algebra, functions, and modeling | Statistics and probability | Visualization | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
| ||||||||||||
| Item | Infit | Outfit | Item | Infit | Outfit | Item | Infit | Outfit | ||||||
| 1 | 0.94 | 0.89 | 8.76 | 0.582 | 2 | 0.92 | 0.86 | 14.29 | 0.149 | 11 | 0.94 | 0.91 | 16.59 | 0.095 |
| 3 | 0.98 | 0.97 | 10.23 | 0.567 | 4 | 0.98 | 0.93 | 8.18 | 0.647 | 12 | 0.94 | 0.92 | 18.20 | 0.065 |
| 10 | 0.91 | 0.87 | 30.70 | 0.005 | 5 | 0.96 | 0.92 | 9.79 | 0.478 | 17 | 1.08 | 1.10 | 2.51 | 0.990 |
| 13 | 1.08 | 1.11 | 14.06 | 0.214 | 6 | 0.94 | 0.92 | 13.49 | 0.224 | 18 | 1.03 | 1.08 | 14.47 | 0.194 |
| 14 | 0.95 | 0.93 | 17.64 | 0.060 | 7 | 0.94 | 0.96 | 15.53 | 0.124 | 19 | 0.97 | 0.93 | 10.44 | 0.453 |
| 15 | 1.00 | 1.00 | 7.34 | 0.741 | 8 | 1.08 | 1.15 | 12.85 | 0.239 | 20 | 1.02 | 1.27 | 5.92 | 0.846 |
| 16 | 0.96 | 0.94 | 16.33 | 0.105 | 9 | 1.08 | 1.09 | 10.58 | 0.448 | 21 | 1.00 | 0.98 | 12.02 | 0.403 |
| 24 | 1.06 | 1.08 | 16.70 | 0.100 | 23 | 0.92 | 0.89 | 17.04 | 0.080 | 22 | 0.93 | 0.90 | 14.59 | 0.204 |
| 25 | 1.04 | 1.19 | 10.27 | 0.408 | 26 | 1.06 | 1.07 | 14.26 | 0.149 | |||||
| 27 | 1.15 | 1.23 | 8.64 | 0.697 | ||||||||||
| 28 | 0.97 | 0.95 | 6.41 | 0.836 | ||||||||||
| 29 | 1.11 | 1.25 | 11.04 | 0.428 | ||||||||||
aThe mean-square infit and outfit statistics were calculated for each item. Values between 0.5 and 1.5 indicate a fit to the Rasch model. The r2 goodness-of-fit values and p values, based on Yen’s (1981) simulation method (using 200 replications), are also shown.
FIGURE 2.Wright map of the 555 respondents’ estimated ability levels (top half) and the estimated difficulty parameters for the 29 BioSQuaRE items sorted by primary content area (bottom half). A vertical line is displayed at the mean (M) ability level.
Summary of forms of validity evidence that have and have not been gathered for the BioSQuaREa
| Source of validity evidence | Question addressed | Methods used, in progress, or proposed |
|---|---|---|
| Content | Does the assessment appropriately represent the specified knowledge domain, biological science quantitative reasoning? | |
| Substantive | Are the thinking processes intended to be used to answer the items the ones that were actually used? | |
| Internal structure | Do the items capture one latent trait, biological science quantitative reasoning? | |
| External structure | Does the construct represented in the BioSQuaRE align with expected external patterns of association? | |
| Generalization | Are the scores derived from the BioSQuaRE meaningful across populations and learning contexts? | |
| Consequences | In what ways might the scores derived from the BioSQuaRE lead to positive or negative consequences? |
aValidation framework is based on Campbell and Nehm (2013, Table 1).