| Literature DB >> 22949425 |
Kevin C Haudek1, Luanna B Prevost, Rosa A Moscarella, John Merrill, Mark Urban-Lurain.
Abstract
Students' writing can provide better insight into their thinking than can multiple-choice questions. However, resource constraints often prevent faculty from using writing assessments in large undergraduate science courses. We investigated the use of computer software to analyze student writing and to uncover student ideas about chemistry in an introductory biology course. Students were asked to predict acid-base behavior of biological functional groups and to explain their answers. Student explanations were rated by two independent raters. Responses were also analyzed using SPSS Text Analysis for Surveys and a custom library of science-related terms and lexical categories relevant to the assessment item. These analyses revealed conceptual connections made by students, student difficulties explaining these topics, and the heterogeneity of student ideas. We validated the lexical analysis by correlating student interviews with the lexical analysis. We used discriminant analysis to create classification functions that identified seven key lexical categories that predict expert scoring (interrater reliability with experts = 0.899). This study suggests that computerized lexical analysis may be useful for automatically categorizing large numbers of student open-ended responses. Lexical analysis provides instructors unique insights into student thinking and a whole-class perspective that are difficult to obtain from multiple-choice questions or reading individual responses.Entities:
Mesh:
Year: 2012 PMID: 22949425 PMCID: PMC3433301 DOI: 10.1187/cbe.11-08-0084
Source DB: PubMed Journal: CBE Life Sci Educ ISSN: 1931-7913 Impact factor: 3.325
Demographic data for each semester of the course in which data were collected and for students whom we intervieweda
| Fall 2008 | Spring 2009 | Fall 2010 | Fall 2011 | Interviews | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Mean | SD | Mean | SD | Mean | SD | Mean | SD | Mean | SD | |
| GPA | 2.96 | 0.58 | 3.02 | 0.6 | 2.99 | 0.58 | 3.06 | 0.69 | 3.13 | 0.64 |
| Course grade | 2.25 | 1.23 | 2.27 | 1.12 | 2.28 | 1.13 | 2.29 | 1.15 | 2.63 | 1.06 |
| Credit hours passed | 56.7 | 22.5 | 56 | 24.2 | 58.9 | 23.3 | 56.3 | 24.7 | 61.4 | 23.4 |
| Percent female enrollment | 64.3 | 59.4 | 59.8 | 50.4 | 50 | |||||
| Ethnicity (%) | ||||||||||
| Asian | 4.2 | 5.5 | 4.2 | 6.7 | 0 | |||||
| Black | 3.7 | 4.4 | 8.4 | 4.7 | 0 | |||||
| Hispanic | 3.7 | 2 | 2.5 | 3.6 | 0 | |||||
| White | 82.2 | 80.1 | 80.7 | 79.7 | 100 | |||||
| Major (%) | ||||||||||
| Pre-med/nursing/vet, etc. | 28.5 | 30.8 | 29.6 | 27.9 | 25 | |||||
| Biology/human biology | 15.7 | 21.5 | 21.2 | 22.8 | 12.5 | |||||
| Other natural science | 13.2 | 13.2 | 16.7 | 14.1 | 25 | |||||
| Social sciences | 5.4 | 9.8 | 8.6 | 9.1 | 12.5 | |||||
| Agriculture and animal sciences | 4.5 | 5.2 | 4.7 | 5.6 | 12.5 | |||||
| Engineering | 3.6 | 10.6 | 10.1 | 10.9 | 0 | |||||
| Chemistry and physical sciences | 2 | 1.3 | 3.7 | 2.6 | 12.5 | |||||
aMean and SD are reported for quantitative variables: grade point average (GPA), course grade, and number of credit hours passed prior to enrolling in the course. Percentages are reported for gender, ethnicity, and major.
Distribution of choice selections for question 1a
| Choice | Number | % |
|---|---|---|
| A. Aminoa | 399 | 33.5 |
| B. Hydroxyl | 578 | 48.5 |
| C. Both | 163 | 13.7 |
| D. Neither | 51 | 4.3 |
aCorrect answer.
Figure 1.Percent of responses in a given rubric scoring level in each lexical category. Responses scored as level 1 by both experts (n = 91) are indicated by the green bar, those scored as level 2 (n = 27) by the blue bar, and those scored as level 3 (n = 166) by the red bar. Note that any one response may be in multiple categories and that the categories conjugate and pKa were created during the lexical analysis of all student responses, but did not appear in any of the responses in which the experts agreed on scoring.
Mean and SD of word count and category count of eight students interviews and written explanations
| Written | Interview | |||
|---|---|---|---|---|
| Mean | SD | Mean | SD | |
| Word count | 37.38* | 14.44 | 63.63* | 36.47 |
| Lexical categories counted | 4.88 | 1.89 | 4.88 | 2.03 |
*Significant difference (p < 0.05) between word counts in interviews and written explanations.
Correlation of lexical categories between students’ written and interview explanations
| Lexical category | Kendall's tau-B coefficienta |
|---|---|
| Accept hydrogen | 1.000** |
| Acid | 0.745* |
| Alcohol | N/A |
| Amino | 0.149 |
| Base | 0.467 |
| Cell | N/A |
| Compounds | 0.333 |
| Electron | N/A |
| Guess | N/A |
| Hydrogen | 1.000** |
| Hydroxyl | 0.488 |
| Ionization | 1.000** |
| Lower pH | 1.000** |
| Raise pH | 1.000** |
| Solution | N/A |
| Strong base | 1.000** |
aN/A = either written or interview explanations had no instances of this category.
*p < 0.05.
**p < 0.01.
Number of student responses to question 1b scored at each rubric level for which raters were in complete agreement
| Level 1 | Level 2 | Level 3 | Total |
|---|---|---|---|
| 91 | 27 | 166 | 284 |
Standardized canonical discriminant function coefficients
| Category name | Coefficient |
|---|---|
| Accept hydrogen | 0.604 |
| Acid | −0.433 |
| Amino group | 0.200 |
| Base | 0.799 |
| Hydrogen | −0.326 |
| Hydroxyl | −0.177 |
| Raise pH | 0.228 |
Figure 2.Web diagrams for responses in the eight categories used by the discriminant function. (A) Responses scored as level 1 by experts (n = 91); (B) responses scored as level 3 by experts (n = 166). Node size is proportional to the number of responses in that level contained in the category. A line connecting two nodes represents shared responses between the two categories. Line width reflects the number of responses shared by the two categories. Line type reflects what percentage of the smaller node is shared with the larger node. Solid line, 75–100% shared; dashed line, 50–74% shared; dotted line, 25–49% shared. If fewer than 25% of the responses in the smaller node are not shared with other categories, there is no line representing that connection.